一种面向大规模序列数据的交互特征并行挖掘算法(96)

来源 :第二届中国计算机学会生物信息学会议 | 被引量 : 0次 | 上传用户:lm403379799
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  序列是一种重要的数据类型,在诸多应用领域广泛存在。基于序列的特征选择具有广阔的现实应用场景。交互特征是指一组整体具有显著强于单独个体与目标相关性的特征集合。从大规模序列中挖掘交互特征面临着位点的“组合爆炸”问题,计算挑战性极大。针对该问题,以生物领域高通量测序数据为背景,提出了一种新的基于并行处理和演化计算的高阶交互特征挖掘算法。位点数是制约互作挖掘效率的根本因素。本研究摈弃了现有方法基于序列分块的并行策略,采用基于位点分块的并行思想,具有天然的效率优势。
其他文献
Simulating multi-scale dynamics of complex living systems is the major challenge in the researches of computational system biology.In this work,we propose a CUDA-based generic multi-cellular biologica
会议
Ribosome stalling is manifested by the local accumulation of ribosomes at specific codon positions of mRNAs.Here,we present ROSE,a deep learning framework to analyze high-throughput ribosome profiling
会议
Docker 应用容器引擎可实现打包生物信息数据流应用程序以及依赖包到一个可移植的容器中,然后部署到任何主流的 Linux 机器上。本实验室利用Docker 技术结合make 搭建面向RNA-Seq、全基因组重测序、Pacbio 三代全长转录组测序等生物信息分析软件工作流程的Docker 容器。产出的大型工作流可以实现RNA-Seq 表达差异分析及GO、KEGG 等相关注释分析,同时能实现对fus
会议
Multi-view classification and feature selection have received considerable attention in recent years.In many real classification problems,the data in each view may have noise.The low-rank regression m
会议
Defining informative features from complex and high dimensional biological data is of great importance in disease study,drug development,etc.Support vector machine-recursive feature elimination(SVM-RF
会议
Practical live-cell super-resolution(SR)techniques are long-desired in many routine biological labs to image biomolecule dynamics.However,the current methods either require sophisticated optical setup
会议
Graph canonization is a fundamental problem both in theoretical and practical computer science.However,it is still an open problem to study in graph theory.In this paper,we propose a new graph canoniz
会议
图聚类算法可以用于发现社会网络中的社区结构、蛋白质互作用网络的功能模块等,是当前复杂网络研究的热点之一。合理度量网络中节点的相似性是设计有效图聚类算法的核心问题。针对此问题,本文提出了一种基于两点间短路径的节点相似性度量方法,并在此基础上给出了一种面向复杂网络的图聚类算法(A Graph Clustering Algorithm Based on Pathsbetween Nodes in Com
会议
Big data,cloud computing and HPC are at the verge of convergence.Cloud computing is already playing an active part in big data processing with the help of big data frameworks like Hadoop and Spark.Rec
会议
The automatic detection of the diabetic retinopathy is of importance,as it is the main cause of irreversible vision lost in the working-age population in the developed world.Early detection of diabeti
会议