Genome-wide Protein Function Prediction via Multi-instance Multi-label Active Learning

来源 :第七届全国生物信息学与系统生物学学术大会 | 被引量 : 0次 | 上传用户:hjzc800
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  As the number of sequenced genomes rapidly grows,there are a large number of proteins with unknown function in new genomes.Active learning methods can assist biologists for selecting the most valuable ones as candidates for biological experiments.Previously,it is proved that the protein function prediction task is naturally Multi-Instance Multi-Label (MIML) learning problem.In this paper,we formulate the problem of selecting the most valuable proteins in annotating genome-wide protein functions as a MIML active learning task,Then,we propose a MIML active learning framework named MIMLAL and design two algorithms MIMLAL-A and MIMLAL-R for genome-wide protein function prediction.
其他文献
Long intergenic non-coding RNAs (lincRNAs) may play widespread roles in biological processes,however,a systematic examination of the functions of lincRNAs in the biological responses of rice to phosph
会议
Among the identified thousands of circular RNAs (circRNA) in humans and animals,CDRlas (also known as CiRS-7) was recently demonstrated to act as a powerful miR-7 sponge.Here,we find CDR1as is downreg
Circular RNA are a new class of noncoding RNA in numerous species.They are relatively stable in cytoplasm thank to their loop structure.They are believed to have multiple potential functions,such as m
Acute myocardial infarction (AMI) is a kind of common disease of cardiovascular disease.During the course of AMI morbidity,it will lead to potential complication from the disease,like atrial fibrillat
Various types of mutation and editing (M/E) events in miRNAs can change the stabilities of pre-miRNAs and/or complementarities between miRNAs and their targets.Small RNA (sRNA) high-throughput sequenc
Protein-protein interacts through specific interface residues to execute biological functions.Correctly understanding the mechanisms of interface recognition and predicting the interface residues are
会议
The CRISPR-Cas (clustered regularly interspaced short palindromic repeats-CRISPR-associated proteins) adaptive immune systems are discovered in many bacteria and most archaea.These defence systems are
Enzymatic catalysis mechanism,which is the central to understand the biogenesis and diversity of natural products,would shed bright light on the rational drug design and protein engineering.The biosyn
Modular protein interaction domains form the building blocks of cell signaling pathways.Many of them,known as peptide recognition domains,mediate protein-protein interactions by recognizing short,line
Predicting the biological functions of proteins is one of the key challenges in the post genomic era.Computational models have demonstrated the utility of applying machine learning methods to predict