Modeling Genome Coverage in Metagenome

来源 :第七届全国生物信息学与系统生物学学术大会 | 被引量 : 0次 | 上传用户:hurukun
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Metagenome sequencing is a key technology for studying microbiome.A single metagenome sample usually contains millions of short reads from diverse species,with different genome length and abundance.The high similarity between different organism genomes along with the sequence bias will cause uneven coverage in a single genome.We investigated the uneven coverage in mock community data as well as really metagenome data and got some interesting observations.Experiments showed that the coverage bias will have influence on the downstream analysis such as using partial genome coverage to predict the genome length1.A probabilistic method was also developed to model and correct the coverage bias.
其他文献
RNA-binding proteins (RBPs) play critical roles in various biological processes.More and more RBPs are found recently.Identifying RBPs by computational prediction is still challenging.In this study,we
会议
S-palmitoylation is a key regulatory mechanism controlling protein targeting,localization,stability and activity.Since increasing evidence shows that its disruption is implicated in many human disease
Although non-coding DNA sequences do not encode proteins[1,2],more and more studies show that non-coding DNA plays an indispensable role in other aspects[3].In this paper,variant maps[4,5]are applied
Over sufficiently long genomic sequence,strand symmetry is a ubiquitous and explicit phenomenon.Despite being studied over two decades,the exact mechanism involved in strand-symmetry has not yet been
肉苁蓉(学名:Cistanche deserticola Ma)属于肉苁蓉属列当科,素有“沙漠人参”之美誉,具有极高的药用价值。多年来,肉苁蓉的研究多集中在其药用价值的研发、生物活性成分的分离鉴定以及人工栽培等方面,而对其遗传物质及其分子水平的研究鲜有报道。肉苁蓉是多年生专性全寄生性草本植物,专性寄生于藜科小乔木梭梭(Haloxylonammodendron)根部,而梭梭是适于生长在沙漠地区的抗旱
Coding/Non-coding genomic sequences[1]play a central role in modem Bioinformatics and System Biology especially for diagnosis of cancers & diseases base on genomic data sequences acquisition collected
Chor et al found that tetrapods animals (including all mammals) the frequency distribution of k-mer is showing multiple peaks.If the k-mer according to the number it contains CG dinucleotide classific
Bacterial pathogens secret numerous proteins,the effectors,in order to adapt to the new environment or promote virulence by the bacterium-host interactions.The mechanisms of secretion of effectors thr
Data quality and peak alignment efficiency of ChIP-sequencing profiles are directly related to the reliability and reproducibility of NSG experiments.Till now,there is no tool specifically designed fo
会议
SAROTUP (Scanner And Reporter Of Target-Unrelated Peptides) 3.0 is a significant upgrade to the widely used SAROTUP web server for the rapid identification of target-unrelated peptides (TUPs) from bio