CircAST: Full-length Assembly and Quantification of Alternatively Spliced Isoforms in Circular RNAs

Source: Genomics, Proteomics & Bioinformatics (English edition)
Circular RNAs (circRNAs), covalently closed continuous RNA loops, are generated from cognate linear RNAs through back-splicing events, and alternative splicing events may generate different circRNA isoforms at the same locus. However, the challenges of reconstructing and quantifying alternatively spliced full-length circRNAs remain unresolved. On the basis of the internal structural characteristics of circRNAs, we developed CircAST, a tool to assemble alternatively spliced circRNA transcripts and estimate their expression by using multiple splice graphs. Simulation studies showed that CircAST correctly assembled the full sequences of circRNAs with a sensitivity of 85.63%-94.32% and a precision of 81.96%-87.55%. By assigning reads to specific isoforms, CircAST quantified the expression of circRNA isoforms with correlation coefficients of 0.85-0.99 between theoretical and estimated values. We evaluated CircAST on an in-house mouse testis RNA-seq dataset with RNase R treatment for enriching circRNAs and identified 380 circRNAs with full-length sequences different from those of their corresponding cognate linear RNAs. RT-PCR and Sanger sequencing analyses validated 32 out of 37 randomly selected isoforms, thus further indicating the good performance of CircAST, especially for isoforms with low abundance. We also applied CircAST to published experimental data and observed substantial diversity in circular transcripts across samples, thus suggesting that circRNA expression is highly regulated. CircAST can be accessed freely at https://github.com/xiaofengsong/CircAST.
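The sensitivity, precision, and correlation figures quoted above can be illustrated with a minimal Python sketch. It is not CircAST code, and the abstract does not spell out the exact evaluation procedure. The sketch assumes that an assembled isoform counts as correct only when its full exon chain matches a simulated isoform exactly, and that the correlation is a Pearson coefficient. The function names `assembly_metrics` and `pearson`, and the isoform encoding as a tuple of chromosome, strand, and exon coordinates, are hypothetical.

```python
from math import sqrt

# Hypothetical isoform encoding (an assumption for this sketch):
# (chromosome, strand, ((exon_start, exon_end), ...))

def assembly_metrics(true_isoforms, assembled_isoforms):
    """Return (sensitivity, precision) under exact full-length matching."""
    truth = set(true_isoforms)
    assembled = set(assembled_isoforms)
    correct = len(truth & assembled)  # isoforms reconstructed exactly
    sensitivity = correct / len(truth) if truth else 0.0
    precision = correct / len(assembled) if assembled else 0.0
    return sensitivity, precision

def pearson(xs, ys):
    """Pearson correlation between theoretical and estimated abundances."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Toy data, invented for illustration only.
simulated = [
    ("chr1", "+", ((100, 200), (300, 400))),
    ("chr1", "+", ((100, 200), (250, 280), (300, 400))),
    ("chr2", "-", ((50, 150),)),
    ("chr3", "+", ((10, 90), (120, 180))),
]
assembled = [
    ("chr1", "+", ((100, 200), (300, 400))),
    ("chr1", "+", ((100, 200), (250, 280), (300, 400))),
    ("chr2", "-", ((50, 150),)),
    ("chr3", "+", ((10, 90), (130, 180))),   # wrong exon boundary
    ("chr4", "+", ((5, 60),)),               # spurious isoform
]

sensitivity, precision = assembly_metrics(simulated, assembled)
print(sensitivity, precision)
```

In this toy case three of the four simulated isoforms are recovered and three of the five assembled isoforms are correct. An isoform with a single shifted exon boundary counts as both a missed isoform and a false assembly, which is why full-length evaluation is stricter than scoring back-splice junctions alone.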