【摘 要】
:
RNA-Seq has become one of the most widely used applications based on next-generation sequencing (NGS) technology.However,raw RNA-Seq data may have various quality issues,which can significantly distor
【机 构】
:
Key Laboratory for Sustainable Development of Marine Fisheries of Ministry of Agriculture,Yellow Sea
【出 处】
:
第七届全国生物信息学与系统生物学学术大会
论文部分内容阅读
RNA-Seq has become one of the most widely used applications based on next-generation sequencing (NGS) technology.However,raw RNA-Seq data may have various quality issues,which can significantly distort analytical results and lead to erroneous conclusions.Therefore,the raw data must be subjected to quality control procedures before downstream analysis.However,the existing QC tools for RNA-Seq data have various limitations like incompletefunctions,long running time and poor usability.Here we report RNA-QC-Chain,a comprehensive,fast and easy-to-use QC pipeline for RNA-Seq data,which involves three steps: (1) sequencing-quality assessment and trimming;(2)internal and external contamination filtering;(3) alignment statistics (such as read number,alignment coverage,sequencing depth and pair-end read mapping statistics).It also has several unique features that are not available in other RNA data QC tools,such as sequence trimming,automatic rRNA filtration and contaminating species identification,The three QC steps can run either sequentially or independently,enabling it a comprehensive package with high flexibility.Moreover,parallel computing is applied in most of the QC procedures,making it runs fast.The performance of RNA-QC-Chain has been evaluated with a RNA-Seq dataset of an algae species Nannochloropsis.Comparison of RNA-QC-Chain with other QC tools showed that it is superior in both function versatility and processing speed.
其他文献
In many informational fields,such as biological,environmental,medical,etc.,lots of sets of data are created every day.A flexible system biology tool enables to analysis the biology metabolize network
Anoikis resistance is a hallmark of cancer,and relates to malignant phenotypes,including cell migration,epithelial-mesenchymal transformation (EMT),metastasis and cancer stem cell maintenance.Anoikis
Identification of driver genes remains a critical challenge in the cancer genomics field.One reason may lie in that functionally important genes are more chance to be mutated in cancer genomics.The ac
p53 shows stimulus-dependent dynamics in the DNA damage response,and the dynamic modes of p53 is linked closely with the decision making between cell fates.On one hand,we developed a network model to
Discussed the visualization of the ECG signal based on the variable value measurement system,and made a comparison between the routine ECG and the ECG scatter point map for the diagnosis of P wave ano
Birds are the most species-rich class of tetrapod vertebrates[1].In all kinds of ecological environment,about 10,000 kinds of bird species distribute in the earth.Because the birds have experienced a
The current epidemic of Zika virus (ZIKV) in the Americas is particularly alarming given its confirmed neuropathological associations,such as fetal microcephaly and Guillain-Barre syndrome.As most of
Motivation: Recent studies have illustrated association between copy number variations (CNVs) and particular tumor types.By the help of different high-throughput technologies,such as array-based compa
Background: Neuropeptides (NPs) play critical roles in synaptic signaling in various systems.NPs act as hormone,modulator,neurotransmitter and cytokine to regulate broad functions.NPs share the common
Due to the extensive complexity and high genetic heterogeneity of genetic alterations in cancer,comprehensively depicting the molecular mechanisms of cancer remains difficult.Characterizing functional