论文部分内容阅读
采用新一代高通量测序技术平台Illumina Hiseq 2 000对云南松转录组测序,得到的数据进行de novo组装,获得80 000条Unigenes,N50为1 881 nt、平均890 nt。与公共数据库进行比对,注释到NR、NT、Swiss-Prot数据库的Unigenes分别为43 434、46 415、29 418条。将Unigenes与COG数据库比对,有14 792条Unigenes成功注释,根据功能大致分成25类;与GO数据库比对,有26 743条Unigenes获得注释,按功能分为细胞组分、分子功能和生物过程3大类55亚类,其中参与的生物过程较多;以KEGG数据库参考,有25 873条Unigenes参与128条代谢途径分支,以代谢相关的通路较为集中,并找到与木质素合成关键酶的Unigenes。这些研究极大地扩充了云南松的基因资源,将有助于云南松基因的发掘与利用、分子标记的开发及其种质资源遗传改良的研究等。
Illumina Hiseq 2000, a new generation high-throughput sequencing platform, was used to sequence the transcriptome of P. yunnanensis. The data was de novo assembled and 80 000 Unigenes were obtained. The N50 was 1881 nt with an average of 890 nt. Compared with public databases, Unigenes annotated to NR, NT and Swiss-Prot databases were 43 434,46 415,29 418 respectively. Unigenes was compared with the COG database, with 14,792 Unigenes successful annotations, which were divided into 25 categories according to their functions. Compared with the GO database, 26,743 Unigenes were annotated and divided into cell components, molecular functions and biological processes There were 25,873 Unigenes involved in 128 metabolic pathways, with a concentration of metabolism-related pathways. Unigenes, a key enzyme in the synthesis of lignin, . These studies have greatly expanded the genetic resources of Pinus yunnanensis, which will be helpful for the discovery and utilization of Pinus yunnanensis, the development of molecular markers and the genetic improvement of germplasm resources.