【Background】 Streptomyces sampsonii KJ40 is an actinomycete with multiple disease-preventing and growth-promoting functions and has potential as a biopesticide. No whole-genome sequence of S. sampsonii has been reported so far, which limits research on its functional genes, metabolite biosynthetic pathways and comparative genomics. 【Objective】 To resolve the genome sequence of S. sampsonii KJ40 in order to study the strain's disease-prevention and growth-promotion mechanisms in depth and to mine its secondary metabolite gene resources. 【Method】 The whole genome of strain KJ40 was sequenced on the Illumina HiSeq high-throughput sequencing platform. Software tools were then used for genome assembly, gene prediction and functional annotation, prediction of secondary metabolite biosynthetic gene clusters, and collinearity analysis. 【Result】 The final assembly comprised 9 scaffolds and 578 contigs with a total length of 7 261 502 bp and an average G+C content of 73.41%. A total of 6 605 genes, 1 260 tandem repeats, 804 minisatellites, 67 microsatellites, 90 tRNAs, 9 rRNAs and 19 sRNAs were predicted. Of the predicted genes, 2 429, 3 765, 2 890, 6 063 and 1 911 were annotated in the COG, GO, KEGG, NR and Swiss-Prot databases, respectively. In addition, 21 secondary metabolite biosynthetic gene clusters were predicted. The genome sequencing data were submitted to NCBI under GenBank accession number LORI00000000. Comparison of the genomes of S. sampsonii KJ40, Streptomyces coelicolor A3(2) and Streptomyces griseus subsp. griseus NBRC 13350 revealed genome rearrangements such as inversions and translocations, and the three genomes shared 1 711 protein clusters. 【Conclusion】 This study provides basic data for explaining, at the genome level, why strain KJ40 is effective at promoting growth and preventing disease, offers reference information for understanding secondary metabolite biosynthetic pathways in Streptomyces, and is of significance for subsequent research on S. sampsonii.
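As an illustration of how assembly summary figures such as total length and G+C content can be derived from scaffold sequences, the following is a minimal Python sketch. It is not the authors' pipeline: the abstract does not name the software used, and the function names `parse_fasta` and `assembly_stats` as well as the toy sequences are assumptions made only for this example.

```python
from typing import Dict, Iterable, List, Tuple


def parse_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA-formatted lines into {record_id: uppercase sequence}."""
    records: Dict[str, List[str]] = {}
    current = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited token of the header.
            current = line[1:].split()[0]
            records[current] = []
        elif current is not None:
            records[current].append(line.upper())
    return {name: "".join(parts) for name, parts in records.items()}


def assembly_stats(records: Dict[str, str]) -> Tuple[int, float]:
    """Return (total length in bp, G+C percentage over unambiguous A/C/G/T bases)."""
    total = sum(len(seq) for seq in records.values())
    gc = sum(seq.count("G") + seq.count("C") for seq in records.values())
    acgt = sum(seq.count(base) for seq in records.values() for base in "ACGT")
    gc_percent = 100.0 * gc / acgt if acgt else 0.0
    return total, gc_percent


# Toy input for demonstration only; these are not KJ40 sequences.
toy = [">scaffold1 toy", "GGCCGGCCAT", ">scaffold2 toy", "GCGCNNAT"]
total_bp, gc_pct = assembly_stats(parse_fasta(toy))
print(total_bp, round(gc_pct, 2))
```

Ambiguous bases (N) count toward the total length but are excluded from the G+C denominator here. That is one common convention, and other tools may compute the percentage differently.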