论文部分内容阅读
尿道致病性大肠杆菌UPEC CFT073菌株(uropathogenic Escherichia coli CFT073)于2002年被完全测序并注释。但是,对其基因组的研究还很不完善,首先表现在基因组注释的系统性错误和滞后性。作者运用一系列生物信息学方法和工具,从编码蛋白质基因、编码RNA基因等角度对RefSeq数据库的基因组注释进行了系统的修正和增补,并在此基础上鉴别了一批新的候选致病因子基因。进一步的分析表明,得到的基因组注释对CFT073致病相关的一些重要调控关系和机制能够给出更准确、完整的描述。
Uropathogenic E. coli UPEC CFT073 strain (uropathogenic Escherichia coli CFT073) was completely sequenced and annotated in 2002. However, the study of its genome is still not perfect, first showing the systematic errors and lags of the genome annotation. Using a series of bioinformatics methods and tools, the authors systematically revised and supplemented the genome annotation of the RefSeq database from the perspectives of encoding protein genes and encoding RNA genes. On the basis of this, a number of new candidate pathogenic factors gene. Further analysis showed that the obtained genome annotation could provide more accurate and complete description of some of the important regulatory relationships and mechanisms involved in the pathogenesis of CFT073.