论文部分内容阅读
水稻基因组序列中蕴藏着生理、遗传、发育、进化和与重要经济性状相关的许多生物学信息.籼稻亚种Oryza Sativa ssp.indica在中国和亚洲其他地方广泛种植.通过全基因组霰弹法,得到了这一亚种的基因组框架序列.共完成430万个成功反应,总读长为2214.9Mb,其中籼稻亚种93-11的成功反应有330万个,总读长1797.4Mb,初步拼接得到了409.76Mb的非冗余序列,大约覆盖了水稻全基因组的95.29%,碱基准确率大于99%.通过与公共数据库中籼稻indica和粳稻japonica亚种的BAC克隆比较,证实了基因组框架图的覆盖率、序列分布的随机性和拼接软件BIG-ASSEMBLER的准确性.在框架图中,鉴定了96.3%的全长cDNA,96.4%的遗传标记(STS,STR,RFLP),94.0%的EST和94.9%的单基因簇(unigene).初步分析表明,该框架序列已经达到了国际同行所认同的标准.框架图的公布无疑为水稻生物学研究提供了基因组学和遗传学方面的基本信息.
Rice genome sequences contain many biological information related to physiology, genetics, development, evolution, and important economic traits. Oryza sativa ssp. Indica is widely planted in China and other parts of Asia through genome-wide shotgun methods The genome sequence of this subspecies completed 4.3 million successful reactions with a total length of 2214.9 Mb, including 3.3 million successes of the indica subspecies 93-11, with a total length of 1797.4 Mb and a preliminary splicing of 409.76 Mb, which covers about 95.29% and 99% of the total genome of rice.Through the comparison with the BAC clones of Indica and Japonica subspecies in public databases, the coverage of the genome frame map was confirmed , Sequence distribution randomness and accuracy of the splicing software BIG-ASSEMBLER, 96.3% of the full-length cDNA, 96.4% of the genetic markers (STS, STR, RFLP), 94.0% (Unigene) .A preliminary analysis showed that the framework sequence has reached the standard agreed by international peers.The publication of the framework map undoubtedly provided genomics and heredity for rice biology research Basic information area.