Full-length cDNAs are of great value for genomics and proteomics research. With the aim of isolating full-length cDNAs of the indica rice genome, 10828 non-redundant full-length cDNAs were isolated from the elite indica restorer line Minghui 63, of which 780 are new rice expressed sequences. Each full-length cDNA obtained satisfies at least one of the following two conditions: (i) its 5' end sequence contains the complete ORF predicted from a full-length cDNA of the japonica cultivar Nipponbare (9078 cDNAs); (ii) it contains the complete N-terminal coding sequence corresponding to a homologous protein (6543 cDNAs). Among the isolated full-length cDNAs, 53% have a longer 5' untranslated region (5' UTR) than the reported japonica full-length cDNAs; 90.28% (9776) can be mapped to the japonica genome sequence and 92.78% (10046) to the indica genome sequence; 8216 map to the same position on the japonica genome as Nipponbare full-length cDNAs, and the average similarity between indica and japonica cDNA sequences is 99.2%. More than 90% of the full-length cDNAs can be assigned GO (gene ontology) classifications. Of the 780 new cDNAs, more than 60% have no detectable homologous protein sequence.
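
The counts and percentages quoted in the abstract can be cross-checked with a few lines of arithmetic. The following is a minimal sketch in Python and is not part of the original study; the variable and label names are illustrative. The overlap figure is an inference rather than a reported result: it assumes, as the abstract states, that every one of the 10828 cDNAs meets at least one of the two criteria.

```python
# Counts as reported in the abstract; labels are illustrative names.
TOTAL = 10828

reported = {
    "mapped_japonica": 9776,      # mapped to the japonica genome
    "mapped_indica": 10046,       # mapped to the indica genome
    "same_locus": 8216,           # co-located with Nipponbare full-length cDNAs
    "criterion_i": 9078,          # complete ORF predicted from Nipponbare cDNA
    "criterion_ii": 6543,         # complete N-terminal coding sequence
    "new_sequences": 780,         # new rice expressed sequences
}


def percent(count, total=TOTAL):
    """Share of the full set, in percent, rounded to two decimals."""
    return round(100 * count / total, 2)


shares = {label: percent(n) for label, n in reported.items()}

# Inclusion-exclusion: if every cDNA meets criterion (i) or (ii),
# then |i and ii| = |i| + |ii| - |i or ii|.
both_criteria = reported["criterion_i"] + reported["criterion_ii"] - TOTAL

for label, share in shares.items():
    print(f"{label}: {reported[label]} of {TOTAL} = {share}%")
print(f"meeting both criteria (inferred): {both_criteria}")
```

The two mapping percentages recomputed this way agree with the 90.28% and 92.78% given in the text, which confirms that both are expressed relative to the full set of 10828 sequences.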