论文部分内容阅读
为了开发丹参表达序列标签微卫星(EST-SSR)功能性分子标记,从公共数据库GenBank中下载获得丹参EST序列10 288条,在剔除低质量和冗余的序列后,得到全长为292.561 kb的无冗余EST序列5 073条。在这些序列中共发掘出628个SSR,分布在528条EST中,出现频率是10.4%,平均长度为14.47 bp,平均分布频率为每0.47 kb分布一个SSR位点,其中含SSR位点的EST长度主要分布于401~600 bp与601~800 bp之间。在检索出的SSR s中,二核苷酸是主要重复类型,占SSR总数的72.45%;其次是三核苷酸,占SSR总数的26.75%。二核苷酸类型(AG)n、(AT)n和(AC)n和三核苷酸类型(AAG)n、(AGC)n基元是SSR的主要重复类型。本研究为丹参EST-SSR标记的开发和进一步应用提供参考。
In order to develop functional molecular markers of Salvia miltiorrhiza expressed sequence tagged microsatellite (EST-SSR), 10 288 ESTs of Salvia miltiorrhiza were obtained from the public database GenBank. After removing low quality and redundant sequences, the full-length cDNA was 292.561 kb There are 5 073 non-redundant EST sequences. A total of 628 SSRs were found among 528 ESTs in these sequences, with a frequency of 10.4% and an average length of 14.47 bp. The average distribution frequency was one SSR locus per 0.47 kb. The length of ESTs containing SSR loci Mainly distributed in 401 ~ 600 bp and 601 ~ 800 bp between. Among the SSRs retrieved, dinucleotides were the major repeat types, accounting for 72.45% of the total SSRs; followed by trinucleotides, accounting for 26.75% of the total SSRs. Dinucleotide types (AG) n, (AT) n and (AC) n and trinucleotide type (AAG) n, (AGC) n motifs are the major repeat types of SSRs. This study provides a reference for the development and further application of Salvia miltiorrhiza EST-SSR marker.