The 35547 melon ESTs in GenBank were processed to remove redundancy, yielding 34438 non-redundant ESTs with a total length of 250.3 Mb. These sequences contained 2813 microsatellites (simple sequence repeats, SSRs) distributed across 2107 ESTs, corresponding to a frequency of 8.16% and an average of one SSR per 8.90 kb. Trinucleotide repeats were the dominant type, accounting for 47.14% of all SSRs, followed by dinucleotide and mononucleotide repeats at 20.72% and 16.99%, respectively. AAG/TTC was the predominant repeat motif, accounting for 29.26% of all microsatellites, while AG/CT and A/T accounted for 14.61% and 16.25%, respectively. Of all SSRs, 70.32% had 4 to 10 repeat units and 51.12% were 12 to 20 bp long. The polymorphism potential of these SSRs was also evaluated.
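The abstract does not say which tool or thresholds were used to detect the SSRs, so the following is only a minimal sketch of how such a survey could be reproduced. The minimum repeat counts in `MIN_REPEATS`, the function names, and the toy sequences are all assumptions made for illustration. Motifs are grouped with their rotations and reverse complements, which is how labels such as AAG/TTC and AG/CT are usually formed, and frequency is taken here as SSRs per 100 ESTs.

```python
import re
from collections import Counter

# ASSUMED thresholds (motif length -> minimum repeat count); not from the paper.
MIN_REPEATS = {1: 12, 2: 6, 3: 4, 4: 3, 5: 3, 6: 3}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical_motif(motif):
    """Smallest string among all rotations of the motif and of its reverse complement."""
    rc = motif.translate(_COMPLEMENT)[::-1]
    rotations = [m[i:] + m[:i] for m in (motif, rc) for i in range(len(m))]
    return min(rotations)


def is_reducible(motif):
    """True if the motif is itself a repeat of a shorter unit (e.g. 'AGAG' = 'AG' x 2)."""
    n = len(motif)
    return any(n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n))


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in one sequence."""
    seq = seq.upper()
    hits = []
    for k, min_rep in MIN_REPEATS.items():
        # A k-base unit followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if is_reducible(motif):
                continue  # already reported under its shorter unit
            hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)


def summarize(ests):
    """Summary statistics for a dict mapping EST name -> sequence."""
    per_est = {name: find_ssrs(seq) for name, seq in ests.items()}
    n_ssr = sum(len(h) for h in per_est.values())
    total_bp = sum(len(s) for s in ests.values())
    return {
        "ssr_count": n_ssr,
        "ests_with_ssr": sum(1 for h in per_est.values() if h),
        "frequency_pct": 100.0 * n_ssr / len(ests),
        "avg_distance_kb": total_bp / n_ssr / 1000 if n_ssr else None,
        "motif_counts": Counter(
            canonical_motif(motif) for h in per_est.values() for _, motif, _ in h
        ),
    }


if __name__ == "__main__":
    # Toy sequences, invented for illustration only.
    toy_ests = {
        "est1": "ACGT" + "AAG" * 5 + "CGTACG",
        "est2": "TTGCA" + "CT" * 7 + "GGA",
        "est3": "ACGTACGGTCA",
    }
    print(summarize(toy_ests))
```

With these definitions, 2813 SSRs in 34438 ESTs give a frequency of roughly 8.17%, close to the reported 8.16%. An average spacing of 8.90 kb over 2813 SSRs corresponds to about 25.0 Mb of sequence, so the 250.3 Mb total quoted above may be worth checking against the original paper.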