In this study, the leaf transcriptome of Dendrobium nobile was sequenced on the Illumina HiSeq 4000 platform, yielding about 5.6 Gb of data. After assembly and redundancy removal, 61 998 Unigenes were obtained, with a total length, average length, N50, and GC content of 53 773 338 bp, 867 bp, 1 482 bp, and 43.09%, respectively. The Unigenes were aligned against seven functional databases, and 34 250 (NR: 55.24%), 28 010 (NT: 45.18%), 22 029 (Swissprot: 35.53%), 13 384 (COG: 21.59%), 25 754 (KEGG: 41.54%), 7 731 (GO: 12.47%), and 25 407 (Interpro: 40.98%) Unigenes received functional annotation. In the KEGG database, 2 819, 706, and 559 Unigenes were annotated as potentially involved in carbohydrate metabolism, metabolism of terpenoids and polyketides, and glycan biosynthesis and metabolism, respectively. Based on the annotation results, 34 096 CDS were detected, and ESTScan prediction on the unannotated Unigenes yielded a further 2 108 CDS. A total of 7 165 SSRs were detected across 6 264 Unigenes; among the repeat motif types, AG/CT was the most frequent, followed by AAG/CTT, AT/AT, and AGG/CCT. In addition, 1 234 Unigenes encoding transcription factors were predicted. This transcriptome analysis provides a reference for research on secondary metabolism and transcriptomics in Dendrobium nobile.
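The assembly statistics and annotation rates reported above follow from simple arithmetic on the Unigene set. The sketch below is illustrative only and is not the authors' pipeline: the helper names `n50` and `gc_content` are hypothetical, and the only data used are the counts quoted in the abstract. It shows how N50 and GC content are conventionally defined, and re-derives the per-database annotation percentages and the average Unigene length from the reported totals.

```python
def n50(lengths):
    """Return the N50 of a set of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("no sequence lengths given")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


def gc_content(sequence):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100 * (seq.count("G") + seq.count("C")) / len(seq)


# Counts quoted in the abstract.
TOTAL_UNIGENES = 61998
TOTAL_LENGTH_BP = 53773338
ANNOTATED = {
    "NR": 34250,
    "NT": 28010,
    "Swissprot": 22029,
    "COG": 13384,
    "KEGG": 25754,
    "GO": 7731,
    "Interpro": 25407,
}

# Annotation rate per database, as a percentage of all Unigenes.
annotation_rate = {
    db: round(100 * count / TOTAL_UNIGENES, 2) for db, count in ANNOTATED.items()
}

# Average Unigene length in bp, rounded to the nearest base.
mean_length = round(TOTAL_LENGTH_BP / TOTAL_UNIGENES)

if __name__ == "__main__":
    for db, rate in annotation_rate.items():
        print(f"{db}: {rate:.2f}%")
    print(f"mean Unigene length: {mean_length} bp")
```

Recomputing the rates this way is a quick consistency check on the reported figures. The N50 and GC values themselves cannot be reproduced without the assembled sequences, so `n50` and `gc_content` are shown only as definitions.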