High-throughput RNA sequencing was performed to comprehensively analyze the transcriptome of the geese. A total of 28,803,759 bp of raw sequence data were generated, of which 28,730,361 bp remained after adaptor trimming (99% of sequenced bases). After removal of adaptor sequences, 117,279 reads were obtained, with an average length of 244 bases. Assembly of all reads from both the 454 and Illumina platforms produced 56,839 contigs, with 12,995 reads remaining as singletons. The contigs ranged from 38 to 28,206 bp in size, with an average size of 2,584 bp and an N50 of 4,624 bp. The assembly produced a substantial number of large contigs: 35,545 (62.5%) were longer than 1 kb, of which 8,850 (15.6%) were longer than 5 kb. The sequencing depth was 85× on average. We performed comprehensive functional annotation of the unigenes, including protein sequence similarity, GO term classification, and KEGG pathway enrichment. Gene Ontology (GO) analysis showed that approximately 63% of the contigs had annotation information; among the 36,756 sequences annotated in the Nr database, 24,299 (66.1%) were assigned one or more GO terms. There were 19,757 (81.3%) isotigs for biological process, 10,643 (43.4%) isotigs for cellular component, and 22,818 (93.9%) isotigs for molecular function. KEGG pathway mapping assigned pathway annotations to 8,737 sequences, which took part in 477 pathways. In addition, differentially expressed genes in ovarian tissue between different states of Sichuan White geese were detected in the current study.
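For readers unfamiliar with the assembly statistics quoted above, the following is a minimal sketch of how an N50 and a "fraction of contigs longer than a threshold" are commonly computed from a list of contig lengths. It follows the standard definition of N50 (the length of the contig at which the cumulative length of the contigs, sorted from longest to shortest, first reaches half of the total assembly length); the text does not state which tool or exact procedure produced the reported figures. The function names `n50` and `fraction_longer_than` and the toy lengths are hypothetical and are not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths (standard definition)."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first contig that pushes the cumulative length to at least
        # half of the total assembly length defines the N50.
        if running >= half_total:
            return length
    raise ValueError("no contig lengths given")


def fraction_longer_than(lengths, threshold):
    """Return the fraction of contigs strictly longer than `threshold` bp."""
    return sum(1 for n in lengths if n > threshold) / len(lengths)


# Hypothetical toy lengths in bp, for illustration only (not the study's data).
toy_lengths = [100, 200, 300, 400, 500]
toy_n50 = n50(toy_lengths)
toy_fraction = fraction_longer_than(toy_lengths, 250)
print(toy_n50, toy_fraction)
```

With real data, the lengths would be read from the assembled contig file, and thresholds of 1000 and 5000 would correspond to the 1 kb and 5 kb cut-offs mentioned in the text.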