Expanding and Vetting Sorghum bicolor Gene Annotations through Transcriptome and Methylome Sequencing

Olson, A., Klein, R. R., Dugas, D. V., Lu, Z. Y., Regulski, M., Klein, P. E., Ware, D. (July 2014) Expanding and Vetting Sorghum bicolor Gene Annotations through Transcriptome and Methylome Sequencing. Plant Genome, 7 (2). p. 20. ISSN 1940-3372

Abstract

With the emergence and subsequent advancement of next-generation sequence technology, detailed structural and functional characterization of genomes is readily attainable. Here, we have sampled the Sorghum bicolor methylome by shallow sequencing of HSO3- (bisulfite)-treated DNA and have used these data to identify methylation patterns associated with high confidence gene models. We trained a classifier to predict functional gene models based on expression levels, methylation profiles, and sequence conservation. We have expanded the transcriptome atlas by sequencing RNA from meristematic tissues, florets, and embryos, and utilized this information to develop a more complete annotation of the sorghum transcriptome. Our gene annotations modify 60% of Sbi1.4 (version 1.4 of sorghum gene annotations) gene models. The updated models most often have extended untranslated region (UTR) annotations (18,105), but some show longer protein coding regions (5096) or previously unannotated alternative transcripts (6493). A phylogenetic analysis suggests that 800 genes are missing from annotation Sbi1.4 and 400 gene models are split. The new annotations resolve 50% of split gene models and include 30% of conserved genes missing from the Sbi1.4 annotation. Using our classifier, we identified a large set of 34,276 novel potentially functional transcribed regions. These transcribed regions include protein coding genes, non-coding RNAs, and other classes of gene products.

Item Type: Paper
Uncontrolled Keywords: NOVO DNA METHYLATION GENOME-WIDE ANALYSIS ARABIDOPSIS-THALIANA RNA-SEQ EPIGENETIC MODIFICATIONS RECIPROCAL HYBRIDS EXPRESSION ATLAS RICE RESOLUTION UNCOVERS
Subjects: bioinformatics
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification > DNA methylation
bioinformatics > genomics and proteomics
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > genomes > genome annotation
organism description > plant
CSHL Authors:
Communities: CSHL labs > Ware lab
Depositing User: Matt Covey
Date: July 2014
Date Deposited: 08 Aug 2014 16:23
Last Modified: 08 Aug 2014 16:23
URI: https://repository.cshl.edu/id/eprint/30681

Actions (login required)

Administrator's edit/view item Administrator's edit/view item