Statistical features of human exons and their flanking regions

Zhang, M. Q. (May 1998) Statistical features of human exons and their flanking regions. Human Molecular Genetics, 7 (5). pp. 919-32. ISSN 0964-6906 (Print)

URL: http://www.ncbi.nlm.nih.gov/pubmed/9536098
DOI: 10.1093/hmg/7.5.919

Abstract

To facilitate gene finding and for the investigation of human molecular genetics on a genome scale, we present a comprehensive survey on various statistical features of human exons. We first show that human exons with flanking genomic DNA sequences can be classified into 12 mutually exclusive categories. This classification could serve as a standard for future studies so that direct comparisons of results can be made. A database for eight categories (related to human genes in which coding regions are split by introns) was built from GenBank release 87.0 and analyzed by a number of methods to characterize statistical features of these sequences that may serve as controls or regulatory signals for gene expression. The statistical information compiled includes profiles of signals for transcription, splicing and translation, various compositional statistics and size distributions. Further analyses reveal novel correlations and constraints among different splicing features across an internal exon that are consistent with the Exon Definition model. This information is fundamental for a quantitative view of human gene organization, and should be invaluable for individual scientists to design human molecular genetics experiments.

Item Type: Paper
Uncontrolled Keywords: Adenine; Base Composition; Bayes Theorem; Codon; Confidence Intervals; Cytosine; Databases, Factual; Exons/genetics; Guanine; Humans; Introns; Reading Frames; Research Support, U.S. Gov't, P.H.S.; Sequence Analysis, DNA/methods/statistics & numerical data; Thymine
Subjects: bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification > exons
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification > genes, structure and function
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification > introns
Communities: CSHL labs > Zhang lab
Depositing User: Kathleen Darby
Date: May 1998
Date Deposited: 30 Apr 2014 20:14
Last Modified: 30 Apr 2014 20:14
URI: http://repository.cshl.edu/id/eprint/29943
