Li, W. T., Marr, T. G., Kaneko, K. (August 1994) Understanding Long-Range Correlations in DNA-Sequences. Physica D, 75 (1-3). pp. 392-416. ISSN 0167-2789
Abstract
In this paper, we review the literature on statistical long-range correlation in DNA sequences. We examine the current evidence for these correlations, and conclude that a mixture of many length scales (including some relatively long ones) in DNA sequences is responsible for the observed 1/f-like spectral component. We note the complexity of the correlation structure in DNA sequences. The observed complexity often makes it hard, or impossible, to decompose the sequence into a few statistically stationary regions. We suggest that, based on the complexity of DNA. sequences, a fruitful approach to understand long-range correlation is to model duplication, and other rearrangement processes, in DNA sequences. One model, called ''expansion-modification system'', contains only point duplication and point mutation. Though simplistic, this model is able to generate sequences with 1/f spectra. We emphasize the importance of DNA duplication in its contribution to the observed long-range correlation in DNA sequences.
Item Type: | Paper |
---|---|
Uncontrolled Keywords: | PROTEIN CODING REGIONS POWER-LAW CORRELATIONS YEAST CHROMOSOME-III SPATIAL 1/F SPECTRA NUCLEOTIDE-SEQUENCE 1/F-ALPHA SPECTRUM MUTUAL INFORMATION SYSTEMS EVOLUTION AUTOMATA |
Subjects: | bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification Investigative techniques and equipment > assays > whole genome sequencing organism description > yeast |
CSHL Authors: | |
Communities: | CSHL labs |
Depositing User: | Matt Covey |
Date: | August 1994 |
Date Deposited: | 01 May 2015 16:55 |
Last Modified: | 01 May 2015 16:55 |
URI: | https://repository.cshl.edu/id/eprint/31490 |
Actions (login required)
Administrator's edit/view item |