Understanding Long-Range Correlations in DNA-Sequences

Li, W. T., Marr, T. G., Kaneko, K. (August 1994) Understanding Long-Range Correlations in DNA-Sequences. Physica D, 75 (1-3). pp. 392-416. ISSN 0167-2789

Abstract

In this paper, we review the literature on statistical long-range correlation in DNA sequences. We examine the current evidence for these correlations, and conclude that a mixture of many length scales (including some relatively long ones) in DNA sequences is responsible for the observed 1/f-like spectral component. We note the complexity of the correlation structure in DNA sequences. The observed complexity often makes it hard, or impossible, to decompose the sequence into a few statistically stationary regions. We suggest that, based on the complexity of DNA. sequences, a fruitful approach to understand long-range correlation is to model duplication, and other rearrangement processes, in DNA sequences. One model, called ''expansion-modification system'', contains only point duplication and point mutation. Though simplistic, this model is able to generate sequences with 1/f spectra. We emphasize the importance of DNA duplication in its contribution to the observed long-range correlation in DNA sequences.

Item Type: Paper
Uncontrolled Keywords: PROTEIN CODING REGIONS POWER-LAW CORRELATIONS YEAST CHROMOSOME-III SPATIAL 1/F SPECTRA NUCLEOTIDE-SEQUENCE 1/F-ALPHA SPECTRUM MUTUAL INFORMATION SYSTEMS EVOLUTION AUTOMATA
Subjects: bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification
Investigative techniques and equipment > assays > whole genome sequencing
organism description > yeast
CSHL Authors:
Communities: CSHL labs
Depositing User: Matt Covey
Date: August 1994
Date Deposited: 01 May 2015 16:55
Last Modified: 01 May 2015 16:55
URI: https://repository.cshl.edu/id/eprint/31490

Actions (login required)

Administrator's edit/view item Administrator's edit/view item