Accounting for uncertainty in DNA sequencing data

O'Rawe, J. A., Ferson, S., Lyon, G. J. (January 2015) Accounting for uncertainty in DNA sequencing data. Trends Genet, 31 (2). pp. 61-68. ISSN 0168-9525

Abstract

Science is defined in part by an honest exposition of the uncertainties that arise in measurements and propagate through calculations and inferences, so that the reliabilities of its conclusions are made apparent. The recent rapid development of high-throughput DNA sequencing technologies has dramatically increased the number of measurements made at the biochemical and molecular level. These data come from many different DNA-sequencing technologies, each with its own platform-specific errors and biases, which vary widely. Several statistical studies have tried to measure error rates for basic determinations, but there are no general schemes to project these uncertainties so as to assess the surety of the conclusions drawn about genetic, epigenetic, and more general biological questions. We review here the state of uncertainty quantification in DNA sequencing applications, describe sources of error, and propose methods that can be used for accounting for and propagating these errors and their uncertainties through subsequent calculations.

Item Type: Paper
Subjects: Investigative techniques and equipment > assays > next generation sequencing
Investigative techniques and equipment > assays > whole genome sequencing
Communities: CSHL labs > Lyon lab
Stanley Institute for Cognitive Genomics
Depositing User: Matt Covey
Date: 8 January 2015
Date Deposited: 16 Jan 2015 20:59
Last Modified: 06 Nov 2015 20:43
URI: https://repository.cshl.edu/id/eprint/31128
