Big Data: Astronomical or Genomical?

Stephens, Z. D., Lee, S. Y., Faghri, F., Campbell, R. H., Zhai, C., Efron, M. J., Iyer, R., Schatz, M. C., Sinha, S., Robinson, G. E. (July 2015) Big Data: Astronomical or Genomical? PLoS Biol, 13 (7). e1002195. ISSN 1545-7885 (Electronic)1544-9173 (Linking)

[thumbnail of Paper]
Preview
PDF (Paper)
Schatz PLoS Biology 2015.pdf - Published Version

Download (514kB) | Preview
URL: http://www.ncbi.nlm.nih.gov/pubmed/26151137
DOI: 10.1371/journal.pbio.1002195

Abstract

Genomics is a Big Data science and is going to get much bigger, very soon, but it is not known whether the needs of genomics will exceed other Big Data domains. Projecting to the year 2025, we compared genomics with three other major generators of Big Data: astronomy, YouTube, and Twitter. Our estimates show that genomics is a "four-headed beast"-it is either on par with or the most demanding of the domains analyzed here in terms of data acquisition, storage, distribution, and analysis. We discuss aspects of new technologies that will need to be developed to rise up and meet the computational challenges that genomics poses for the near future. Now is the time for concerted, community-wide planning for the "genomical" challenges of the next decade.

Item Type: Paper
Subjects: bioinformatics
bioinformatics > computational biology
bioinformatics > genomics and proteomics > datasets
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > genomes
Investigative techniques and equipment > assays > whole genome sequencing
CSHL Authors:
Communities: CSHL labs > Schatz lab
CSHL Cancer Center Program > Cancer Genetics
Depositing User: Matt Covey
Date: 7 July 2015
Date Deposited: 10 Jul 2015 19:20
Last Modified: 14 Oct 2015 20:46
PMCID: PMC4494865
Related URLs:
URI: https://repository.cshl.edu/id/eprint/31615

Actions (login required)

Administrator's edit/view item Administrator's edit/view item
CSHL HomeAbout CSHLResearchEducationNews & FeaturesCampus & Public EventsCareersGiving