Earl, D., Bradnam, K., St. John, J., Darling, A., Lin, D., Fass, J., Yu, H. O. K., Buffalo, V., Zerbino, D. R., Diekhans, M., Nguyen, N., Ariyaratne, P. N., Sung, W. K., Ning, Z., Haimel, M., Simpson, J. T., Fonseca, N. A., Birol, I., Docking, T. R., Ho, I. Y., Rokhsar, D. S., Chikhi, R., Lavenier, D., Chapuis, G., Naquin, D., Maillet, N., Schatz, M. C., Kelley, D. R., Phillippy, A. M., Koren, S., Yang, S. P., Wu, W., Chou, W. C., Srivastava, A., Shaw, T. I., Ruby, J. G., Skewes-Cox, P., Betegon, M., Dimon, M. T., Solovyev, V., Seledtsov, I., Kosarev, P., Vorobyev, D., Ramirez-Gonzalez, R., Leggett, R., MacLean, D., Xia, F., Luo, R., Li, Z., Xie, Y., Liu, B., Gnerre, S., MacCallum, I., Przybylski, D., Ribeiro, F. J., Sharpe, T., Hall, G., Kersey, P. J., Durbin, R., Jackman, S. D., Chapman, J. A., Huang, X., DeRisi, J. L., Caccamo, M., Li, Y., Jaffe, D. B., Green, R. E., Haussler, D., Korf, I., Paten, B. (2011) Assemblathon 1: A competitive assessment of de novo short read assembly methods. Genome Research, 21 (12). pp. 2224-2241. ISSN 10889051 (ISSN)
PDF (Paper)
Assemblathon-1-A-competitive-assessment-of-de-novo-short-read-assembly-methods.pdf - Published Version Restricted to Repository staff only Download (1MB) |
Abstract
Low-cost short read sequencing technology has revolutionized genomics, though it is only just becoming practical for the high-quality de novo assembly of a novel large genome. We describe the Assemblathon 1 competition, which aimed to comprehensively assess the state of the art in de novo assembly methods when applied to current sequencing technologies. In a collaborative effort, teams were asked to assemble a simulated Illumina HiSeq data set of an unknown, simulated diploid genome. A total of 41 assemblies from 17 different groups were received. Novel haplotype aware assessments of coverage, contiguity, structure, base calling, and copy number were made. We establish that within this benchmark: (1) It is possible to assemble the genome to a high level of coverage and accuracy, and that (2) large differences exist between the assemblies, suggesting room for further improvements in current methods. The simulated benchmark, including the correct answer, the assemblies, and the code that was used to evaluate the assemblies is now public and freely available from http://www.assemblathon.org/. © 2011 by Cold Spring Harbor Laboratory Press.
Item Type: | Paper |
---|---|
Uncontrolled Keywords: | accuracy computer program copy number variation gene deletion gene structure genetic analysis genome haplotype human indel mutation nonhuman priority journal review sequence analysis single nucleotide polymorphism |
Subjects: | bioinformatics > genomics and proteomics > annotation > sequence annotation bioinformatics > genomics and proteomics > analysis and processing > Sequence Data Processing bioinformatics > genomics and proteomics > Mapping and Rendering > Sequence Rendering |
CSHL Authors: | |
Communities: | CSHL labs > Schatz lab CSHL Cancer Center Program > Cancer Genetics |
Depositing User: | CSHL Librarian |
Date: | 2011 |
Date Deposited: | 16 Mar 2012 14:10 |
Last Modified: | 13 Oct 2015 20:40 |
PMCID: | PMC3227110 |
Related URLs: | |
URI: | https://repository.cshl.edu/id/eprint/25376 |
Actions (login required)
Administrator's edit/view item |