Systematic evaluation of spliced alignment programs for RNA-seq data

Engström, P. G., Steijger, T., Sipos, B., Grant, G. R., Kahles, A., Rätsch, G., Goldman, N., Hubbard, T. J., Harrow, J., Guigó, R., Bertone, P., Alioto, T., Behr, J., Bohnert, R., Campagna, D., Davis, C. A., Dobin, A., Gingeras, T. R., Jean, G., Kosarev, P., Li, S., Liu, J., Mason, C. E., Molodtsov, V., Ning, Z., Ponstingl, H., Prins, J. F., Ribeca, P., Seledtsov, I., Solovyev, V., Valle, G., Vitulo, N., Wang, K., Wu, T. D., Zeller, G. (December 2013) Systematic evaluation of spliced alignment programs for RNA-seq data. Nature Methods, 10 (12). pp. 1185-1191. ISSN 15487091 (ISSN)

[thumbnail of Gingeras_NatMet_2014.pdf]
Preview
PDF
Gingeras_NatMet_2014.pdf

Download (1MB) | Preview

Abstract

High-throughput RNA sequencing is an increasingly accessible method for studying gene structure and activity on a genome-wide scale. A critical step in RNA-seq data analysis is the alignment of partial transcript reads to a reference genome sequence. To assess the performance of current mapping software, we invited developers of RNA-seq aligners to process four large human and mouse RNA-seq data sets. In total, we compared 26 mapping protocols based on 11 programs and pipelines and found major performance differences between methods on numerous benchmarks, including alignment yield, basewise accuracy, mismatch and gap placement, exon junction discovery and suitability of alignments for transcript reconstruction. We observed concordant results on real and simulated RNA-seq data, confirming the relevance of the metrics employed. Future developments in RNA-seq alignment methods would benefit from improved placement of multimapped reads, balanced utilization of existing gene annotation and a reduced false discovery rate for splice junctions. © 2013 Nature America, Inc.

Item Type: Paper
Subjects: bioinformatics > genomics and proteomics > analysis and processing > alignment processing
bioinformatics
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification
bioinformatics > genomics and proteomics > genetics & nucleic acid processing
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification > RNA splicing
CSHL Authors:
Communities: CSHL labs > Gingeras lab
CSHL labs > Dobin Lab
Depositing User: Matt Covey
Date: December 2013
Date Deposited: 23 Dec 2013 19:53
Last Modified: 08 Jul 2020 18:14
PMCID: PMC4018468
Related URLs:
URI: https://repository.cshl.edu/id/eprint/29158

Actions (login required)

Administrator's edit/view item Administrator's edit/view item