Characterizing the state of the art in the computational assignment of gene function: lessons from the first critical assessment of functional annotation (CAFA)

Gillis, J., Pavlidis, P. (2013) Characterizing the state of the art in the computational assignment of gene function: lessons from the first critical assessment of functional annotation (CAFA). BMC Bioinformatics, 14 (Suppl). S15. ISSN 1471-2105 (Electronic)1471-2105 (Linking)

[thumbnail of Paper]
Preview
PDF (Paper)
Gillis BMC Bioinformatics 2013.pdf - Published Version

Download (590kB) | Preview
URL: http://www.ncbi.nlm.nih.gov/pubmed/23630983
DOI: 10.1186/1471-2105-14-S3-S15

Abstract

The assignment of gene function remains a difficult but important task in computational biology. The establishment of the first Critical Assessment of Functional Annotation (CAFA) was aimed at increasing progress in the field. We present an independent analysis of the results of CAFA, aimed at identifying challenges in assessment and at understanding trends in prediction performance. We found that well-accepted methods based on sequence similarity (i.e., BLAST) have a dominant effect. Many of the most informative predictions turned out to be either recovering existing knowledge about sequence similarity or were "post-dictions" already documented in the literature. These results indicate that deep challenges remain in even defining the task of function assignment, with a particular difficulty posed by the problem of defining function in a way that is not dependent on either flawed gold standards or the input data itself. In particular, we suggest that using the Gene Ontology (or other similar systematizations of function) as a gold standard is unlikely to be the way forward.

Item Type: Paper
Additional Information:
Uncontrolled Keywords: critical assessment of functional annotation (CAFA)
Subjects: bioinformatics
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification
bioinformatics > genomics and proteomics > genetics & nucleic acid processing
bioinformatics > genomics and proteomics
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification > genes, structure and function
CSHL Authors:
Communities: CSHL labs > Gillis Lab
Depositing User: Matt Covey
Date: 2013
Date Deposited: 23 May 2013 15:03
Last Modified: 02 Aug 2013 14:04
PMCID: PMC3633048
Related URLs:
URI: https://repository.cshl.edu/id/eprint/28324

Actions (login required)

Administrator's edit/view item Administrator's edit/view item
CSHL HomeAbout CSHLResearchEducationNews & FeaturesCampus & Public EventsCareersGiving