Kerfuffle: a web tool for multi-species gene colocalization analysis

Aboukhalil, R., Fendler, B., Atwal, G. S. (January 2013) Kerfuffle: a web tool for multi-species gene colocalization analysis. BMC Bioinformatics, 14. ISSN 1471-2105

[thumbnail of Paper]
Preview
PDF (Paper)
Atwal BMC Bioinformatics 2013.pdf - Published Version

Download (1MB) | Preview

Abstract

Background: The evolutionary pressures that underlie the large-scale functional organization of the genome are not well understood in eukaryotes. Recent evidence suggests that functionally similar genes may colocalize (cluster) in the eukaryotic genome, suggesting the role of chromatin-level gene regulation in shaping the physical distribution of coordinated genes. However, few of the bioinformatic tools currently available allow for a systematic study of gene colocalization across several, evolutionarily distant species. Furthermore, most tools require the user to input manually curated lists of gene position information, DNA sequence or gene homology relations between species. With the growing number of sequenced genomes, there is a need to provide new comparative genomics tools that can address the analysis of multi-species gene colocalization. Results: Kerfuffle is a web tool designed to help discover, visualize, and quantify the physical organization of genomes by identifying significant gene colocalization and conservation across the assembled genomes of available species (currently up to 47, from humans to worms). Kerfuffle only requires the user to specify a list of human genes and the names of other species of interest. Without further input from the user, the software queries the e!Ensembl BioMart server to obtain positional information and discovers homology relations in all genes and species specified. Using this information, Kerfuffle performs a multi-species clustering analysis, presents downloadable lists of clustered genes, performs Monte Carlo statistical significance calculations, estimates how conserved gene clusters are across species, plots histograms and interactive graphs, allows users to save their queries, and generates a downloadable visualization of the clusters using the Circos software. These analyses may be used to further explore the functional roles of gene clusters by interrogating the enriched molecular pathways associated with each cluster. Conclusions: Kerfuffle is a new, easy-to-use and publicly available tool to aid our understanding of functional genomics and comparative genomics. This software allows for flexibility and quick investigations of a user-defined set of genes, and the results may be saved online for further analysis. Kerfuffle is freely available at http://atwallab.org/kerfuffle, is implemented in JavaScript (using jQuery and jsCharts libraries) and PHP 5.2, runs on an Apache server, and stores data in flat files and an SQLite database.

Item Type: Paper
Uncontrolled Keywords: Genes Clusters Colocalization Conservation Synteny data sets genomes clusters synteny order
Subjects: bioinformatics
Investigative techniques and equipment > comparative genomic hybridization
Investigative techniques and equipment > assays > comparative genomic hybridization
bioinformatics > genomics and proteomics > computers
bioinformatics > genomics and proteomics
bioinformatics > genomics and proteomics > computers > computer software
CSHL Authors:
Communities: CSHL Post Doctoral Fellows
CSHL labs > Atwal lab
School of Biological Sciences > Publications
CSHL Cancer Center Program > Cancer Genetics
Depositing User: Matt Covey
Date: January 2013
Date Deposited: 29 Mar 2013 19:40
Last Modified: 13 Oct 2015 19:23
PMCID: PMC3598493
Related URLs:
URI: https://repository.cshl.edu/id/eprint/28051

Actions (login required)

Administrator's edit/view item Administrator's edit/view item