Mining ChIP-chip data for transcription factor and cofactor binding sites

Smith, A. D., Sumazin, P., Das, D., Zhang, M. Q. (June 2005) Mining ChIP-chip data for transcription factor and cofactor binding sites. Bioinformatics, 21 (Suppl. 1), i403-i412. ISSN 1367-4803 (Print)

URL: http://bioinformatics.oxfordjournals.org/cgi/conte...
DOI: 10.1093/bioinformatics/bti1043

Abstract

MOTIVATION: Identification of single motifs and motif pairs that can be used to predict transcription factor localization in ChIP-chip data, and gene expression in tissue-specific microarray data.

RESULTS: We describe methodology to identify de novo individual and interacting pairs of binding site motifs from ChIP-chip data, using an algorithm that integrates localization data directly into the motif discovery process. We combine matrix-enumeration-based motif discovery with multivariate regression to evaluate candidate motifs and identify motif interactions. When applied to the HNF localization data in liver and pancreatic islets, our methods produce motifs that are either novel or improved versions of known motifs. All motif pairs identified as predictive of localization are further evaluated according to how well they predict expression in liver and islets and according to how conserved the relative positions of their occurrences are. We find that interaction models of HNF1 and CDP motifs provide excellent prediction of both HNF1 localization and gene expression in liver. Our results demonstrate that ChIP-chip data can be used to identify interacting binding site motifs.

AVAILABILITY: Motif discovery programs and analysis tools are available on request from the authors.
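The idea of using regression to evaluate candidate motifs and detect motif interactions can be illustrated with a minimal sketch. This is not the authors' implementation: it uses ordinary least squares as a stand-in for the multivariate regression, the data are synthetic, and all names (`fit_r2`, `score_a`, `score_b`, `signal`) are hypothetical. It compares an additive model of two motif scores against a model that also includes their product as an interaction term.

```python
import numpy as np

def fit_r2(X, y):
    """Ordinary least squares with an intercept; returns (coefficients, R^2)."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    ss_res = float(resid @ resid)
    ss_tot = float(((y - y.mean()) ** 2).sum())
    return coef, 1.0 - ss_res / ss_tot

# Synthetic stand-in data (assumption: one motif score per promoter for each
# of two candidate motifs, and one binding signal per promoter).
rng = np.random.default_rng(0)
n = 500
score_a = rng.random(n)  # hypothetical match score for motif A
score_b = rng.random(n)  # hypothetical match score for motif B
# The simulated signal depends on motif A alone and on A and B together.
signal = 0.5 * score_a + 2.0 * score_a * score_b + rng.normal(0, 0.1, n)

# Additive model: signal ~ A + B
_, r2_additive = fit_r2(np.column_stack([score_a, score_b]), signal)

# Interaction model: signal ~ A + B + A*B
_, r2_interaction = fit_r2(
    np.column_stack([score_a, score_b, score_a * score_b]), signal
)

print(f"additive R^2:    {r2_additive:.3f}")
print(f"interaction R^2: {r2_interaction:.3f}")
```

In a sketch like this, a clear gain in fit from the product term is what would flag a candidate motif pair as interacting. A real analysis would also need to test whether that gain is statistically significant, which is omitted here.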

Item Type: Paper
Uncontrolled Keywords: Algorithms; Amino Acid Motifs; Animals; Binding Sites; Chromatin Immunoprecipitation methods; Computational Biology methods; Gene Expression Regulation; Humans; Liver metabolism; Models, Statistical; Multivariate Analysis; Protein Array Analysis; Protein Structure, Tertiary; Transcription Factors chemistry; matrix enumeration
Subjects: bioinformatics > computational biology
bioinformatics > genomics and proteomics > datasets
CSHL Authors:
Communities: CSHL labs > Zhang lab
Depositing User: CSHL Librarian
Date: June 2005
Date Deposited: 05 Jan 2012 19:58
Last Modified: 05 Jan 2012 19:58
URI: http://repository.cshl.edu/id/eprint/22707
