Modeling RNA-Binding Protein Specificity In Vivo by Precisely Registering Protein-RNA Crosslink Sites

Feng, H., Bao, S., Rahman, M. A., Weyn-Vanhentenryck, S. M., Khan, A., Wong, J., Shah, A., Flynn, E. D., Krainer, A. R., Zhang, C. (June 2019) Modeling RNA-Binding Protein Specificity In Vivo by Precisely Registering Protein-RNA Crosslink Sites. Mol Cell, 74 (6). 1189-1204.e6. ISSN 1097-2765

URL: https://www.ncbi.nlm.nih.gov/pubmed/31226278
DOI: 10.1016/j.molcel.2019.02.002

Abstract

RNA-binding proteins (RBPs) regulate post-transcriptional gene expression by recognizing short and degenerate sequence motifs in their target transcripts, but precisely defining their binding specificity remains challenging. Crosslinking and immunoprecipitation (CLIP) allows for mapping of the exact protein-RNA crosslink sites, which frequently reside at specific positions in RBP motifs at single-nucleotide resolution. Here, we have developed a computational method, named mCross, to jointly model RBP binding specificity while precisely registering the crosslinking position in motif sites. We applied mCross to 112 RBPs using ENCODE eCLIP data and validated the reliability of the discovered motifs by genome-wide analysis of allelic binding sites. Our analyses revealed that the prototypical SR protein SRSF1 recognizes clusters of GGA half-sites in addition to its canonical GGAGGA motif. Therefore, SRSF1 regulates splicing of a much larger repertoire of transcripts than previously appreciated, including HNRNPD and HNRNPDL, which are involved in multivalent protein assemblies and phase separation.

Item Type: Paper
Subjects: bioinformatics
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification
bioinformatics > genomics and proteomics > genetics & nucleic acid processing
bioinformatics > genomics and proteomics
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > protein structure, function, modification
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification > Alternative Splicing
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > protein structure, function, modification > protein types
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > protein structure, function, modification > protein types > RNA binding protein
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > protein structure, function, modification > protein types > splicing factor
CSHL Authors:
Communities: CSHL labs > Krainer lab
CSHL Cancer Center Program > Gene Regulation and Inheritance Program
Depositing User: Matthew Dunn
Date: 20 June 2019
Date Deposited: 29 Jul 2019 13:25
Last Modified: 02 Feb 2024 15:01
PMCID: PMC6676488
Related URLs:
URI: https://repository.cshl.edu/id/eprint/38158

Actions (login required)

Administrator's edit/view item Administrator's edit/view item
CSHL HomeAbout CSHLResearchEducationNews & FeaturesCampus & Public EventsCareersGiving