Interpreting Cis-Regulatory Interactions from Large-Scale Deep Neural Networks for Genomics

Toneyan, Shushan, Koo, Peter K (July 2023) Interpreting Cis-Regulatory Interactions from Large-Scale Deep Neural Networks for Genomics. bioRxiv. (Submitted)

2023_Toneyan_Interpreting_Cis_Regulatory_Interactions_from.pdf - Submitted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Abstract

The rise of large-scale, sequence-based deep neural networks (DNNs) for predicting gene expression has introduced challenges in their evaluation and interpretation. Current evaluations align DNN predictions with experimental perturbation assays, offering a limited perspective of the DNN’s capabilities within the studied loci. Moreover, existing model explainability tools mainly focus on motif analysis, which becomes complex to interpret for longer sequences. Here we introduce CREME, an in silico perturbation toolkit that interrogates large-scale DNNs to uncover rules of gene regulation that they have learned. Using CREME, we investigate Enformer, a prominent DNN in gene expression prediction, revealing cis-regulatory elements (CREs) that directly enhance or silence target genes. We explore the relationship between CRE distance from transcription start sites and gene expression, as well as the intricate complexity of higher-order CRE interactions. This work advances the ability to translate the powerful predictions of large-scale DNNs to study open questions in gene regulation.

Item Type: Paper
Subjects: bioinformatics > genomics and proteomics > genetics & nucleic acid processing > DNA, RNA structure, function, modification > genes, structure and function > gene regulation
bioinformatics > genomics and proteomics > genetics & nucleic acid processing > genomes
organs, tissues, organelles, cell types and functions > tissues types and functions > neural networks
CSHL Authors:
Communities: CSHL labs > Koo Lab
School of Biological Sciences > Publications
SWORD Depositor: CSHL Elements
Depositing User: CSHL Elements
Date: 3 July 2023
Date Deposited: 22 Sep 2023 13:39
Last Modified: 29 Feb 2024 18:16
PMCID: PMC10349992
Related URLs:
URI: https://repository.cshl.edu/id/eprint/40963