Gauge fixing for sequence-function relationships

Posfai, Anna, Zhou, Juannan, McCandlish, David M, Kinney, Justin B (May 2024) Gauge fixing for sequence-function relationships. bioRxiv.

[thumbnail of 10.1101.2024.05.12.593772.pdf] PDF
10.1101.2024.05.12.593772.pdf - Submitted Version
Available under License Creative Commons Attribution.

Download (3MB)
URL: https://www.ncbi.nlm.nih.gov/pubmed/38798671
DOI: 10.1101/2024.05.12.593772

Abstract

Quantitative models of sequence-function relationships are ubiquitous in computational biology, e.g., for modeling the DNA binding of transcription factors or the fitness landscapes of proteins. Interpreting these models, however, is complicated by the fact that the values of model parameters can often be changed without affecting model predictions. Before the values of model parameters can be meaningfully interpreted, one must remove these degrees of freedom (called "gauge freedoms" in physics) by imposing additional constraints (a process called "fixing the gauge"). However, strategies for fixing the gauge of sequence-function relationships have received little attention. Here we derive an analytically tractable family of gauges for a large class of sequence-function relationships. These gauges are derived in the context of models with all-order interactions, but an important subset of these gauges can be applied to diverse types of models, including additive models, pairwise-interaction models, and models with higher-order interactions. Many commonly used gauges are special cases of gauges within this family. We demonstrate the utility of this family of gauges by showing how different choices of gauge can be used both to explore complex activity landscapes and to reveal simplified models that are approximately correct within localized regions of sequence space. The results provide practical gauge-fixing strategies and demonstrate the utility of gauge-fixing for model exploration and interpretation.

Item Type: Paper
Subjects: bioinformatics
bioinformatics > genomics and proteomics
bioinformatics > quantitative biology
CSHL Authors:
Communities: CSHL labs > Kinney lab
CSHL labs > McCandlish lab
SWORD Depositor: CSHL Elements
Depositing User: CSHL Elements
Date: 13 May 2024
Date Deposited: 04 Jun 2024 15:00
Last Modified: 04 Jun 2024 15:00
PMCID: PMC11118547
Related URLs:
URI: https://repository.cshl.edu/id/eprint/41577

Actions (login required)

Administrator's edit/view item Administrator's edit/view item
CSHL HomeAbout CSHLResearchEducationNews & FeaturesCampus & Public EventsCareersGiving