GaugeFixer: overcoming parameter non-identifiability in models of sequence-function relationships

Martí-Gómez, Carlos, McCandlish, David Martin, Kinney, Justin Block (December 2025) GaugeFixer: overcoming parameter non-identifiability in models of sequence-function relationships. bioRxiv. ISSN 2692-8205 (Submitted)

[thumbnail of 10.64898.2025.12.08.693054.pdf] PDF
10.64898.2025.12.08.693054.pdf - Submitted Version
Available under License Creative Commons Attribution.

Download (562kB)

Abstract

Mathematical models that describe sequence-function relationships are widely used in computational biology. A key challenge when interpreting these models is that their parameters are not uniquely determined: many different parameter choices can encode the same sequence-function landscape. These ambiguities, known as "gauge freedoms," must be resolved before parameter values can be meaningfully interpreted. Resolving gauge freedoms requires imposing mathematical constraints on parameters that remove these degrees of freedom, a procedure called "fixing the gauge." We recently developed mathematical methods for fixing the gauge of a large class of commonly used models, but the direct computational implementation of these methods is often impractical due to the need for projection matrices whose memory requirements scale quadratically with the number of parameters. Here we introduce GaugeFixer, a Python package that exploits the specific mathematical structure of gauge-fixing projections to achieve linear scaling, thus enabling application to models with millions of parameters. As one application, we analyzed the local structure of peaks in an empirical fitness landscape for translation initiation. GaugeFixer reveals striking similarities, but also fine-scaled variation, in ribosome binding preferences at different positions relative to the start codon, thereby aiding the interpretation of an otherwise unwieldy fitness landscape. GaugeFixer thus fills an unmet need in the computational tools available for biologically interpreting sequence-function relationships. Availability and implementation: GaugeFixer is compatible with Python ≥ 3.10 and can be installed using the pip package. Documentation is provided at http://gaugefixer.readthedocs.io. Source code is available at http://github.com/jbkinney/gaugefixer, as are the scripts used to carry out the analyses presented here.

Item Type: Paper
Subjects: bioinformatics
bioinformatics > quantitative biology
CSHL Authors:
Communities: CSHL labs > Kinney lab
CSHL labs > McCandlish lab
CSHL Post Doctoral Fellows
SWORD Depositor: CSHL Elements
Depositing User: CSHL Elements
Date: 10 December 2025
Date Deposited: 11 Dec 2025 19:15
Last Modified: 11 Dec 2025 19:15
Related URLs:
URI: https://repository.cshl.edu/id/eprint/42037

Actions (login required)

Administrator's edit/view item Administrator's edit/view item