A fitness distribution law for amino-acid replacements

Sun, Mengyi, Stoltzfus, Arlin, McCandlish, David M (October 2024) A fitness distribution law for amino-acid replacements. bioRxiv. ISSN 2692-8205 (Public Dataset) (Submitted)

[thumbnail of 10.1101.2024.10.11.617952.pdf]
Preview
PDF
10.1101.2024.10.11.617952.pdf - Submitted Version
Available under License Creative Commons Attribution.

Download (5MB) | Preview

Abstract

The effect of replacing the amino acid at a given site in a protein is difficult to predict. Yet, evolutionary comparisons have revealed highly regular patterns of interchangeability between pairs of amino acids, and such patterns have proved enormously useful in a range of applications in bioinformatics, evolutionary inference, and protein design. Here we reconcile these apparently contradictory observations using fitness data from over 350,000 experimental amino acid replacements. Almost one-quarter of the 20 × 19 = 380 types of replacements have broad distributions of fitness effects (DFEs) that closely resemble the background DFE for random changes, indicating an overwhelming influence of protein context in determining mutational effects. However, we also observe that the 380 pair-specific DFEs closely follow a maximum entropy distribution, specifically a truncated exponential distribution. The shape of this distribution is determined entirely by its mean, which is equivalent to the chance that a replacement of the given type is fitter than a random replacement. In this type of distribution, modest deviations in the mean correspond to much larger changes in the probability of falling in the far right tail, so that modest differences in mean exchangeability may result in much larger differences in the chance of a highly fit mutation. Indeed, we show that under the assumption that purifying selection filters out the vast majority of mutations, the maximum entropy distributions of fitness effects inferred from deep mutational scanning experiments predict the characteristic patterns of amino acid change observed in molecular evolution. These maximum entropy distributions of mutational effects not only provide a tuneable model for molecular evolution, but also have implications for mutational effect prediction and protein engineering.

Item Type: Paper
Subjects: bioinformatics
CSHL Authors:
Communities: CSHL labs > McCandlish lab
SWORD Depositor: CSHL Elements
Depositing User: CSHL Elements
Date: 15 October 2024
Date Deposited: 04 Nov 2024 14:01
Last Modified: 04 Nov 2024 14:01
PMCID: PMC11507765
Related URLs:
Dataset ID:
  • https://github.com/mengysun/AA exchangeability
URI: https://repository.cshl.edu/id/eprint/41727

Actions (login required)

Administrator's edit/view item Administrator's edit/view item