Sun, Mengyi, Stoltzfus, Arlin, McCandlish, David M (October 2024) A fitness distribution law for amino-acid replacements. bioRxiv. ISSN 2692-8205 (Public Dataset) (Submitted)
Preview |
PDF
10.1101.2024.10.11.617952.pdf - Submitted Version Available under License Creative Commons Attribution. Download (5MB) | Preview |
Abstract
The effect of replacing the amino acid at a given site in a protein is difficult to predict. Yet, evolutionary comparisons have revealed highly regular patterns of interchangeability between pairs of amino acids, and such patterns have proved enormously useful in a range of applications in bioinformatics, evolutionary inference, and protein design. Here we reconcile these apparently contradictory observations using fitness data from over 350,000 experimental amino acid replacements. Almost one-quarter of the 20 × 19 = 380 types of replacements have broad distributions of fitness effects (DFEs) that closely resemble the background DFE for random changes, indicating an overwhelming influence of protein context in determining mutational effects. However, we also observe that the 380 pair-specific DFEs closely follow a maximum entropy distribution, specifically a truncated exponential distribution. The shape of this distribution is determined entirely by its mean, which is equivalent to the chance that a replacement of the given type is fitter than a random replacement. In this type of distribution, modest deviations in the mean correspond to much larger changes in the probability of falling in the far right tail, so that modest differences in mean exchangeability may result in much larger differences in the chance of a highly fit mutation. Indeed, we show that under the assumption that purifying selection filters out the vast majority of mutations, the maximum entropy distributions of fitness effects inferred from deep mutational scanning experiments predict the characteristic patterns of amino acid change observed in molecular evolution. These maximum entropy distributions of mutational effects not only provide a tuneable model for molecular evolution, but also have implications for mutational effect prediction and protein engineering.
Item Type: | Paper |
---|---|
Subjects: | bioinformatics |
CSHL Authors: | |
Communities: | CSHL labs > McCandlish lab |
SWORD Depositor: | CSHL Elements |
Depositing User: | CSHL Elements |
Date: | 15 October 2024 |
Date Deposited: | 04 Nov 2024 14:01 |
Last Modified: | 04 Nov 2024 14:01 |
PMCID: | PMC11507765 |
Related URLs: | |
Dataset ID: |
|
URI: | https://repository.cshl.edu/id/eprint/41727 |
Actions (login required)
Administrator's edit/view item |