Computational predictors fail to identify amino acid substitution effects at rheostat positions

Maximilian Miller, Yana Bromberg, Liskin Swint-Kruse

January 2017

Abstract

Many computational approaches exist for predicting the effects of amino acid substitutions. Here, we considered whether the protein sequence position class - rheostat or toggle - affects these predictions. The classes are defined as follows: experimentally evaluated effects of amino acid substitutions at toggle positions are binary, while rheostat positions show progressive changes. For substitutions in the LacI protein, all evaluated methods failed two key expectations: toggle neutrals were incorrectly predicted as more non-neutral than rheostat non-neutrals, while toggle and rheostat neutrals were incorrectly predicted to be different. However, toggle non-neutrals were distinct from rheostat neutrals. Since many toggle positions are conserved, and most rheostats are not, predictors appear to annotate position conservation better than mutational effect. This finding can explain the well-known observation that predictors assign disproportionate weight to conservation, as well as the field’s inability to improve predictor performance. Thus, building reliable predictors requires distinguishing between rheostat and toggle positions.

Type

Journal article

Publication

Scientific Reports

Maximilian Miller

PostDoctoral Associate

improving variant effect predictions, deciphering metagenomes and creating an efficient service infrastructure

Yana Bromberg

Principal Investigator - Professor of Bioinformatics

My research focuses on deciphering the DNA blueprints of life’s molecular machinery