funtrp: identifying protein positions for variation driven functional tuning

Abstract

Evaluating the impact of non-synonymous genetic variants is essential for uncovering disease associations and mechanisms of evolution. An in-depth understanding of sequence changes is also fundamental for synthetic protein design and stability assessments. However, the variant effect predictor performance gain observed in recent years has not kept up with the increased complexity of new methods. One likely reason for this might be that most approaches use similar sets of gene and protein features for modeling variant effects, often emphasizing sequence conservation. While high levels of conservation highlight residues essential for protein activity, much of the variation observable in vivo is arguably weaker in its impact, thus requiring evaluation at a higher level of resolution. Here, we describe functionNeutral/Toggle/Rheostatpredictor (funtrp), a novel computational method that categorizes protein positions based on the position-specific expected range of mutational impacts: Neutral (weak/no effects), Rheostat (function-tuning positions), or Toggle (on/off switches). We show that position types do not correlate strongly with familiar protein features such as conservation or protein disorder. We also find that position type distribution varies across different protein functions. Finally, we demonstrate that position types can improve performance of existing variant effect predictors and suggest a way forward for the development of new ones.

Publication
Nucleic Acids Research
Maximilian Miller
Maximilian Miller
PostDoctoral Associate

improving variant effect predictions, deciphering metagenomes and creating an efficient service infrastructure

Daniel Vitale
Daniel Vitale
Student

Building a pipeline for processing of deep mutational scanning datasets (cleaning, normalization, analysis) using python and R

Yana Bromberg
Yana Bromberg
Principal Investigator - Professor of Bioinformatics

My research focuses on deciphering the DNA blueprints of life’s molecular machinery