YMS Series #20 – Protein Language Models

May 04, 2026
17:30 EEST
The YMS Series brings together the RSBI community and beyond to engage in discussions on cutting-edge science and research journeys. We especially invite young researchers to attend, offering them a chance to explore diverse research directions, gain inspiration, and discover potential career paths in science.

 

On Monday, May 04, at 17:30 EEST we are delighted to host our guests  Dr. Michael Heinzinger and Jan-Philipp Leusch, who will present latest research in the context of #protein #folding, by modeling local energetic #frustation using #AI.

 

Talk:

Dr. Michael Heinzinger and Jan-Philipp Leusch, "FrustrAI-Seq: Scaling Local Energetic Frustration to the Protein Sequence Space", Helmholtz Zentrum Munich, Munich, Germany

Abstract: Proteins fold into their native three-dimensional (3D) structures
by navigating complex energy landscapes shaped by the biophysical and biochemical properties of their sequence. Once folded, some sequence positions (dubbed residues) remain locally frustrated, reflecting functional constraints incompatible with optimal packing. This local energetic frustration provides important insights into protein function and dynamics, but its analysis typically relies on structure-based energy calculations and remains energetically costly at scale. Here, we introduce an ultra-fast sequence-based prediction of local energetic frustration directly from protein sequences using embeddings from protein language models (pLMs). Our method, coined FrustrAI-Seq, enables proteome-wide frustration profiling in minutes (∼17 minutes for the entire human proteome on a single Nvidia H100 GPU) while retaining biologically relevant performance as shown for the α-globin and β-lactamase family. By eliminating the need for explicit structural or evolutionary information, this approach expands frustration analysis to protein regions and classes that were previously inaccessible, including intrinsically disordered regions and high-throughput de novo designed protein datasets. To support reproducibility and large-scale applications, we provide the largest freely available resource of precomputed local frustration scores to date (∼ 106 proteins), along with model weights and complete training and inference code at: github.com/leuschjanphilipp/FrustrAI-Seq.

Keywords: Local Energetic Frustration, Frustration, Protein Language Model, Frus- tratometer, FrustraEvo

Resources:

 

Format: Online, free to attend, registration required: zoom

Organization: YMS is organized by RSBI and is linked to RoBioinfo Seminars

Chair: Madalina Giurgiu-Kraljic

Click here to subscribe for upcoming #YMS Seminars and #RSBI Events

Check all bioinformatics events on CRB website

Alte evenimente

YMS Series #20 – Protein Language Models

May 04, 2026

YMS Series #19 – Nanomedicine & Liposome Drug Delivery

April 20, 2026

YMS Series #18 – Bacterial WGS analysis on South African isolates

April 06, 2026

YMS Series #17 – Resistome and microbiome-immune interactions

March 16, 2026

RoBioinfo 2026 Conference

Timișoara
13-15 May, 2026