Designing Protein Binders Using the Generative Model Proteina-Complexa

· Source: NVIDIA Technical Blog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Computational Biology · Depth: Intermediate, medium

Summary

NVIDIA has released Proteina-Complexa, a generative model designed for de novo protein binder and enzyme design. This model utilizes a partially latent flow-matching framework, building on the La-Proteina model, to co-design both fully atomistic binder structures (backbone and side-chain) and their corresponding amino acid sequences. It was trained on over 1 million curated experimental and predicted structures from sources like the Protein Data Bank and AlphaFold. Proteina-Complexa integrates inference-time compute scaling with "reasoning" search algorithms to iteratively optimize designs, enhancing computational efficiency and binder quality. The model supports use cases including protein binders for disease-relevant protein targets, binders for small molecule targets in applications like targeted drug delivery, and de novo enzyme design for industrial biocatalysis. Extensive experimental validation, including testing 1 million candidates against 133 distinct protein targets, demonstrated its ability to generate high-affinity binders, even for challenging targets like sugar molecules.

Key takeaway

For AI Engineers and Research Scientists working on protein-based therapeutics or catalysts, Proteina-Complexa offers a unified platform to design high-affinity protein binders and enzymes. You should explore its co-design capabilities and inference-time optimization to generate novel biomolecules, potentially tackling targets previously considered intractable. Consider integrating its command-line interface into your workflows to accelerate experimental testing and discovery.

Key insights

Proteina-Complexa co-designs protein binders and enzymes using a latent flow-matching generative model with inference-time optimization.

Principles

Method

Proteina-Complexa employs a partially latent flow-matching framework to generate atomistic protein structures and sequences simultaneously, then refines designs using search algorithms and reward functions during inference.

In practice

Topics

Code references

Best for: AI Engineer, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by NVIDIA Technical Blog.