๐ฌ The Coolest Diffusion Research Isn't in LLMs โ Evan Feinberg & Sergey Edunov, Genesis Molecular AI
Summary
Genesis Molecular AI, co-founded by Evan Feinberg and led by former Llama 2/3 lead Sergey Edunov, is advancing AI in drug discovery, particularly in 3D protein-ligand structure prediction. Their PERL model leverages diffusion networks and physics-based guidance, achieving sub-1 Angstrom accuracy, a crucial resolution for effective drug design. This capability, demonstrated by outperforming other models on challenging dynamic targets like the EVA721A protease in the OpenBind challenge, addresses historically "undruggable" targets and improves existing therapies. Genesis integrates synthetic data generation and physical priors to overcome limited public datasets. The company focuses on small and medium-sized molecule discovery, collaborating with pharmaceutical partners like Incyte, and is developing AI agents to enhance drug hunters' productivity by orchestrating complex tools.
Key takeaway
For AI Scientists and Machine Learning Engineers developing drug discovery models, you should prioritize sub-1 Angstrom accuracy in 3D protein-ligand prediction. This resolution is crucial for enabling effective downstream potency prediction and prospective design, moving beyond traditional 2 Angstrom benchmarks. Focus on integrating physical priors and synthetic data generation, as demonstrated by PERL, to achieve generalizable models that can tackle "undruggable" targets and accelerate therapeutic development.
Key insights
Diffusion models, enhanced with physical priors and synthetic data, are achieving critical resolution in 3D protein-ligand prediction, transforming drug discovery.
Principles
- Diffusion models are superior for 3D molecular generation than GANs.
- Sub-1 Angstrom accuracy is critical for effective drug design.
- AI models benefit from physical priors and synthetic data.
Method
PERL uses a diffusion-based head for iterative structure refinement, guided by physics-based feedback and synthetic data from molecular physics simulations, similar to LLM scaling.
In practice
- Use diffusion models for 3D molecular structure prediction.
- Integrate physics-based guidance for higher accuracy.
- Develop AI agents to orchestrate complex drug discovery tools.
Topics
- Diffusion Models
- Protein-Ligand Prediction
- Drug Discovery AI
- Molecular Co-folding
- AI Agents
- Synthetic Data Generation
Best for: AI Scientist, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Latent.Space - Www.latent.space.