Programmable RNA translation through deep learning-driven IRES discovery and de novo generation

· Source: Nature Machine Intelligence · Field: Science & Research — Life Sciences & Biology, Mathematics & Computational Sciences, Health & Medical Research · Depth: Expert, long

Summary

A new end-to-end artificial intelligence framework addresses the challenge of precisely controlling protein expression in RNA therapeutics by unifying Internal Ribosome Entry Site (IRES) identification, evolutionary optimization, and de novo generation. The framework includes IRES-LM, an ensemble of two language models trained on 46,774 sequences, which improves IRES prediction by 15% in AUC and F1 score over existing methods and accurately identifies all 21 experimentally validated circular RNA IRESs. IRES-EA integrates an evolutionary algorithm with IRES-LM, achieving 98.4% acquired IRES functionality in 12,000 mutated sequences. Additionally, IRES-DM, a diffusion model, de novo generates novel IRES sequences with 99.3% detectable functionality from 12,000 generated sequences, producing diverse designs from natural-like to structurally conserved but sequence-divergent. This comprehensive AI framework offers a robust approach for programmable RNA translation, enhancing the molecular toolkit for biomedical discovery and RNA-based therapeutics.

Key takeaway

For AI Scientists and Research Scientists working on RNA therapeutics, this framework offers a powerful suite of tools to overcome protein expression bottlenecks. You should explore integrating IRES-LM for superior IRES identification, IRES-EA for optimizing existing RNA sequences, and IRES-DM for generating novel, functional IRES elements. This can significantly accelerate the design and development of next-generation RNA-based therapies and synthetic biology applications.

Key insights

An AI framework unifies IRES identification, optimization, and de novo generation for programmable RNA translation.

Principles

Method

The framework uses IRES-LM (language models) for prediction, IRES-EA (evolutionary algorithm) for targeted mutation, and IRES-DM (diffusion model) for de novo sequence generation, validated by massively parallel reporter assays.

In practice

Topics

Code references

Best for: AI Scientist, Research Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Nature Machine Intelligence.