An Evolutionary Approach for Designing Stable and Highly Expressible Low-Immunogenicity Therapeutic mRNA Sequences

· Source: Takara TLDR - Daily AI Papers · Field: Science & Research — Life Sciences & Biology, Mathematics & Computational Sciences, Artificial Intelligence & Machine Learning · Depth: Expert, medium

Summary

A novel two-stage in-silico framework integrates deep learning and evolutionary computation to optimize therapeutic mRNA sequences for stability, expressibility, and low immunogenicity. The first stage employs a pretrained CodonTransformer, a BERT-like Large Language Model, to generate biologically coherent mRNA sequences encoding a target antigen. Subsequently, a genetic algorithm refines these candidates through codon-aware crossover and synonymous mutation, guided by human codon usage preferences. Fitness functions evaluate translation metrics like CAI (achieving 0.73-0.74) and tAI (0.63-0.64), mRNA structural stability (global MFE of -346 to -356 kcal/mol, 84% base-paired), and reduced immunogenicity (average immune penalty lowered to 27.3). This BERT-GA framework improves CAI and tAI by over 6% and achieves a high codon-pair bias of 0.97, along with an unpaired_30 fraction of 0.87 for ribosomal accessibility. Unlike methods that prioritize extreme stability or translation efficiency alone, this approach balances these critical properties.

Key takeaway

For research scientists designing therapeutic mRNA sequences, you should consider integrating deep learning with evolutionary computation to achieve optimal balance across critical properties. Your current methods might over-optimize for single traits like extreme stability or translation efficiency, potentially compromising overall therapeutic efficacy. Adopt a multi-objective optimization approach, like the BERT-GA framework, to simultaneously enhance translation, structural stability, and reduce immunogenicity in your mRNA designs.

Key insights

The BERT-GA framework optimizes therapeutic mRNA sequences for balanced translation efficiency, structural stability, and low immunogenicity.

Principles

Method

A two-stage in-silico framework: CodonTransformer generates initial sequences, then a genetic algorithm evolves them using codon-aware crossover and synonymous mutation, guided by multi-objective fitness functions.

In practice

Topics

Code references

Best for: AI Scientist, Research Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.