Decoding sequence determinants of gene expression in diverse cellular and disease states

· Source: Machine learning : nature.com subject feeds · Field: Science & Research — Artificial Intelligence & Machine Learning, Life Sciences & Biology, Health & Medical Research · Depth: Expert, long

Summary

Decima is a new sequence-to-function model that predicts cell type- and condition-specific gene expression directly from genomic DNA sequence. This model addresses limitations of prior approaches by training on single-cell or single-nucleus RNA sequencing data from over 22 million cells, enabling predictions at a much higher resolution. Decima successfully predicts the cell-type-specific expression of previously unseen genes. The model reveals *cis*-regulatory mechanisms that drive cell-type-specific gene expression and its changes in disease, predicts noncoding-variant effects at cell type resolution, and facilitates the design of context-specific regulatory DNA elements. Its code and model weights are publicly available via GitHub and Zenodo.

Key takeaway

For research scientists investigating gene regulation or disease mechanisms, Decima offers a powerful tool to move beyond bulk analysis. You can use this model to precisely identify *cis*-regulatory elements and predict noncoding variant effects within specific cell types and disease states. Consider integrating Decima into your workflow to design targeted regulatory DNA elements for gene therapy or functional genomics studies, enhancing the specificity of your interventions.

Key insights

Decima predicts cell-type-specific gene expression from DNA sequence, enabling precise regulatory mechanism and variant effect analysis.

Principles

Method

Decima is trained on >22 million single-cell/nucleus RNA-seq datasets to predict gene expression from surrounding DNA sequence, then used for attribution analysis and directed evolution.

In practice

Topics

Code references

Best for: AI Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine learning : nature.com subject feeds.