A generalizable Hi-C foundation model for chromatin architecture, single-cell and multiomics analysis across species

· Source: Machine learning : nature.com subject feeds · Field: Science & Research — Life Sciences & Biology, Artificial Intelligence & Machine Learning, Research Methodology & Innovation · Depth: Expert, extended

Summary

HiCFoundation is a novel foundation model designed for the integrative analysis of three-dimensional (3D) chromatin structure, measured by high-throughput chromosome conformation capture (Hi-C), and associated epigenomic regulation, such as assay for transposase-accessible chromatin using sequencing (ATAC-seq) and chromatin immunoprecipitation followed by sequencing (ChIP-seq). Trained on extensive Hi-C data, this model achieves high performance and generalizability across species for various 3D genome analyses, including reproducibility analysis, resolution enhancement, and loop detection. HiCFoundation can also predict diverse epigenomic activities directly from Hi-C data, elucidating the link between 3D structure and regulatory function. Furthermore, the model adapts easily to single-cell Hi-C data, offering a versatile and interpretable framework for studying the 3D genome and its functional roles across different cell types and species. The code is available on GitHub and Zenodo with an Apache 2.0 license.

Key takeaway

For genomics researchers analyzing complex chromatin structures, you should consider integrating HiCFoundation into your workflow to overcome challenges in combining 3D structure and epigenomic data. This model provides a unified, interpretable framework for multi-species analysis, improving reproducibility, resolution, and loop detection. Its ability to predict epigenomic activities from Hi-C data can accelerate your understanding of regulatory functions. Explore the GitHub repository for implementation.

Key insights

HiCFoundation integrates 3D chromatin structure and epigenomic data across species using a generalizable foundation model.

Principles

Method

HiCFoundation is pre-trained on massive Hi-C data, then fine-tuned for specific tasks like reproducibility analysis, resolution enhancement, loop detection, and epigenomic activity prediction.

In practice

Topics

Code references

Best for: AI Scientist, Research Scientist, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine learning : nature.com subject feeds.