Telomere-to-Telomere Assembly Using HERRO-Corrected Simplex Nanopore Reads
Summary
The HERRO (Haplotype-aware ERRor cOrrection) framework, a deep learning-based method, enables the generation of telomere-to-telomere (T2T) phased genome assemblies using only ultra-long Oxford Nanopore Technologies (ONT) Simplex reads. Traditionally, T2T assemblies for diploid and polyploid genomes require combining high-accuracy long reads like PacBio HiFi or ONT Duplex with ultra-long ONT Simplex reads, increasing cost and DNA requirements. HERRO corrects ONT Simplex reads while preserving haplotype differences, achieving up to a 100-fold increase in read accuracy for diploid human genomes. When combined with the Verkko assembler, HERRO reconstructs up to 32 chromosomes telomere-to-telomere, including X and Y, and consistently yields NGA50 values of 100 Mb or higher across multiple human genomes. The framework supports both R9.4.1 and R10.4.1 ONT Simplex reads and is generalizable to other species, demonstrating a path to lower sequencing costs and improved genomic analysis quality.
Key takeaway
For genomics researchers aiming to achieve high-quality telomere-to-telomere genome assemblies, HERRO offers a significant advancement. You can now achieve benchmark-quality results using only ultra-long ONT Simplex reads, potentially reducing the need for multiple sequencing platforms and lowering overall project costs. Consider integrating HERRO into your assembly pipeline to improve read accuracy and achieve more complete, phased genome reconstructions, especially for complex diploid or polyploid genomes.
Key insights
HERRO enables high-quality telomere-to-telomere genome assembly using only error-corrected ONT Simplex reads.
Principles
- Haplotype-aware error correction improves assembly quality.
- Single-platform sequencing can achieve T2T assembly benchmarks.
Method
HERRO uses deep learning to correct ultra-long ONT Simplex reads, preserving haplotype differences, then combines these corrected reads with a de novo assembler like Verkko for T2T assembly.
In practice
- Use HERRO for cost-effective T2T genome assembly.
- Apply HERRO to R9.4.1 and R10.4.1 ONT Simplex data.
Topics
- Telomere-to-Telomere Assembly
- ONT Simplex Reads
- HERRO Framework
- Deep Learning
- Haplotype-aware Correction
Best for: AI Scientist, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine learning : nature.com subject feeds.