Talos: Scaling rare disease diagnosis with automated, iterative genomic reanalysis
Summary
Talos is an open-source tool designed for automated, iterative reanalysis of genomic data, specifically targeting rare disease diagnosis. Developed through a collaboration including the Centre for Population Genomics and Microsoft, Talos efficiently re-examines stored sequencing data as scientific knowledge evolves, flagging variants with newly actionable evidence. Across a validation set of nearly 1,100 patients, it recovered 90% of in-scope diagnoses while flagging only 1.3 candidate variants per patient for expert review, demonstrating a low false-positive rate crucial for scalability. When deployed on a prospective cohort of almost 5,000 undiagnosed patients, Talos delivered 241 new diagnoses, representing a 5.1% additional yield. This system reduced the average time between supporting evidence becoming public and diagnosis to just 32 days. Monthly iterative cycles required analysts to review only one new variant per 200 patients, proving systematic reanalysis can be sustained.
Key takeaway
For clinical genomics labs or health systems aiming to improve rare disease diagnostic rates, implementing automated genomic reanalysis like Talos is crucial. You can significantly increase diagnostic yield by continuously re-evaluating patient data against evolving scientific knowledge, reducing the time to diagnosis. Consider deploying open-source solutions in cloud environments to manage the computational load and minimize manual review, making frequent reanalysis sustainable and cost-effective.
Key insights
Talos automates genomic reanalysis for rare diseases, significantly increasing diagnostic yield by utilizing evolving scientific knowledge.
Principles
- Optimize for specificity to reduce human review time.
- Continuously update variant knowledge from public resources.
- Focus on newly actionable evidence for iterative cycles.
Method
Talos re-interprets existing variant calls against PanelApp Australia and ClinVar, applying a prioritization algorithm. It refines candidates using family structure and phenotype, then reports only newly actionable variants.
In practice
- Integrate PanelApp Australia and ClinVar for current data.
- Prioritize variants based on ACMG/AMP criteria.
- Deploy in cloud environments like Azure for scalability.
Topics
- Genomic Reanalysis
- Rare Disease Diagnosis
- Variant Prioritization
- ClinVar
- PanelApp Australia
- Open-Source Tools
Code references
Best for: Research Scientist, AI Scientist, MLOps Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Microsoft Research.