Talos: Scaling rare disease diagnosis with automated, iterative genomic reanalysis

· Source: Microsoft Research · Field: Health & Wellbeing — Clinical Care & Medical Practice, Medical Devices & Health Technology, Artificial Intelligence & Machine Learning · Depth: Advanced, medium

Summary

Talos is an open-source tool designed for automated, iterative reanalysis of genomic data, specifically targeting rare disease diagnosis. Developed through a collaboration including the Centre for Population Genomics and Microsoft, Talos efficiently re-examines stored sequencing data as scientific knowledge evolves, flagging variants with newly actionable evidence. Across a validation set of nearly 1,100 patients, it recovered 90% of in-scope diagnoses while flagging only 1.3 candidate variants per patient for expert review, demonstrating a low false-positive rate crucial for scalability. When deployed on a prospective cohort of almost 5,000 undiagnosed patients, Talos delivered 241 new diagnoses, representing a 5.1% additional yield. This system reduced the average time between supporting evidence becoming public and diagnosis to just 32 days. Monthly iterative cycles required analysts to review only one new variant per 200 patients, proving systematic reanalysis can be sustained.

Key takeaway

For clinical genomics labs or health systems aiming to improve rare disease diagnostic rates, implementing automated genomic reanalysis like Talos is crucial. You can significantly increase diagnostic yield by continuously re-evaluating patient data against evolving scientific knowledge, reducing the time to diagnosis. Consider deploying open-source solutions in cloud environments to manage the computational load and minimize manual review, making frequent reanalysis sustainable and cost-effective.

Key insights

Talos automates genomic reanalysis for rare diseases, significantly increasing diagnostic yield by utilizing evolving scientific knowledge.

Principles

Method

Talos re-interprets existing variant calls against PanelApp Australia and ClinVar, applying a prioritization algorithm. It refines candidates using family structure and phenotype, then reports only newly actionable variants.

In practice

Topics

Code references

Best for: Research Scientist, AI Scientist, MLOps Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Microsoft Research.