๐ฌ Training Transformers to solve 95% failure rate of Cancer Trials โ Ron Alfa & Daniel Bear, Noetik
Summary
Noetik, a company co-founded by Ron Alfa and Dan Baer, is addressing the 90-95% failure rate of cancer drugs in clinical trials by focusing on patient selection rather than drug pharmacology or target selection. Their thesis posits that drugs fail because we are poor at identifying which patients will respond. Noetik generates extensive multimodal patient data, including H&E pathology, protein stains, and spatially resolved RNA (spatial transcriptomics), to train autoregressive transformer models like TARIO-2. These models aim to understand patient biology and identify therapeutically relevant cancer subtypes, enabling the selection of appropriate patient populations for clinical trials and potentially rescuing failed trials. The company emphasizes generating high-quality, curated datasets at scale, similar to ImageNet, to overcome limitations of existing public repositories and traditional drug development methods that rely on non-representative cell lines and animal models.
Key takeaway
For Directors of AI/ML in biopharma seeking to improve clinical trial success rates, Noetik's approach highlights that investing in large-scale, high-quality multimodal patient data generation and advanced transformer models can significantly enhance patient stratification. Your teams should explore integrating similar data-centric AI strategies to move beyond traditional, less predictive biomarkers and identify precise patient cohorts, thereby increasing the probability of drug efficacy and reducing trial failures.
Key insights
Improving cancer drug success requires advanced AI models for precise patient selection based on multimodal biological data.
Principles
- Drug failure often stems from poor patient selection.
- Multimodal patient data is crucial for understanding disease subtypes.
- Data scale and quality are paramount for effective AI in biology.
Method
Noetik trains autoregressive transformer models on proprietary, large-scale multimodal patient data (H&E, protein stains, spatial transcriptomics) to identify therapeutically relevant cancer subtypes and simulate drug responses for patient stratification.
In practice
- Use H&E images for diagnostic prediction of drug response.
- Simulate gene knockdowns to predict drug efficacy.
- Leverage spatial transcriptomics for detailed molecular insights.
Topics
- Cancer Clinical Trials
- Patient Selection
- Multimodal Biomedical Data
- Autoregressive Transformers
- Spatial Transcriptomics
Best for: AI Scientist, Research Scientist, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Latent.Space - Www.latent.space.