๐Ÿ”ฌ Training Transformers to solve 95% failure rate of Cancer Trials โ€” Ron Alfa & Daniel Bear, Noetik

ยท Source: Latent.Space - Www.latent.space ยท Field: Technology & Digital โ€” Artificial Intelligence & Machine Learning, Life Sciences & Biology, Pharmaceuticals & Biotechnology ยท Depth: Advanced, extended

Summary

Noetik, a company co-founded by Ron Alfa and Dan Baer, is addressing the 90-95% failure rate of cancer drugs in clinical trials by focusing on patient selection rather than drug pharmacology or target selection. Their thesis posits that drugs fail because we are poor at identifying which patients will respond. Noetik generates extensive multimodal patient data, including H&E pathology, protein stains, and spatially resolved RNA (spatial transcriptomics), to train autoregressive transformer models like TARIO-2. These models aim to understand patient biology and identify therapeutically relevant cancer subtypes, enabling the selection of appropriate patient populations for clinical trials and potentially rescuing failed trials. The company emphasizes generating high-quality, curated datasets at scale, similar to ImageNet, to overcome limitations of existing public repositories and traditional drug development methods that rely on non-representative cell lines and animal models.

Key takeaway

For Directors of AI/ML in biopharma seeking to improve clinical trial success rates, Noetik's approach highlights that investing in large-scale, high-quality multimodal patient data generation and advanced transformer models can significantly enhance patient stratification. Your teams should explore integrating similar data-centric AI strategies to move beyond traditional, less predictive biomarkers and identify precise patient cohorts, thereby increasing the probability of drug efficacy and reducing trial failures.

Key insights

Improving cancer drug success requires advanced AI models for precise patient selection based on multimodal biological data.

Principles

Method

Noetik trains autoregressive transformer models on proprietary, large-scale multimodal patient data (H&E, protein stains, spatial transcriptomics) to identify therapeutically relevant cancer subtypes and simulate drug responses for patient stratification.

In practice

Topics

Best for: AI Scientist, Research Scientist, Director of AI/ML

Related on AIssential

Open in AIssential โ†’

Editorial summary, takeaway, and curation by AIssential. Original article published by Latent.Space - Www.latent.space.