Integration of alternative fragmentation techniques into standard LC-MS workflows using a single deep learning model enhances proteome coverage
Summary
A new integrated mass spectrometry platform and a unified deep learning model, Prosit_2025_intensity_MultiFrag, significantly enhance proteome coverage by incorporating alternative fragmentation techniques beyond traditional collision-induced dissociation (CID). Researchers developed an Orbitrap-Omnitrap hybrid instrument capable of automated collision-, electron- (ECD, EID), and photon-based (UVPD) fragmentation. They generated extensive multi-enzyme datasets to train the Prosit model, which predicts spectra across all these dissociation methods. Integrated into FragPipe's MSBooster module, this publicly available model increased protein identifications by over 10% on average for both data-dependent (DDA) and data-independent acquisition (DIA) across all techniques. The study demonstrates that electron-induced and ultraviolet photodissociation, which yield richer, more informative spectra, achieve identification efficiency competitive with CID while providing superior sequence coverage, establishing a framework for routine application of advanced fragmentation in proteomics.
Key takeaway
For AI Scientists developing proteomics pipelines, you should integrate the Prosit_2025_intensity_MultiFrag model into your workflows, particularly for analyzing data from electron-induced and ultraviolet photodissociation. This model, available via Koina and integrated into FragPipe's MSBooster, significantly boosts protein identification rates and sequence coverage, especially for challenging proteoforms. Adopting these advanced fragmentation techniques and the associated deep learning tools will enable more comprehensive and confident proteome characterization, moving beyond the limitations of traditional CID methods.
Key insights
A unified deep learning model and integrated MS platform enhance proteome coverage by leveraging diverse fragmentation techniques.
Principles
- Alternative fragmentation methods yield richer spectra.
- Deep learning can unify diverse spectral prediction.
- Rescoring improves identification accuracy and recovery.
Method
An Orbitrap-Omnitrap hybrid instrument was developed for automated multi-modal fragmentation. Large-scale multi-enzyme LC-MS data trained a unified Prosit deep learning model for pan-fragmentation spectrum prediction, integrated into FragPipe's MSBooster for rescoring.
In practice
- Utilize Prosit_2025_intensity_MultiFrag for enhanced peptide ID.
- Employ alternative fragmentation (UVPD, ExD) for complex proteoforms.
- Integrate multi-enzyme digests for broader proteome coverage.
Topics
- Deep Learning Proteomics
- Mass Spectrometry Fragmentation
- Prosit Model
- Proteome Coverage Enhancement
- Peptide Spectrum Prediction
Code references
Best for: AI Scientist, AI Researcher, Machine Learning Engineer, Research Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Machine learning : nature.com subject feeds.