How AI trained on birds is surfacing underwater mysteries
Summary
Google DeepMind's Perch 2.0, a bioacoustics foundation model, demonstrates surprising efficacy in classifying underwater marine vocalizations, despite being trained primarily on bird and terrestrial animal sounds. Released in August 2025, Perch 2.0 was evaluated using a few-shot linear probe on marine tasks, including distinguishing baleen whale species and killer whale subpopulations, against models like Perch 1.0, SurfPerch, and a multispecies whale model. Utilizing datasets such as NOAA PIPAN, ReefSet, and DCLDE, Perch 2.0 consistently performed as the top or second-best model across various sample sizes, outperforming AVES-bird and AVES-bio on most tasks. This transfer learning capability is attributed to the model's large size, extensive training data, and its ability to learn detailed acoustic features from complex bird calls, which generalize well to other bioacoustics challenges.
Key takeaway
For bioacoustics researchers and marine conservationists aiming to classify new or elusive underwater sounds, Perch 2.0 offers a highly efficient transfer learning solution. You can leverage its pre-trained embeddings, even without underwater training data, to rapidly develop custom classifiers with minimal computational resources. Explore the provided Colab demo to implement this agile modeling workflow for your specific marine species monitoring needs, potentially accelerating discovery and conservation efforts.
Key insights
Terrestrial bioacoustics models can effectively transfer learning to underwater marine sound classification tasks.
Principles
- Larger models generalize better across diverse soundscapes.
- Detailed feature learning from complex sounds aids transferability.
Method
Apply a pre-trained bioacoustics model to generate embeddings from audio data, then train a simple logistic regression classifier on these embeddings for custom classification.
In practice
- Use Perch 2.0 for whale vocalization classification.
- Create custom classifiers with few labeled examples.
- Access NOAA data via Google Cloud for marine acoustics.
Topics
- Bioacoustics Foundation Models
- Transfer Learning
- Marine Bioacoustics
- Whale Vocalization
- Perch 2.0
Code references
Best for: AI Researcher, Data Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The latest research from Google.