A finely annotated dataset for the automated acoustic identification of European Orthoptera and Cicadidae

· Source: Machine learning : nature.com subject feeds · Field: Science & Research — Life Sciences & Biology, Environmental Science & Earth Systems, Artificial Intelligence & Machine Learning · Depth: Advanced, long

Summary

A new dataset, ECOSoundSet, has been released to advance automated acoustic identification of European Orthoptera and Cicadidae, addressing the urgent need for large-scale insect biodiversity monitoring. Published on April 7, 2026, this dataset comprises 11,224 recordings covering 193 orthopteran and 24 cicada species from North, Central, and temperate Western Europe. It integrates both weakly labeled recordings, indicating species presence, and strongly labeled recordings, which precisely specify the time and frequency range of each insect sound. This resource, deposited in a Zenodo repository (DOI: 10.5281/zenodo.15043892), complements existing online biodiversity data and supports the development of robust algorithms for insect sound classification, crucial for tracking widespread declines in insect populations.

Key takeaway

For AI Scientists and Machine Learning Engineers developing biodiversity monitoring tools, this new ECOSoundSet dataset offers a critical resource. Its combination of weak and strong acoustic labels for 217 European insect species can significantly improve the robustness and accuracy of your automated recognition algorithms. You should integrate this dataset into your model training and validation workflows to enhance species identification capabilities, thereby contributing to more effective large-scale insect population monitoring efforts.

Key insights

A new dataset combines weak and strong acoustic labels for automated European insect identification.

Principles

Method

The dataset combines coarsely labeled recordings (species presence inferred) with finely annotated recordings (time and frequency range specified) to support automated acoustic classification.

In practice

Topics

Code references

Best for: AI Scientist, Machine Learning Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine learning : nature.com subject feeds.