Introducing Wake Vision: A High-Quality, Large-Scale Dataset for TinyML Computer Vision Applications

· Source: The TensorFlow Blog · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Internet of Things (IoT) & Connected Devices · Depth: Intermediate, short

Summary

Harvard University researchers Colby Banbury, Emil Njor, Andrea Mattia Garavagno, and Vijay Janapa Reddi have released Wake Vision, a new large-scale dataset designed to accelerate TinyML research, specifically for person detection on low-power microcontrollers and edge devices. Wake Vision, with approximately 6 million images, is nearly 100 times larger than the previous Visual Wake Words (VWW) dataset and offers two distinct training sets: Wake Vision (Large) for size and Wake Vision (Quality) for label accuracy. The dataset includes fine-grained benchmarks to evaluate model performance across real-world conditions like distance, lighting, depictions, and perceived gender/age, helping identify biases. Models trained with Wake Vision show up to a 6.6% accuracy increase over VWW and an error rate reduction from 7.8% to 2.2% with manual validation. The dataset is accessible via TensorFlow Datasets, Hugging Face Datasets, and Edge AI Labs under a CC-BY 4.0 license.

Key takeaway

For AI Scientists developing TinyML models for person detection, Wake Vision offers a critical resource to overcome data limitations. Your models can achieve significant accuracy gains and reduced error rates by leveraging this large, high-quality dataset and its fine-grained benchmarks. Consider using the combined pre-training and fine-tuning approach with Wake Vision's distinct datasets to optimize performance and identify biases early in your design process.

Key insights

High-quality, large-scale datasets like Wake Vision are crucial for advancing TinyML person detection.

Principles

Method

Pre-training with a large dataset and fine-tuning with a high-quality dataset yields optimal results for TinyML models, especially for person detection.

In practice

Topics

Best for: Computer Vision Engineer, AI Scientist, Research Scientist, AI Researcher, AI Engineer, Machine Learning Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by The TensorFlow Blog.