Mapillary Training Datasets - AI at Meta
Summary
Mapillary offers a suite of training datasets derived from over two billion street-level images contributed globally via smartphones and action cameras. These datasets support the development of recognition models by providing diverse benchmarking data, including sequences, depth data, outdoor imagery, city-level imagery, and semantic segmentations. Key datasets include Vistas, CrowdDriven, Metropolis, Planet-Scale Depth, Street-Level Sequences, and Traffic Sign. Established in 2013 and acquired by Facebook (now Meta) in 2020, Mapillary continues to provide public access to street-level imagery and map data, leveraging computer vision and machine learning to extract map features for various mapping projects.
Key takeaway
For research scientists developing computer vision models, Mapillary's extensive and diverse street-level datasets offer a critical resource for training and benchmarking. You should explore specific datasets like Vistas or Planet-Scale Depth to enhance model robustness and accuracy, particularly for urban mapping, autonomous driving, or environmental analysis applications.
Key insights
Mapillary provides diverse, global street-level image datasets for training computer vision models.
Principles
- Crowdsourcing enhances data diversity
- Computer vision extracts map features
Method
Mapillary collects street-level images from a global network of contributors, then uses computer vision to extract map data and create diverse training datasets for recognition models.
In practice
- Train recognition models on street-level data
- Utilize semantic segmentations for mapping
- Access depth data for 3D applications
Topics
- Mapillary
- Training Datasets
- Street-Level Imagery
- Computer Vision
- Semantic Segmentation
Best for: Research Scientist, Computer Vision Engineer, Machine Learning Engineer, AI Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by ai.meta.com via Google News.