Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping

· Source: Machine learning : nature.com subject feeds · Field: Agriculture & Food Systems — Artificial Intelligence & Machine Learning, Precision Agriculture & Smart Farming, Crop Science & Plant Technology · Depth: Advanced, medium

Summary

The Tomato Multi-Angle Multi-Pose (TomatoMAP) dataset is introduced to enhance the accuracy and reproducibility of fine-grained plant phenotyping for Solanum lycopersicum. This comprehensive dataset comprises 68,080 RGB images, including 3,616 high-resolution macrophotographs (3648 × 5472) with semantic annotations and 64,464 moderate-resolution images (1080 × 1440) captured from 12 plant poses at four camera elevations. Each image features manually annotated bounding boxes for seven regions of interest (leaves, panicle, flower clusters, fruit clusters, axillary shoot, shoot, and whole-plant area) and labels for 50 BBCH growth stages. The dataset supports real-time applications, with models like MobileNetv3, YOLOv11, and Mask R-CNN benchmarked for accuracy, mAP, and inference FPS. AI models trained on TomatoMAP achieved accuracy comparable to five domain experts, with reliability supported by Cohen’s Kappa statistics.

Key takeaway

For Computer Vision Engineers developing agricultural AI, TomatoMAP provides a robust, publicly available dataset to train and validate models for fine-grained tomato phenotyping. You should explore its multi-angle imagery and detailed annotations to improve the accuracy and reproducibility of your plant analysis systems, especially for real-time applications. The dataset's benchmarking against models like MobileNetv3 and YOLOv11 offers a strong baseline for your own model selection and optimization.

Key insights

TomatoMAP dataset enables accurate, reproducible fine-grained tomato phenotyping using multi-angle imagery and AI models.

Principles

Method

The dataset construction involves capturing images from 12 plant poses at four camera elevations, followed by manual annotation of seven regions of interest and 50 BBCH growth stages, then benchmarking AI models.

In practice

Topics

Code references

Best for: Computer Vision Engineer, Research Scientist, AI Researcher, AI Scientist, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Machine learning : nature.com subject feeds.