CVJul 15, 2025

Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping

Yujie Zhang, Sabine Struckmeyer, Andreas Kolb, Sven Reichardt

arXiv:2507.11279v13.6h-index: 3Sci Data

Originality Synthesis-oriented

AI Analysis

This provides a standardized dataset for fine-grained phenotyping of tomatoes, addressing reproducibility issues in plant science, but it is incremental as it focuses on a specific domain.

The researchers tackled the problem of observer bias and inconsistency in plant phenotyping by creating TomatoMAP, a dataset with 64,464 RGB images and fine-grained annotations, and validated it with deep learning models achieving accuracy and speed comparable to human experts.

Observer bias and inconsistencies in traditional plant phenotyping methods limit the accuracy and reproducibility of fine-grained plant analysis. To overcome these challenges, we developed TomatoMAP, a comprehensive dataset for Solanum lycopersicum using an Internet of Things (IoT) based imaging system with standardized data acquisition protocols. Our dataset contains 64,464 RGB images that capture 12 different plant poses from four camera elevation angles. Each image includes manually annotated bounding boxes for seven regions of interest (ROIs), including leaves, panicle, batch of flowers, batch of fruits, axillary shoot, shoot and whole plant area, along with 50 fine-grained growth stage classifications based on the BBCH scale. Additionally, we provide 3,616 high-resolution image subset with pixel-wise semantic and instance segmentation annotations for fine-grained phenotyping. We validated our dataset using a cascading model deep learning framework combining MobileNetv3 for classification, YOLOv11 for object detection, and MaskRCNN for segmentation. Through AI vs. Human analysis involving five domain experts, we demonstrate that the models trained on our dataset achieve accuracy and speed comparable to the experts. Cohen's Kappa and inter-rater agreement heatmap confirm the reliability of automated fine-grained phenotyping using our approach.

View on arXiv PDF

Similar