CVAIMar 12, 2024

From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios

arXiv:2403.07403v115 citationsh-index: 32IEEE transactions on multimedia
Originality Synthesis-oriented
AI Analysis

This addresses a domain gap issue for health management applications, but it is incremental as it builds on existing datasets and methods.

The paper tackles the problem of food recognition models trained on curated canteen datasets failing in daily-life scenarios by introducing new benchmarks (DailyFood-172 and DailyFood-16) and a baseline method (MCRL), which improves performance when integrated with existing approaches.

The precise recognition of food categories plays a pivotal role for intelligent health management, attracting significant research attention in recent years. Prominent benchmarks, such as Food-101 and VIREO Food-172, provide abundant food image resources that catalyze the prosperity of research in this field. Nevertheless, these datasets are well-curated from canteen scenarios and thus deviate from food appearances in daily life. This discrepancy poses great challenges in effectively transferring classifiers trained on these canteen datasets to broader daily-life scenarios encountered by humans. Toward this end, we present two new benchmarks, namely DailyFood-172 and DailyFood-16, specifically designed to curate food images from everyday meals. These two datasets are used to evaluate the transferability of approaches from the well-curated food image domain to the everyday-life food image domain. In addition, we also propose a simple yet effective baseline method named Multi-Cluster Reference Learning (MCRL) to tackle the aforementioned domain gap. MCRL is motivated by the observation that food images in daily-life scenarios exhibit greater intra-class appearance variance compared with those in well-curated benchmarks. Notably, MCRL can be seamlessly coupled with existing approaches, yielding non-trivial performance enhancements. We hope our new benchmarks can inspire the community to explore the transferability of food recognition models trained on well-curated datasets toward practical real-life applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes