AI · May 30, 2025

SMELLNET: A Large-scale Dataset for Real-world Smell Recognition

arXiv:2506.00239v2 · 11 citations · h-index: 3
Originality: Synthesis-oriented
AI Analysis

This paper addresses the problem of enabling real-world olfactory applications in healthcare, food, and other domains. Its contribution is incremental: it introduces a new dataset and method rather than a fundamental breakthrough.

The paper tackles the lack of large-scale benchmarks for AI smell recognition by creating SmellNet, a dataset of about 828,000 data points across 50 substances and 43 mixtures, and by developing ScentFormer, which achieves 58.5% Top-1 accuracy on SmellNet-Base classification and 50.2% Top-1@0.1 on SmellNet-Mixture distribution prediction (test-seen split).

The ability of AI to sense and identify various substances based on their smell alone can have profound impacts on allergen detection (e.g., smelling gluten or peanuts in a cake), monitoring the manufacturing process, and sensing hormones that indicate emotional states, stress levels, and diseases. Despite these broad impacts, there are virtually no large-scale benchmarks, and therefore little progress, for training and evaluating AI systems' ability to smell in the real world. In this paper, we use small gas and chemical sensors to create SmellNet, the first large-scale database that digitizes a diverse range of smells in the natural world. SmellNet contains about 828,000 data points across 50 substances, spanning nuts, spices, herbs, fruits, and vegetables, and 43 mixtures among them, with 68 hours of data collected. Using SmellNet, we developed ScentFormer, a Transformer-based architecture combining temporal differencing and sliding-window augmentation for smell data. For the SmellNet-Base classification task, ScentFormer achieves 58.5% Top-1 accuracy, and for the SmellNet-Mixture distribution prediction task, ScentFormer achieves 50.2% Top-1@0.1 on the test-seen split. ScentFormer's ability to generalize across conditions and capture transient chemical dynamics demonstrates the promise of temporal modeling in olfactory AI. SmellNet and ScentFormer lay the groundwork for real-world olfactory applications across healthcare, food and beverage, environmental monitoring, manufacturing, and entertainment.
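The abstract names temporal differencing and sliding-window augmentation but gives no parameters here. The sketch below illustrates the two ideas in general form on a multi-channel sensor recording; it is not the authors' implementation. The function names, the `lag`, `window`, and `stride` values, and the toy `readings` are all assumptions made for illustration.

```python
from typing import List

# A recording is T time steps, each holding C sensor-channel values.
Series = List[List[float]]


def temporal_difference(series: Series, lag: int = 1) -> Series:
    """Return series[t] - series[t - lag] per channel (length T - lag).

    Differencing emphasises how readings change over time rather than
    their absolute level. The lag value is an assumed hyperparameter.
    """
    if lag < 1 or lag >= len(series):
        raise ValueError("lag must satisfy 1 <= lag < len(series)")
    return [
        [now - before for now, before in zip(series[t], series[t - lag])]
        for t in range(lag, len(series))
    ]


def sliding_windows(series: Series, window: int, stride: int = 1) -> List[Series]:
    """Cut one recording into overlapping fixed-length windows.

    Each window can serve as a separate training example. The window
    and stride values are assumed hyperparameters.
    """
    if window < 1 or stride < 1:
        raise ValueError("window and stride must be positive")
    if window > len(series):
        raise ValueError("window is longer than the series")
    last_start = len(series) - window
    return [series[s:s + window] for s in range(0, last_start + 1, stride)]


# Toy recording (hypothetical): 10 time steps, 2 channels.
readings = [[float(t), float(2 * t)] for t in range(10)]

diffed = temporal_difference(readings, lag=1)          # 9 time steps remain
windows = sliding_windows(diffed, window=4, stride=2)  # window starts at 0, 2, 4
```

With these toy values, one 10-step recording yields three overlapping 4-step windows of differenced readings, each of which could be fed to a sequence model as its own example.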

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.