CVMay 27, 2025

IndustryEQA: Pushing the Frontiers of Embodied Question Answering in Industrial Scenarios

arXiv:2505.20640v16 citationsh-index: 10
Originality Synthesis-oriented
AI Analysis

This addresses the problem of evaluating embodied agents for real-world industrial applications, though it's incremental as it extends existing EQA frameworks to a new domain.

The authors tackled the lack of embodied question answering benchmarks for industrial settings by introducing IndustryEQA, a new benchmark with 1,344 question-answer pairs that evaluates agent capabilities in safety-critical warehouse scenarios.

Existing Embodied Question Answering (EQA) benchmarks primarily focus on household environments, often overlooking safety-critical aspects and reasoning processes pertinent to industrial settings. This drawback limits the evaluation of agent readiness for real-world industrial applications. To bridge this, we introduce IndustryEQA, the first benchmark dedicated to evaluating embodied agent capabilities within safety-critical warehouse scenarios. Built upon the NVIDIA Isaac Sim platform, IndustryEQA provides high-fidelity episodic memory videos featuring diverse industrial assets, dynamic human agents, and carefully designed hazardous situations inspired by real-world safety guidelines. The benchmark includes rich annotations covering six categories: equipment safety, human safety, object recognition, attribute recognition, temporal understanding, and spatial understanding. Besides, it also provides extra reasoning evaluation based on these categories. Specifically, it comprises 971 question-answer pairs generated from small warehouse and 373 pairs from large ones, incorporating scenarios with and without human. We further propose a comprehensive evaluation framework, including various baseline models, to assess their general perception and reasoning abilities in industrial environments. IndustryEQA aims to steer EQA research towards developing more robust, safety-aware, and practically applicable embodied agents for complex industrial environments. Benchmark and codes are available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes