A vision-based framework for human behavior understanding in industrial assembly lines
This addresses the need for automated monitoring and ergonomic assessment in manufacturing settings, though it appears incremental as it applies existing computer vision techniques to a new domain-specific dataset.
The paper tackles the problem of understanding human behavior in industrial assembly lines, specifically car door manufacturing, by developing a vision-based framework that estimates worker locations, 3D poses, and analyzes postures and task progress, with results showing effectiveness in classifying postures and monitoring progress.
This paper introduces a vision-based framework for capturing and understanding human behavior in industrial assembly lines, focusing on car door manufacturing. The framework leverages advanced computer vision techniques to estimate workers' locations and 3D poses and analyze work postures, actions, and task progress. A key contribution is the introduction of the CarDA dataset, which contains domain-relevant assembly actions captured in a realistic setting to support the analysis of the framework for human pose and action analysis. The dataset comprises time-synchronized multi-camera RGB-D videos, motion capture data recorded in a real car manufacturing environment, and annotations for EAWS-based ergonomic risk scores and assembly activities. Experimental results demonstrate the effectiveness of the proposed approach in classifying worker postures and robust performance in monitoring assembly task progress.