Out of Sight But Not Out of Mind: An Answer Set Programming Based Online Abduction Framework for Visual Sensemaking in Autonomous Driving
It addresses safety-critical autonomous driving situations by improving human-centred visual sensemaking, though it appears incremental as it builds on existing methods.
The paper tackles visual sensemaking in autonomous driving by developing an online abduction framework using answer set programming, integrating deep learning for visual computing and evaluating on benchmarks like KITTIMOD and MOT to enhance semantic representation and explainability.
We demonstrate the need and potential of systematically integrated vision and semantics} solutions for visual sensemaking (in the backdrop of autonomous driving). A general method for online visual sensemaking using answer set programming is systematically formalised and fully implemented. The method integrates state of the art in (deep learning based) visual computing, and is developed as a modular framework usable within hybrid architectures for perception & control. We evaluate and demo with community established benchmarks KITTIMOD and MOT. As use-case, we focus on the significance of human-centred visual sensemaking ---e.g., semantic representation and explainability, question-answering, commonsense interpolation--- in safety-critical autonomous driving situations.