CLAILGJun 5, 2023

A Study of Situational Reasoning for Traffic Understanding

CMU
arXiv:2306.02520v223 citationsh-index: 27
AI Analysis

This work addresses the assessment gap in traffic monitoring for improving road safety and smart city infrastructure, but it is incremental as it builds on prior benchmarks and methods.

The paper tackled the problem of assessing whether models can effectively align perceptual information with domain-specific and causal commonsense knowledge for traffic understanding by devising three novel text-based tasks (BDD-QA, TV-QA, HDT-QA) and benchmarking knowledge-enhanced methods under zero-shot evaluation, resulting in in-depth analyses of model performance across data partitions and predictions.

Intelligent Traffic Monitoring (ITMo) technologies hold the potential for improving road safety/security and for enabling smart city infrastructure. Understanding traffic situations requires a complex fusion of perceptual information with domain-specific and causal commonsense knowledge. Whereas prior work has provided benchmarks and methods for traffic monitoring, it remains unclear whether models can effectively align these information sources and reason in novel scenarios. To address this assessment gap, we devise three novel text-based tasks for situational reasoning in the traffic domain: i) BDD-QA, which evaluates the ability of Language Models (LMs) to perform situational decision-making, ii) TV-QA, which assesses LMs' abilities to reason about complex event causality, and iii) HDT-QA, which evaluates the ability of models to solve human driving exams. We adopt four knowledge-enhanced methods that have shown generalization capability across language reasoning tasks in prior work, based on natural language inference, commonsense knowledge-graph self-supervision, multi-QA joint training, and dense retrieval of domain information. We associate each method with a relevant knowledge source, including knowledge graphs, relevant benchmarks, and driving manuals. In extensive experiments, we benchmark various knowledge-aware methods against the three datasets, under zero-shot evaluation; we provide in-depth analyses of model performance on data partitions and examine model predictions categorically, to yield useful insights on traffic understanding, given different background knowledge and reasoning strategies.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes