CLHCJul 11, 2024

Generating Contextually-Relevant Navigation Instructions for Blind and Low Vision People

arXiv:2407.08219v111 citationsh-index: 6
Originality Incremental advance
AI Analysis

This addresses navigation challenges for blind and low-vision people, but it is incremental as it builds on existing grounded instruction generation methods.

The paper tackled the problem of generating navigation instructions for blind and low-vision individuals in unfamiliar environments, and the result showed that large pretrained language models can produce correct and useful instructions, as demonstrated through a sighted user study and insights from BLV users.

Navigating unfamiliar environments presents significant challenges for blind and low-vision (BLV) individuals. In this work, we construct a dataset of images and goals across different scenarios such as searching through kitchens or navigating outdoors. We then investigate how grounded instruction generation methods can provide contextually-relevant navigational guidance to users in these instances. Through a sighted user study, we demonstrate that large pretrained language models can produce correct and useful instructions perceived as beneficial for BLV users. We also conduct a survey and interview with 4 BLV users and observe useful insights on preferences for different instructions based on the scenario.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes