ROCVJan 10, 2025

Semantic Mapping in Indoor Embodied AI -- A Survey on Advances, Challenges, and Future Directions

arXiv:2501.05750v38 citationsh-index: 46Trans. Mach. Learn. Res.
Originality Synthesis-oriented
AI Analysis

This is an incremental survey that synthesizes existing research to guide future work in semantic mapping for embodied AI systems.

This paper provides a comprehensive survey of semantic map-building approaches in embodied AI for indoor navigation, categorizing them by structural representation and information type, and identifies trends toward open-vocabulary, queryable maps while noting challenges like high memory demands.

Intelligent embodied agents (e.g. robots) need to perform complex semantic tasks in unfamiliar environments. Among many skills that the agents need to possess, building and maintaining a semantic map of the environment is most crucial in long-horizon tasks. A semantic map captures information about the environment in a structured way, allowing the agent to reference it for advanced reasoning throughout the task. While existing surveys in embodied AI focus on general advancements or specific tasks like navigation and manipulation, this paper provides a comprehensive review of semantic map-building approaches in embodied AI, specifically for indoor navigation. We categorize these approaches based on their structural representation (spatial grids, topological graphs, dense point-clouds or hybrid maps) and the type of information they encode (implicit features or explicit environmental data). We also explore the strengths and limitations of the map building techniques, highlight current challenges, and propose future research directions. We identify that the field is moving towards developing open-vocabulary, queryable, task-agnostic map representations, while high memory demands and computational inefficiency still remaining to be open challenges. This survey aims to guide current and future researchers in advancing semantic mapping techniques for embodied AI systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes