NCLGJun 20, 2025

Sequence-to-Sequence Models with Attention Mechanistically Map to the Architecture of Human Memory Search

arXiv:2506.17424v11 citationsh-index: 1Communications Psychology
Originality Incremental advance
AI Analysis

This work provides insights into why humans develop context-based memory architectures by linking them to optimized AI models, potentially advancing cognitive modeling.

The paper demonstrates that RNN-based sequence-to-sequence models with attention in neural machine translation mechanistically align with the Context Maintenance and Retrieval model of human memory, showing that this convergence helps explain the functional role of context in memory and enables modeling human memory search effectively.

Past work has long recognized the important role of context in guiding how humans search their memory. While context-based memory models can explain many memory phenomena, it remains unclear why humans develop such architectures over possible alternatives in the first place. In this work, we demonstrate that foundational architectures in neural machine translation -- specifically, recurrent neural network (RNN)-based sequence-to-sequence models with attention -- exhibit mechanisms that directly correspond to those specified in the Context Maintenance and Retrieval (CMR) model of human memory. Since neural machine translation models have evolved to optimize task performance, their convergence with human memory models provides a deeper understanding of the functional role of context in human memory, as well as presenting new ways to model human memory. Leveraging this convergence, we implement a neural machine translation model as a cognitive model of human memory search that is both interpretable and capable of capturing complex dynamics of learning. We show that our model accounts for both averaged and optimal human behavioral patterns as effectively as context-based memory models. Further, we demonstrate additional strengths of the proposed model by evaluating how memory search performance emerges from the interaction of different model components.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes