CLAILGNov 9, 2025

Rep2Text: Decoding Full Text from a Single LLM Token Representation

arXiv:2511.06571v14 citationsh-index: 17
Originality Incremental advance
AI Analysis

This addresses the interpretability of LLMs for researchers, though it is incremental as it builds on existing representation and decoding methods.

The paper tackled the problem of recovering original input text from a single last-token representation in LLMs, proposing Rep2Text, which achieved over half of the information recovery in 16-token sequences on average while maintaining strong semantic integrity.

Large language models (LLMs) have achieved remarkable progress across diverse tasks, yet their internal mechanisms remain largely opaque. In this work, we address a fundamental question: to what extent can the original input text be recovered from a single last-token representation within an LLM? We propose Rep2Text, a novel framework for decoding full text from last-token representations. Rep2Text employs a trainable adapter that projects a target model's internal representations into the embedding space of a decoding language model, which then autoregressively reconstructs the input text. Experiments on various model combinations (Llama-3.1-8B, Gemma-7B, Mistral-7B-v0.1, Llama-3.2-3B) demonstrate that, on average, over half of the information in 16-token sequences can be recovered from this compressed representation while maintaining strong semantic integrity and coherence. Furthermore, our analysis reveals an information bottleneck effect: longer sequences exhibit decreased token-level recovery while preserving strong semantic integrity. Besides, our framework also demonstrates robust generalization to out-of-distribution medical data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes