CLAILGOct 17, 2024

Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation

arXiv:2410.13640v257 citationsh-index: 9ICLR
Originality Incremental advance
AI Analysis

This addresses the need for reliable LLM deployment by providing a label-free, real-time self-evaluation method, though it appears incremental as it builds on existing latent space analysis.

The paper tackles the problem of LLM self-evaluation by proposing Chain-of-Embedding (CoE) in the latent space to enable output-free estimation of response correctness, achieving effectiveness across four domains and seven LLMs with millisecond-level computational cost.

LLM self-evaluation relies on the LLM's own ability to estimate response correctness, which can greatly improve its deployment reliability. In this research track, we propose the Chain-of-Embedding (CoE) in the latent space to enable LLMs to perform output-free self-evaluation. CoE consists of all progressive hidden states produced during the inference time, which can be treated as the latent thinking path of LLMs. We find that when LLMs respond correctly and incorrectly, their CoE features differ, these discrepancies assist us in estimating LLM response correctness. Experiments in four diverse domains and seven LLMs fully demonstrate the effectiveness of our method. Meanwhile, its label-free design intent without any training and millisecond-level computational cost ensures real-time feedback in large-scale scenarios. More importantly, we provide interesting insights into LLM response correctness from the perspective of hidden state changes inside LLMs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes