Token-Efficient Change Detection in LLM APIs
This addresses the need for low-cost, scalable change detection in LLM APIs without requiring access to model internals, though it is incremental as it builds on statistical insights for a specific domain.
The paper tackled the problem of remote change detection in LLMs by developing a black-box method using Border Inputs, achieving performance comparable to grey-box approaches while reducing costs by 30 times.
Remote change detection in LLMs is a difficult problem. Existing methods are either too expensive for deployment at scale, or require initial white-box access to model weights or grey-box access to log probabilities. We aim to achieve both low cost and strict black-box operation, observing only output tokens. Our approach hinges on specific inputs we call Border Inputs, for which there exists more than one output top token. From a statistical perspective, optimal change detection depends on the model's Jacobian and the Fisher information of the output distribution. Analyzing these quantities in low-temperature regimes shows that border inputs enable powerful change detection tests. Building on this insight, we propose the Black-Box Border Input Tracking (B3IT) scheme. Extensive in-vivo and in-vitro experiments show that border inputs are easily found for non-reasoning tested endpoints, and achieve performance on par with the best available grey-box approaches. B3IT reduces costs by $30\times$ compared to existing methods, while operating in a strict black-box setting.