CLAILGOct 3, 2025

Time-To-Inconsistency: A Survival Analysis of Large Language Model Robustness to Adversarial Attacks

arXiv:2510.02712v15 citationsh-index: 47
Originality Highly original
AI Analysis

This work addresses the lack of temporal robustness evaluation in conversational AI, offering a new paradigm for designing resilient agents, though it is incremental in applying survival analysis to this domain.

The study tackled the problem of evaluating Large Language Models' robustness in extended multi-turn dialogues by conducting a survival analysis on 36,951 conversation turns across 9 LLMs, revealing that abrupt semantic drift increases failure risk while gradual drift reduces it, enabling longer dialogues.

Large Language Models (LLMs) have revolutionized conversational AI, yet their robustness in extended multi-turn dialogues remains poorly understood. Existing evaluation frameworks focus on static benchmarks and single-turn assessments, failing to capture the temporal dynamics of conversational degradation that characterize real-world interactions. In this work, we present the first comprehensive survival analysis of conversational AI robustness, analyzing 36,951 conversation turns across 9 state-of-the-art LLMs to model failure as a time-to-event process. Our survival modeling framework-employing Cox proportional hazards, Accelerated Failure Time, and Random Survival Forest approaches-reveals extraordinary temporal dynamics. We find that abrupt, prompt-to-prompt(P2P) semantic drift is catastrophic, dramatically increasing the hazard of conversational failure. In stark contrast, gradual, cumulative drift is highly protective, vastly reducing the failure hazard and enabling significantly longer dialogues. AFT models with interactions demonstrate superior performance, achieving excellent discrimination and exceptional calibration. These findings establish survival analysis as a powerful paradigm for evaluating LLM robustness, offer concrete insights for designing resilient conversational agents, and challenge prevailing assumptions about the necessity of semantic consistency in conversational AI Systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes