CRCLLGMar 25

How Vulnerable Are Edge LLMs?

arXiv:2603.2382232.9h-index: 4
AI Analysis

This reveals a previously underexplored security risk for edge-deployed LLMs, which is incremental as it builds on existing extraction methods but applies them to quantized models.

The paper tackles the problem of query-based knowledge extraction from quantized LLMs on edge devices, showing that quantization does not remove semantic knowledge, allowing substantial behavioral recovery with CLIQ, which outperforms original queries in metrics like BERTScore, BLEU, and ROUGE under limited query budgets.

Large language models (LLMs) are increasingly deployed on edge devices under strict computation and quantization constraints, yet their security implications remain unclear. We study query-based knowledge extraction from quantized edge-deployed LLMs under realistic query budgets and show that, although quantization introduces noise, it does not remove the underlying semantic knowledge, allowing substantial behavioral recovery through carefully designed queries. To systematically analyze this risk, we propose \textbf{CLIQ} (\textbf{Cl}ustered \textbf{I}nstruction \textbf{Q}uerying), a structured query construction framework that improves semantic coverage while reducing redundancy. Experiments on quantized Qwen models (INT8/INT4) demonstrate that CLIQ consistently outperforms original queries across BERTScore, BLEU, and ROUGE, enabling more efficient extraction under limited budgets. These results indicate that quantization alone does not provide effective protection against query-based extraction, highlighting a previously underexplored security risk in edge-deployed LLMs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes