LG CLAug 18, 2025

TCUQ: Single-Pass Uncertainty Quantification from Temporal Consistency with Streaming Conformal Calibration for TinyML

Ismail Lamaakal, Chaymae Yahyati, Khalid El Makkaoui, Ibrahim Ouahbi, Yassine Maleh

arXiv:2508.12905v14.12 citationsh-index: 11

Originality Highly original

AI Analysis

This provides a practical and resource-efficient solution for on-device monitoring in TinyML, addressing the challenge of uncertainty quantification in resource-constrained environments.

The paper tackles the problem of uncertainty quantification for streaming TinyML by introducing TCUQ, a single-pass, label-free method that uses temporal consistency and streaming conformal calibration to produce calibrated risk scores, reducing footprint and latency by 50-60% and 30-45% compared to baselines while improving accuracy drop detection by 3-7 AUPRC points.

We introduce TCUQ, a single pass, label free uncertainty monitor for streaming TinyML that converts short horizon temporal consistency captured via lightweight signals on posteriors and features into a calibrated risk score with an O(W ) ring buffer and O(1) per step updates. A streaming conformal layer turns this score into a budgeted accept/abstain rule, yielding calibrated behavior without online labels or extra forward passes. On microcontrollers, TCUQ fits comfortably on kilobyte scale devices and reduces footprint and latency versus early exit and deep ensembles (typically about 50 to 60% smaller and about 30 to 45% faster), while methods of similar accuracy often run out of memory. Under corrupted in distribution streams, TCUQ improves accuracy drop detection by 3 to 7 AUPRC points and reaches up to 0.86 AUPRC at high severities; for failure detection it attains up to 0.92 AUROC. These results show that temporal consistency, coupled with streaming conformal calibration, provides a practical and resource efficient foundation for on device monitoring in TinyML.

View on arXiv PDF

Similar