CVNov 10, 2023

Keystroke Verification Challenge (KVC): Biometric and Fairness Benchmark Evaluation

arXiv:2311.06000v311 citationsh-index: 58
Originality Incremental advance
AI Analysis

This work addresses the problem of inconsistent evaluation and fairness in keystroke biometrics for researchers and practitioners, though it is incremental as it builds on existing systems like TypeNet and TypeFormer.

The authors tackled the lack of standardized benchmarks in keystroke dynamics biometric verification by introducing the Keystroke Verification Challenge (KVC), a framework based on data from over 185,000 subjects, and a new fairness metric called Skewed Impostor Ratio (SIR). They demonstrated that using time-domain features instead of text content maintains satisfactory performance while enhancing privacy.

Analyzing keystroke dynamics (KD) for biometric verification has several advantages: it is among the most discriminative behavioral traits; keyboards are among the most common human-computer interfaces, being the primary means for users to enter textual data; its acquisition does not require additional hardware, and its processing is relatively lightweight; and it allows for transparently recognizing subjects. However, the heterogeneity of experimental protocols and metrics, and the limited size of the databases adopted in the literature impede direct comparisons between different systems, thus representing an obstacle in the advancement of keystroke biometrics. To alleviate this aspect, we present a new experimental framework to benchmark KD-based biometric verification performance and fairness based on tweet-long sequences of variable transcript text from over 185,000 subjects, acquired through desktop and mobile keyboards, extracted from the Aalto Keystroke Databases. The framework runs on CodaLab in the form of the Keystroke Verification Challenge (KVC). Moreover, we also introduce a novel fairness metric, the Skewed Impostor Ratio (SIR), to capture inter- and intra-demographic group bias patterns in the verification scores. We demonstrate the usefulness of the proposed framework by employing two state-of-the-art keystroke verification systems, TypeNet and TypeFormer, to compare different sets of input features, achieving a less privacy-invasive system, by discarding the analysis of text content (ASCII codes of the keys pressed) in favor of extended features in the time domain. Our experiments show that this approach allows to maintain satisfactory performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes