CVDec 16, 2024

Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach

arXiv:2412.11668v25.22 citationsh-index: 7Has CodeIEEE Trans Inf Forensics Secur

Originality Highly original

AI Analysis

This work addresses the need for effective retrieval systems in online handwriting, particularly for Chinese phrases, by providing a novel method and dataset to overcome scarcity in the field.

The paper tackles the problem of online writer retrieval for Chinese handwritten phrases by proposing DOLPHIN, a retrieval model that uses synergistic temporal-frequency analysis, and introduces OLIWER, a large-scale dataset with over 670,000 phrases from 1,731 individuals, demonstrating superior performance over existing methods.

Currently, the prevalence of online handwriting has spurred a critical need for effective retrieval systems to accurately search relevant handwriting instances from specific writers, known as online writer retrieval. Despite the growing demand, this field suffers from a scarcity of well-established methodologies and public large-scale datasets. This paper tackles these challenges with a focus on Chinese handwritten phrases. First, we propose DOLPHIN, a novel retrieval model designed to enhance handwriting representations through synergistic temporal-frequency analysis. For frequency feature learning, we propose the HFGA block, which performs gated cross-attention between the vanilla temporal handwriting sequence and its high-frequency sub-bands to amplify salient writing details. For temporal feature learning, we propose the CAIR block, tailored to promote channel interaction and reduce channel redundancy. Second, to address data deficit, we introduce OLIWER, a large-scale online writer retrieval dataset encompassing over 670,000 Chinese handwritten phrases from 1,731 individuals. Through extensive evaluations, we demonstrate the superior performance of DOLPHIN over existing methods. In addition, we explore cross-domain writer retrieval and reveal the pivotal role of increasing feature alignment in bridging the distributional gap between different handwriting data. Our findings emphasize the significance of point sampling frequency and pressure features in improving handwriting representation quality and retrieval performance. Code and dataset are available at https://github.com/SCUT-DLVCLab/DOLPHIN.

View on arXiv PDF Code

Similar