CVAug 17, 2020

WSRNet: Joint Spotting and Recognition of Handwritten Words

arXiv:2008.07109v11 citations
Originality Incremental advance
AI Analysis

This work addresses the need for efficient and accurate processing of handwritten documents, offering a combined solution for researchers and practitioners in document analysis, though it is incremental as it builds on existing methods.

The paper tackles the problem of joint keyword spotting and word recognition in handwritten text by proposing a unified model with a non-recurrent CTC branch and a Seq2Seq branch with an autoencoding module, resulting in state-of-the-art performance in keyword spotting and competitive results in word recognition.

In this work, we present a unified model that can handle both Keyword Spotting and Word Recognition with the same network architecture. The proposed network is comprised of a non-recurrent CTC branch and a Seq2Seq branch that is further augmented with an Autoencoding module. The related joint loss leads to a boost in recognition performance, while the Seq2Seq branch is used to create efficient word representations. We show how to further process these representations with binarization and a retraining scheme to provide compact and highly efficient descriptors, suitable for keyword spotting. Numerical results validate the usefulness of the proposed architecture, as our method outperforms the previous state-of-the-art in keyword spotting, and provides results in the ballpark of the leading methods for word recognition.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes