CVMay 13, 2013

Unsupervised ensemble of experts (EoE) framework for automatic binarization of document images

arXiv:1305.2949v214 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of inconsistent binarization performance across different methods for document analysis, though it is incremental as it builds on existing ensemble techniques.

The paper tackles the problem of document image binarization by introducing an unsupervised ensemble of experts (EoE) framework that combines outputs from various methods, achieving promising performance on benchmarks like H-DIBCO'12 and grid-based Sauvola instances.

In recent years, a large number of binarization methods have been developed, with varying performance generalization and strength against different benchmarks. In this work, to leverage on these methods, an ensemble of experts (EoE) framework is introduced, to efficiently combine the outputs of various methods. The proposed framework offers a new selection process of the binarization methods, which are actually the experts in the ensemble, by introducing three concepts: confidentness, endorsement and schools of experts. The framework, which is highly objective, is built based on two general principles: (i) consolidation of saturated opinions and (ii) identification of schools of experts. After building the endorsement graph of the ensemble for an input document image based on the confidentness of the experts, the saturated opinions are consolidated, and then the schools of experts are identified by thresholding the consolidated endorsement graph. A variation of the framework, in which no selection is made, is also introduced that combines the outputs of all experts using endorsement-dependent weights. The EoE framework is evaluated on the set of participating methods in the H-DIBCO'12 contest and also on an ensemble generated from various instances of grid-based Sauvola method with promising performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes