Memory-efficient and fast implementation of local adaptive binarization methods
This enables real-time application of binarization methods on resource-limited devices, representing an incremental improvement in efficiency.
The paper tackles the problem of memory-intensive local adaptive binarization for noisy images by proposing a recursive method that avoids integral images, achieving runtime in Θ(HW) time independent of window size and reducing auxiliary space to around 6 min{H,W} bytes compared to 16HW bytes.
Binarization is widely used as an image preprocessing step to separate object especially text from background before recognition. For noisy images with uneven illumination such as degraded documents, threshold values need to be computed pixel by pixel to obtain a good segmentation. Since local threshold values typically depend on moment-based statistics such as mean and variance of gray levels inside rectangular windows, integral images which are memory consuming are commonly used to accelerate the calculation. Observed that moment-based statistics as well as quantiles in a sliding window can be computed recursively, integral images can be avoided without neglecting speed, more binarization methods can be accelerated too. In particular, given a $H\times W$ input image, Sauvola's method and alike can run in $Θ(HW)$ time independent of window size, while only around $6\min\{H,W\}$ bytes of auxiliary space is needed, which is significantly lower than the $16HW$ bytes occupied by the two integral images. Since the proposed technique enable various well-known local adaptive binarization methods to be applied in real-time use cases on devices with limited resources, it has the potential of wide application.