AMAT: Medial Axis Transform for Natural Images
This work addresses shape decomposition and reconstruction in computer vision, offering a novel method for medial axis analysis in natural images, but it is incremental as it builds on existing medial transform concepts.
The authors tackled the problem of medial axis transform for natural images by introducing AMAT, which extends medial point detection to color images with local scale and invertibility, achieving state-of-the-art performance on the BMAX500 dataset and significantly better image reconstruction quality using only 10% of pixels.
We introduce Appearance-MAT (AMAT), a generalization of the medial axis transform for natural images, that is framed as a weighted geometric set cover problem. We make the following contributions: i) we extend previous medial point detection methods for color images, by associating each medial point with a local scale; ii) inspired by the invertibility property of the binary MAT, we also associate each medial point with a local encoding that allows us to invert the AMAT, reconstructing the input image; iii) we describe a clustering scheme that takes advantage of the additional scale and appearance information to group individual points into medial branches, providing a shape decomposition of the underlying image regions. In our experiments, we show state-of-the-art performance in medial point detection on Berkeley Medial AXes (BMAX500), a new dataset of medial axes based on the BSDS500 database, and good generalization on the SK506 and WH-SYMMAX datasets. We also measure the quality of reconstructed images from BMAX500, obtained by inverting their computed AMAT. Our approach delivers significantly better reconstruction quality with respect to three baselines, using just 10% of the image pixels. Our code and annotations are available at https://github.com/tsogkas/amat .