A topological solution to object segmentation and tracking
This addresses the challenge of object segmentation and tracking for computer vision, offering a non-learning alternative to current methods, though it appears incremental as it builds on existing topological ideas.
The paper tackled the problem of segmenting and tracking objects in video without learning, by using a topological representation derived from light rays, and demonstrated its ability to handle severe appearance changes in cluttered synthetic video.
The world is composed of objects, the ground, and the sky. Visual perception of objects requires solving two fundamental challenges: segmenting visual input into discrete units, and tracking identities of these units despite appearance changes due to object deformation, changing perspective, and dynamic occlusion. Current computer vision approaches to segmentation and tracking that approach human performance all require learning, raising the question: can objects be segmented and tracked without learning? Here, we show that the mathematical structure of light rays reflected from environment surfaces yields a natural representation of persistent surfaces, and this surface representation provides a solution to both the segmentation and tracking problems. We describe how to generate this surface representation from continuous visual input, and demonstrate that our approach can segment and invariantly track objects in cluttered synthetic video despite severe appearance changes, without requiring learning.