LGCVDec 21, 2018

Sparse One-Time Grab Sampling of Inliers

arXiv:1901.02338v1
Originality Incremental advance
AI Analysis

This addresses a fundamental bottleneck in computer vision and data mining for handling big data, though it appears incremental as it builds on prior sampling methods.

The paper tackles the problem of efficiently sampling from large datasets with multiple structures and outliers to ensure sufficient inliers for model instantiation, proposing a 'one-time-grab' algorithm that minimizes the number of samples needed to guarantee with probability P that at least ε points are selected per structure, where ε is the degrees of freedom.

Estimating structures in "big data" and clustering them are among the most fundamental problems in computer vision, pattern recognition, data mining, and many other other research fields. Over the past few decades, many studies have been conducted focusing on different aspects of these problems. One of the main approaches that is explored in the literature to tackle the problems of size and dimensionality is sampling subsets of the data in order to estimate the characteristics of the whole population, e.g. estimating the underlying clusters or structures in the data. In this paper, we propose a `one-time-grab' sampling algorithm\cite{jaberi2015swift,jaberi2018sparse}. This method can be used as the front end to any supervised or unsupervised clustering method. Rather than focusing on the strategy of maximizing the probability of sampling inliers, our goal is to minimize the number of samples needed to instantiate all underlying model instances. More specifically, our goal is to answer the following question: {\em `Given a very large population of points with $C$ embedded structures and gross outliers, what is the minimum number of points $r$ to be selected randomly in one grab in order to make sure with probability $P$ that at least $\varepsilon$ points are selected on each structure, where $\varepsilon$ is the number of degrees of freedom of each structure.'}

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes