MLITLGDec 3, 2019

Sequential Classification with Empirically Observed Statistics

arXiv:1912.01170v314 citations
Originality Incremental advance
AI Analysis

This addresses a practical problem for real-world machine learning applications where data arrives sequentially and distributions are unknown, offering an incremental improvement over prior work.

The paper tackles sequential classification with unknown distributions using only empirically sampled sequences, proposing a classifier that minimizes the number of test samples needed while analyzing error probabilities. It demonstrates significant advantages over an existing non-sequential method and extends results to multi-class scenarios without requiring a rejection option.

Motivated by real-world machine learning applications, we consider a statistical classification task in a sequential setting where test samples arrive sequentially. In addition, the generating distributions are unknown and only a set of empirically sampled sequences are available to a decision maker. The decision maker is tasked to classify a test sequence which is known to be generated according to either one of the distributions. In particular, for the binary case, the decision maker wishes to perform the classification task with minimum number of the test samples, so, at each step, she declares that either hypothesis 1 is true, hypothesis 2 is true, or she requests for an additional test sample. We propose a classifier and analyze the type-I and type-II error probabilities. We demonstrate the significant advantage of our sequential scheme compared to an existing non-sequential classifier proposed by Gutman. Finally, we extend our setup and results to the multi-class classification scenario and again demonstrate that the variable-length nature of the problem affords significant advantages as one can achieve the same set of exponents as Gutman's fixed-length setting but without having the rejection option.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes