ITDSLGNov 9, 2016

A Unified Maximum Likelihood Approach for Optimal Distribution Property Estimation

arXiv:1611.02960v211 citations
Originality Incremental advance
AI Analysis

This provides a unified approach for distribution property estimation, potentially simplifying and optimizing methods in data science, though it appears incremental as it builds on existing PML concepts.

The paper tackles the problem of estimating various symmetric distribution properties, such as support size and entropy, by proving that a single plug-in estimator, profile maximum likelihood (PML), performs as well as the best specialized techniques for each property.

The advent of data science has spurred interest in estimating properties of distributions over large alphabets. Fundamental symmetric properties such as support size, support coverage, entropy, and proximity to uniformity, received most attention, with each property estimated using a different technique and often intricate analysis tools. We prove that for all these properties, a single, simple, plug-in estimator---profile maximum likelihood (PML)---performs as well as the best specialized techniques. This raises the possibility that PML may optimally estimate many other symmetric properties.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes