MECVLGEMMLAug 6, 2019

Semiparametric Wavelet-based JPEG IV Estimator for endogenously truncated data

arXiv:1908.02166v1
AI Analysis

This addresses parameter estimation in machine learning and statistics for cases with unobservable data distributions, though it appears incremental as an enriched JPEG algorithm.

The paper tackles the problem of estimating parameters from endogenously truncated data where the original data distribution is unobservable, such as when covariate shift assumptions are violated in machine learning. Their proposed JPEG IV estimator corrects for biases from endogenous truncation and covariates, achieving high accuracy across 2,000,000 different distribution functions.

A new and an enriched JPEG algorithm is provided for identifying redundancies in a sequence of irregular noisy data points which also accommodates a reference-free criterion function. Our main contribution is by formulating analytically (instead of approximating) the inverse of the transpose of JPEGwavelet transform without involving matrices which are computationally cumbersome. The algorithm is suitable for the widely-spread situations where the original data distribution is unobservable such as in cases where there is deficient representation of the entire population in the training data (in machine learning) and thus the covariate shift assumption is violated. The proposed estimator corrects for both biases, the one generated by endogenous truncation and the one generated by endogenous covariates. Results from utilizing 2,000,000 different distribution functions verify the applicability and high accuracy of our procedure to cases in which the disturbances are neither jointly nor marginally normally distributed.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes