Inference for Change Points in High Dimensional Mean Shift Models
This work addresses the challenge of statistical inference for change points in high-dimensional data, which is incremental as it builds on existing estimation methods to provide rigorous confidence intervals.
The paper tackles the problem of constructing confidence intervals for change point locations in high-dimensional mean shift models, developing a locally refitted least squares estimator that achieves optimal component-wise rates and the sharpest simultaneous rates in the literature, enabling asymptotically valid confidence intervals.
We consider the problem of constructing confidence intervals for the locations of change points in a high-dimensional mean shift model. To that end, we develop a locally refitted least squares estimator and obtain component-wise and simultaneous rates of estimation of the underlying change points. The simultaneous rate is the sharpest available in the literature by at least a factor of $\log p,$ while the component-wise one is optimal. These results enable existence of limiting distributions. Component-wise distributions are characterized under both vanishing and non-vanishing jump size regimes, while joint distributions for any finite subset of change point estimates are characterized under the latter regime, which also yields asymptotic independence of these estimates. The combined results are used to construct asymptotically valid component-wise and simultaneous confidence intervals for the change point parameters. The results are established under a high dimensional scaling, allowing for diminishing jump sizes, in the presence of diverging number of change points and under subexponential errors. They are illustrated on synthetic data and on sensor measurements from smartphones for activity recognition.