CRDMSTJun 30, 2021

Protecting Time Series Data with Minimal Forecast Loss

arXiv:2106.16085v2
Originality Incremental advance
AI Analysis

It addresses the challenge of balancing data privacy with forecasting accuracy for data providers and forecasters, though it is incremental as it builds on existing anonymization guidelines.

The paper tackles the problem of protecting time series data under privacy regulations like GDPR and CCPA while minimizing forecast loss, developing k-nTS Swapping and k-mTS Shuffling methods that maintain forecasts and patterns better than standard methods.

Forecasting could be negatively impacted due to anonymization requirements in data protection legislation. To measure the potential severity of this problem, we derive theoretical bounds for the loss to forecasts from additive exponential smoothing models using protected data. Following the guidelines of anonymization from the General Data Protection Regulation (GDPR) and California Consumer Privacy Act (CCPA), we develop the $k$-nearest Time Series ($k$-nTS) Swapping and $k$-means Time Series ($k$-mTS) Shuffling methods to create protected time series data that minimizes the loss to forecasts while preventing a data intruder from detecting privacy issues. For efficient and effective decision making, we formally model an integer programming problem for a perfect matching for simultaneous data swapping in each cluster. We call it a two-party data privacy framework since our optimization model includes the utilities of a data provider and data intruder. We apply our data protection methods to thousands of time series and find that it maintains the forecasts and patterns (level, trend, and seasonality) of time series well compared to standard data protection methods suggested in legislation. Substantively, our paper addresses the challenge of protecting time series data when used for forecasting. Our findings suggest the managerial importance of incorporating the concerns of forecasters into the data protection itself.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes