CRCYLGJan 21, 2022

Privacy Policies Across the Ages: Content and Readability of Privacy Policies 1996--2021

arXiv:2201.08739v111 citations
Originality Synthesis-oriented
AI Analysis

This addresses the problem of ineffective privacy policies for users, highlighting regulatory impacts and proposing improvements, though it is incremental in applying existing methods to new longitudinal data.

The study analyzed the 25-year evolution of privacy policies from 1996 to 2021, finding that they have become longer and harder to read, particularly after regulations like GDPR and CCPA, and uncovered concerning data practices.

It is well-known that most users do not read privacy policies, but almost all users tick the box to agree with them. In this paper, we analyze the 25-year history of privacy policies using methods from transparency research, machine learning, and natural language processing. Specifically, we collect a large-scale longitudinal corpus of privacy policies from 1996 to 2021 and analyze the length and readability of privacy policies as well as their content in terms of the data practices they describe, the rights they grant to users, and the rights they reserve for their organizations. We pay particular attention to changes in response to recent privacy regulations such as the GDPR and CCPA. Our results show that policies are getting longer and harder to read, especially after new regulations take effect, and we find a range of concerning data practices. Our results allow us to speculate why privacy policies are rarely read and propose changes that would make privacy policies serve their readers instead of their writers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes