LGCRMLJun 22, 2024

Credit Attribution and Stable Compression

arXiv:2406.15916v23 citations
Originality Incremental advance
AI Analysis

This addresses the issue of properly crediting original creators in generative models and academic research, though it is incremental as it builds on well-studied stability concepts.

The paper tackles the problem of credit attribution in machine learning by proposing new stability definitions that relax differential privacy for a subset of datapoints, allowing them to be used non-stably with permission while ensuring the rest have no significant influence on the output. It characterizes learnability within the PAC framework and extends existing stability notions like differential privacy and stable sample compression.

Credit attribution is crucial across various fields. In academic research, proper citation acknowledges prior work and establishes original contributions. Similarly, in generative models, such as those trained on existing artworks or music, it is important to ensure that any generated content influenced by these works appropriately credits the original creators. We study credit attribution by machine learning algorithms. We propose new definitions--relaxations of Differential Privacy--that weaken the stability guarantees for a designated subset of $k$ datapoints. These $k$ datapoints can be used non-stably with permission from their owners, potentially in exchange for compensation. Meanwhile, the remaining datapoints are guaranteed to have no significant influence on the algorithm's output. Our framework extends well-studied notions of stability, including Differential Privacy ($k = 0$), differentially private learning with public data (where the $k$ public datapoints are fixed in advance), and stable sample compression (where the $k$ datapoints are selected adaptively by the algorithm). We examine the expressive power of these stability notions within the PAC learning framework, provide a comprehensive characterization of learnability for algorithms adhering to these principles, and propose directions and questions for future research.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes