LGAIMay 31, 2023

Towards Fair Disentangled Online Learning for Changing Environments

arXiv:2306.01007v221 citations
Originality Incremental advance
AI Analysis

This work addresses the incremental challenge of integrating fairness into online learning for dynamic environments, which is important for applications like automated decision systems that must adapt to shifting data distributions while avoiding bias.

The paper tackles the problem of online learning in changing environments by proposing a novel algorithm that disentangles environment-invariant and environment-specific factors, incorporating fairness constraints to ensure statistical parity across sub-populations. The result shows empirical improvements in model accuracy and fairness on real-world datasets compared to baseline methods.

In the problem of online learning for changing environments, data are sequentially received one after another over time, and their distribution assumptions may vary frequently. Although existing methods demonstrate the effectiveness of their learning algorithms by providing a tight bound on either dynamic regret or adaptive regret, most of them completely ignore learning with model fairness, defined as the statistical parity across different sub-population (e.g., race and gender). Another drawback is that when adapting to a new environment, an online learner needs to update model parameters with a global change, which is costly and inefficient. Inspired by the sparse mechanism shift hypothesis, we claim that changing environments in online learning can be attributed to partial changes in learned parameters that are specific to environments and the rest remain invariant to changing environments. To this end, in this paper, we propose a novel algorithm under the assumption that data collected at each time can be disentangled with two representations, an environment-invariant semantic factor and an environment-specific variation factor. The semantic factor is further used for fair prediction under a group fairness constraint. To evaluate the sequence of model parameters generated by the learner, a novel regret is proposed in which it takes a mixed form of dynamic and static regret metrics followed by a fairness-aware long-term constraint. The detailed analysis provides theoretical guarantees for loss regret and violation of cumulative fairness constraints. Empirical evaluations on real-world datasets demonstrate our proposed method sequentially outperforms baseline methods in model accuracy and fairness.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes