MLJul 17, 2017

PAC-Bayes and Domain Adaptation

arXiv:1707.05712v326 citations
Originality Incremental advance
AI Analysis

This work addresses domain adaptation for machine learning, offering incremental theoretical improvements and practical algorithms.

The paper tackles domain adaptation by deriving tighter PAC-Bayesian bounds for target risk, improving upon prior work with a novel pseudodistance and a new bound that controls distribution divergence. It infers two learning algorithms from these bounds and evaluates them on real data.

We provide two main contributions in PAC-Bayesian theory for domain adaptation where the objective is to learn, from a source distribution, a well-performing majority vote on a different, but related, target distribution. Firstly, we propose an improvement of the previous approach we proposed in Germain et al. (2013), which relies on a novel distribution pseudodistance based on a disagreement averaging, allowing us to derive a new tighter domain adaptation bound for the target risk. While this bound stands in the spirit of common domain adaptation works, we derive a second bound (introduced in Germain et al., 2016) that brings a new perspective on domain adaptation by deriving an upper bound on the target risk where the distributions' divergence-expressed as a ratio-controls the trade-off between a source error measure and the target voters' disagreement. We discuss and compare both results, from which we obtain PAC-Bayesian generalization bounds. Furthermore, from the PAC-Bayesian specialization to linear classifiers, we infer two learning algorithms, and we evaluate them on real data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes