LGCVMLAug 25, 2020

Transductive Information Maximization For Few-Shot Learning

arXiv:2008.11297v389 citations
AI Analysis

This provides a modular and efficient solution for few-shot learning, improving accuracy in tasks with domain shifts and larger class numbers, though it is incremental as it builds on existing transductive methods.

The paper tackles few-shot learning by introducing Transductive Information Maximization (TIM), which maximizes mutual information between query features and label predictions, and demonstrates that it significantly outperforms state-of-the-art methods with 2% to 5% accuracy improvements across various datasets and challenging scenarios.

We introduce Transductive Infomation Maximization (TIM) for few-shot learning. Our method maximizes the mutual information between the query features and their label predictions for a given few-shot task, in conjunction with a supervision loss based on the support set. Furthermore, we propose a new alternating-direction solver for our mutual-information loss, which substantially speeds up transductive-inference convergence over gradient-based optimization, while yielding similar accuracy. TIM inference is modular: it can be used on top of any base-training feature extractor. Following standard transductive few-shot settings, our comprehensive experiments demonstrate that TIM outperforms state-of-the-art methods significantly across various datasets and networks, while used on top of a fixed feature extractor trained with simple cross-entropy on the base classes, without resorting to complex meta-learning schemes. It consistently brings between 2% and 5% improvement in accuracy over the best performing method, not only on all the well-established few-shot benchmarks but also on more challenging scenarios,with domain shifts and larger numbers of classes.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes