LG AI CV GN MLSep 23, 2020

Fuzzy Simplicial Networks: A Topology-Inspired Model to Improve Task Generalization in Few-shot Learning

Henry Kvinge, Zachary New, Nico Courts, Jung H. Lee, Lauren A. Phillips, Courtney D. Corley, Aaron Tuor, Andrew Avila, Nathan O. Hodas

arXiv:2009.11253v11.2

Originality Incremental advance

AI Analysis

This addresses the challenge of task generalization in few-shot learning for computer vision, though it appears incremental as it builds on existing metric-based models.

The paper tackles the problem of few-shot learning models failing to generalize to fundamentally different tasks within a fixed dataset, such as moving from category membership to detecting object orientation or quantity. It introduces Fuzzy Simplicial Networks (FSN), which leverages topology to flexibly represent classes from limited data, and shows that FSN outperforms state-of-the-art models on new challenging tasks while remaining competitive on standard benchmarks.

Deep learning has shown great success in settings with massive amounts of data but has struggled when data is limited. Few-shot learning algorithms, which seek to address this limitation, are designed to generalize well to new tasks with limited data. Typically, models are evaluated on unseen classes and datasets that are defined by the same fundamental task as they are trained for (e.g. category membership). One can also ask how well a model can generalize to fundamentally different tasks within a fixed dataset (for example: moving from category membership to tasks that involve detecting object orientation or quantity). To formalize this kind of shift we define a notion of "independence of tasks" and identify three new sets of labels for established computer vision datasets that test a model's ability to generalize to tasks which draw on orthogonal attributes in the data. We use these datasets to investigate the failure modes of metric-based few-shot models. Based on our findings, we introduce a new few-shot model called Fuzzy Simplicial Networks (FSN) which leverages a construction from topology to more flexibly represent each class from limited data. In particular, FSN models can not only form multiple representations for a given class but can also begin to capture the low-dimensional structure which characterizes class manifolds in the encoded space of deep networks. We show that FSN outperforms state-of-the-art models on the challenging tasks we introduce in this paper while remaining competitive on standard few-shot benchmarks.

View on arXiv PDF

Similar