CVLGJul 31, 2022

COCOA: Cross Modality Contrastive Learning for Sensor Data

arXiv:2208.00467v258 citationsh-index: 34
Originality Incremental advance
AI Analysis

This addresses the need for effective self-supervised learning in pervasive computing applications that use multiple sensor modalities, representing an incremental improvement over existing contrastive learning methods.

The paper tackles the problem of learning representations from multisensor data by proposing COCOA, a self-supervised model that uses a novel objective function to compute cross-correlation between modalities and minimize similarity between irrelevant instances, achieving superior classification performance and label-efficiency with only one-tenth of labeled data compared to supervised baselines.

Self-Supervised Learning (SSL) is a new paradigm for learning discriminative representations without labelled data and has reached comparable or even state-of-the-art results in comparison to supervised counterparts. Contrastive Learning (CL) is one of the most well-known approaches in SSL that attempts to learn general, informative representations of data. CL methods have been mostly developed for applications in computer vision and natural language processing where only a single sensor modality is used. A majority of pervasive computing applications, however, exploit data from a range of different sensor modalities. While existing CL methods are limited to learning from one or two data sources, we propose COCOA (Cross mOdality COntrastive leArning), a self-supervised model that employs a novel objective function to learn quality representations from multisensor data by computing the cross-correlation between different data modalities and minimizing the similarity between irrelevant instances. We evaluate the effectiveness of COCOA against eight recently introduced state-of-the-art self-supervised models, and two supervised baselines across five public datasets. We show that COCOA achieves superior classification performance to all other approaches. Also, COCOA is far more label-efficient than the other baselines including the fully supervised model using only one-tenth of available labelled data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes