LGNov 11, 2023

From Charts to Atlas: Merging Latent Spaces into One

Donato Crisostomi, Irene Cannistraci, Luca Moschella, Pietro Barbiero, Marco Ciccone, Pietro Liò, Emanuele Rodolà

ETH Zurich

arXiv:2311.06547v18.85 citationsh-index: 19Has Code

Originality Incremental advance

AI Analysis

This addresses the challenge of integrating multiple learned representations for researchers in machine learning, but it is incremental as it builds on existing latent space methods.

The paper tackled the problem of merging latent spaces from models trained on related datasets and tasks, introducing Relative Latent Space Aggregation to create a unified space; the result showed that the aggregated space is similar to an end-to-end model's space and better for classification, with diminished benefits in scenarios without shared regions.

Models trained on semantically related datasets and tasks exhibit comparable inter-sample relations within their latent spaces. We investigate in this study the aggregation of such latent spaces to create a unified space encompassing the combined information. To this end, we introduce Relative Latent Space Aggregation, a two-step approach that first renders the spaces comparable using relative representations, and then aggregates them via a simple mean. We carefully divide a classification problem into a series of learning tasks under three different settings: sharing samples, classes, or neither. We then train a model on each task and aggregate the resulting latent spaces. We compare the aggregated space with that derived from an end-to-end model trained over all tasks and show that the two spaces are similar. We then observe that the aggregated space is better suited for classification, and empirically demonstrate that it is due to the unique imprints left by task-specific embedders within the representations. We finally test our framework in scenarios where no shared region exists and show that it can still be used to merge the spaces, albeit with diminished benefits over naive merging.

View on arXiv PDF Code

Similar