LG AIApr 10

Transformation Categorization Based on Group Decomposition Theory Using Parameter Division

Takayuki Komatsu, Yoshiyuki Ohmura, Yasuo Kuniyoshi

arXiv:2605.0405623.31 citationsh-index: 16

AI Analysis

This work provides a principled algebraic framework for learning disentangled representations of coupled transformations, addressing a known bottleneck in unsupervised representation learning.

The authors propose a method for unsupervised categorization of transformations between image pairs using group decomposition theory, eliminating prior auxiliary assumptions. Their approach achieves correct categorization on rotation, translation, and scale transformations, with ablations confirming that group-decomposition constraints are responsible for the performance.

Representation learning seeks meaningful sensory representations without supervision and can model aspects of human development. Although many neural networks empirically learn useful features, a principled account of what makes a representation "good" remains elusive. We study unsupervised categorization of transformations between pairs of inputs under algebraic constraints. Classical disentanglement favors mutually independent factors and fails when factors are coupled. Our prior Galois-theoretic approach decomposes a group via normal subgroups by learning a product of two transformations with one factor constrained to a normal subgroup, covering both commutative and non-commutative cases. That method, however, relied on auxiliary assumptions (e.g., motion and isometry restrictions) not required by decomposition theory, and ablations did not separate theory-based from auxiliary effects. We propose parameter division for a single transformation: we split its parameter into components, impose homomorphism constraints mapping the full transformation to one component, and identify the normal subgroup as the set of transformations when that component is fixed to the identity. This formulation drops the previous auxiliary assumptions and applies more broadly. We evaluate on image pairs involving rotation, translation, and scale; ablations show that group-decomposition constraints drive appropriate categorization.

View on arXiv PDF

Similar