CVAILGNov 2, 2022

Attention-based Neural Cellular Automata

arXiv:2211.01233v131 citationsh-index: 54
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving model capabilities in image processing tasks for researchers and practitioners, representing an incremental advancement by integrating attention mechanisms into NCAs.

The paper tackles the problem of enhancing Neural Cellular Automata (NCA) by introducing an attention-based approach, specifically Vision Transformer Cellular Automata (ViTCA), which achieves superior performance in denoising autoencoding across six benchmark datasets compared to baselines like U-Net and Vision Transformer when configured with similar parameter complexity.

Recent extensions of Cellular Automata (CA) have incorporated key ideas from modern deep learning, dramatically extending their capabilities and catalyzing a new family of Neural Cellular Automata (NCA) techniques. Inspired by Transformer-based architectures, our work presents a new class of $\textit{attention-based}$ NCAs formed using a spatially localized$\unicode{x2014}$yet globally organized$\unicode{x2014}$self-attention scheme. We introduce an instance of this class named $\textit{Vision Transformer Cellular Automata}$ (ViTCA). We present quantitative and qualitative results on denoising autoencoding across six benchmark datasets, comparing ViTCA to a U-Net, a U-Net-based CA baseline (UNetCA), and a Vision Transformer (ViT). When comparing across architectures configured to similar parameter complexity, ViTCA architectures yield superior performance across all benchmarks and for nearly every evaluation metric. We present an ablation study on various architectural configurations of ViTCA, an analysis of its effect on cell states, and an investigation on its inductive biases. Finally, we examine its learned representations via linear probes on its converged cell state hidden representations, yielding, on average, superior results when compared to our U-Net, ViT, and UNetCA baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes