LGNov 7, 2024

The Fibonacci Network: A Simple Alternative for Positional Encoding

arXiv:2411.05052v1h-index: 1
Originality Incremental advance
AI Analysis

This work addresses a specific issue in neural network design for signal reconstruction, offering an incremental improvement by replacing positional encoding with a novel block-based architecture.

The paper tackles the problem of high-frequency reconstruction in coordinate-based MLPs by proposing the Fibonacci Network, a simple architecture that eliminates the need for positional encoding and achieves similar results without high-frequency artifacts or additional hyperparameters.

Coordinate-based Multi-Layer Perceptrons (MLPs) are known to have difficulty reconstructing high frequencies of the training data. A common solution to this problem is Positional Encoding (PE), which has become quite popular. However, PE has drawbacks. It has high-frequency artifacts and adds another hyper-hyperparameter, just like batch normalization and dropout do. We believe that under certain circumstances PE is not necessary, and a smarter construction of the network architecture together with a smart training method is sufficient to achieve similar results. In this paper, we show that very simple MLPs can quite easily output a frequency when given input of the half-frequency and quarter-frequency. Using this, we design a network architecture in blocks, where the input to each block is the output of the two previous blocks along with the original input. We call this a {\it Fibonacci Network}. By training each block on the corresponding frequencies of the signal, we show that Fibonacci Networks can reconstruct arbitrarily high frequencies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes