LGOct 1, 2025

The Transformer Cookbook

AI2ETH Zurich
arXiv:2510.00368v15 citationsh-index: 11
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of accessibility and systematization for both newcomers and experts in transformer algorithm encoding, but it is incremental as it synthesizes existing techniques rather than introducing new methods.

The authors tackled the problem of the steep learning curve and fragmented literature in encoding algorithms into transformer parameters by synthesizing disparate findings into a curated set of recipes, resulting in a unified presentation that provides a foundation for future work in computational complexity and architecture design.

We present the transformer cookbook: a collection of techniques for directly encoding algorithms into a transformer's parameters. This work addresses the steep learning curve of such endeavors, a problem exacerbated by a fragmented literature where key results are scattered across numerous papers. In particular, we synthesize this disparate body of findings into a curated set of recipes that demonstrate how to implement everything from basic arithmetic in feed-forward layers to complex data routing via self-attention. Our mise en place of formulations is for both newcomers seeking an accessible entry point and experts in need of a systematic reference. This unified presentation of transformer constructions provides a foundation for future work spanning theoretical research in computational complexity to empirical investigations in architecture design and interpretability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes