LG AI QUANT-PHJun 10, 2025

MetaTT: A Global Tensor-Train Adapter for Parameter-Efficient Fine-Tuning

Javier Lopez-Piqueres, Pranav Deshpande, Archan Ray, Mattia J. Villani, Marco Pistoia, Niraj Kumar

arXiv:2506.09105v213.06 citationsh-index: 6

Originality Incremental advance

AI Analysis

This work addresses the need for more compact and flexible fine-tuning methods in machine learning, though it appears incremental as it builds on existing tensor decomposition techniques.

The authors tackled the problem of parameter-efficient fine-tuning of pre-trained transformers by introducing MetaTT, a Tensor Train adapter framework that factorizes transformer sub-modules into a single shared tensor, achieving competitive parameter efficiency and accuracy tradeoffs on single-task and multi-task benchmarks.

We present MetaTT, a Tensor Train (TT) adapter framework for fine-tuning of pre-trained transformers. MetaTT enables flexible and parameter-efficient model adaptation by using a single shared TT to factorize transformer sub-modules. This factorization indexes key structural dimensions, including layer and matrix type, and can optionally incorporate heads and tasks. This design allows MetaTT's parameter count to scale with the sum, rather than the product, of the modes, resulting in a substantially more compact adapter. Our benchmarks compare MetaTT with LoRA along with recent state-of-the-art matrix and tensor decomposition based fine-tuning methods. We observe that when tested on single-task standard language modeling benchmarks, MetaTT achieves competitive parameter efficiency to accuracy tradeoff. We further demonstrate that MetaTT performs competitively when compared to state-of-the-art methods on multi-task learning. Finally, we leverage the TT-ansatz to design a rank adaptive optimizer inspired by the DMRG method from many-body physics. Our results demonstrate that integrating this approach with AdamW enhances optimization performance for a specified target rank.

View on arXiv PDF

Similar