CLMar 15

Parameter-Efficient Quality Estimation via Frozen Recursive Models

Umar Abubacar, Roman Bauer, Diptesh Kanojia

arXiv:2603.1459376.72 citationsh-index: 18Has Code

AI Analysis

This work addresses parameter efficiency in Quality Estimation for low-resource languages, though it is incremental as it builds on existing recursive and embedding methods.

The paper tackled the problem of transferring recursive mechanisms from Tiny Recursive Models to Quality Estimation for low-resource languages, finding that these mechanisms do not transfer effectively, but using frozen pretrained embeddings matches fine-tuned performance while reducing trainable parameters by 37 times, achieving a Spearman's correlation of 0.370.

Tiny Recursive Models (TRM) achieve strong results on reasoning tasks through iterative refinement of a shared network. We investigate whether these recursive mechanisms transfer to Quality Estimation (QE) for low-resource languages using a three-phase methodology. Experiments on $8$ language pairs on a low-resource QE dataset reveal three findings. First, TRM's recursive mechanisms do not transfer to QE. External iteration hurts performance, and internal recursion offers only narrow benefits. Next, representation quality dominates architectural choices, and lastly, frozen pretrained embeddings match fine-tuned performance while reducing trainable parameters by 37$\times$ (7M vs 262M). TRM-QE with frozen XLM-R embeddings achieves a Spearman's correlation of 0.370, matching fine-tuned variants (0.369) and outperforming an equivalent-depth standard transformer (0.336). On Hindi and Tamil, frozen TRM-QE outperforms MonoTransQuest (560M parameters) with 80$\times$ fewer trainable parameters, suggesting that weight sharing combined with frozen embeddings enables parameter efficiency for QE. We release the code publicly for further research. Code is available at https://github.com/surrey-nlp/TRMQE.

View on arXiv PDF Code

Similar