AIMay 28

NaRA: Noise-Aware LoRA for Parameter-Efficient Fine-Tuning of Diffusion LLMs

Shuaidi Wang, Zhan Zhuang, Ruping Huang, Yu Zhang

arXiv:2605.2971694.3Has Code

Predicted impact top 13% in AI · last 90 daysOriginality Incremental advance

AI Analysis

For practitioners fine-tuning diffusion LLMs, NaRA addresses the overlooked noise-level dynamics, offering a more effective PEFT method.

NaRA introduces noise-aware LoRA for parameter-efficient fine-tuning of diffusion LLMs, achieving consistent improvements over noise-agnostic baselines on commonsense reasoning, mathematical reasoning, and code generation benchmarks.

Diffusion Large Language Models (dLLMs) have emerged as a promising non-autoregressive generative paradigm. Given the prohibitive computational cost of full fine-tuning, Parameter-Efficient Fine-Tuning (PEFT) has become the standard approach. However, existing PEFT methods (e.g., LoRA), originally tailored for autoregressive models, rely on static parameters that are agnostic to the noise level. Consequently, they ignore the intrinsic dynamics of the diffusion process, where input distributions and generation difficulty shift significantly along the denoising trajectory, rendering them suboptimal for dLLMs. To address this, we propose Noise-aware Low-Rank Adaptation (NaRA), which introduces a low-rank core matrix generated by a lightweight, globally shared hypernetwork conditioned on the noise level. This design enables the update matrices to vary continuously along the diffusion process while keeping parameter and latency overhead negligible. We provide a theoretical justification for the proposed NaRA framework and empirically demonstrate consistent improvements over noise-agnostic baselines across commonsense reasoning, mathematical reasoning, and code generation benchmarks. Our code is available at https://github.com/generaldi/NaRA.

View on arXiv PDF Code

Similar