CLIRAug 24, 2025

Routing Distilled Knowledge via Mixture of LoRA Experts for Large Language Model based Bundle Generation

arXiv:2508.17250v12 citationsh-index: 9
Originality Incremental advance
AI Analysis

This addresses computational cost and performance issues in bundle generation for LLM applications, representing an incremental improvement over existing methods.

The paper tackles the problem of knowledge conflict when integrating diverse distilled knowledge from teacher LLMs into student LLMs for bundle generation, proposing RouteDK, which achieves accuracy comparable to or better than the teacher LLM with strong computational efficiency on three datasets.

Large Language Models (LLMs) have shown potential in automatic bundle generation but suffer from prohibitive computational costs. Although knowledge distillation offers a pathway to more efficient student models, our preliminary study reveals that naively integrating diverse types of distilled knowledge from teacher LLMs into student LLMs leads to knowledge conflict, negatively impacting the performance of bundle generation. To address this, we propose RouteDK, a framework for routing distilled knowledge through a mixture of LoRA expert architecture. Specifically, we first distill knowledge from the teacher LLM for bundle generation in two complementary types: high-level knowledge (generalizable rules) and fine-grained knowledge (session-specific reasoning). We then train knowledge-specific LoRA experts for each type of knowledge together with a base LoRA expert. For effective integration, we propose a dynamic fusion module, featuring an input-aware router, where the router balances expert contributions by dynamically determining optimal weights based on input, thereby effectively mitigating knowledge conflicts. To further improve inference reliability, we design an inference-time enhancement module to reduce variance and mitigate suboptimal reasoning. Experiments on three public datasets show that our RouteDK achieves accuracy comparable to or even better than the teacher LLM, while maintaining strong computational efficiency. In addition, it outperforms state-of-the-art approaches for bundle generation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes