LLM-Driven Large-Scale Spectrum Access
For wireless network operators, this provides a scalable approach to spectrum access in ultra-dense environments, though the method is domain-specific and incremental over existing LLM-based optimization.
This work tackles the computational intractability of spectrum management in massive-scale wireless networks by proposing an LLM-driven framework (LSA) with a hierarchical state serialization mechanism. The framework achieves superior scaling laws and outperforms partitioned classical solvers in ultra-dense regimes under matched compute budgets.
Efficient spectrum management in massive-scale wireless networks is increasingly challenged by explosive action spaces and the computational intractability of traditional optimization. This study proposes a Large-Scale LLM-Driven Spectrum Access (LSA) framework rooted in Group Relative Policy Optimization (GRPO). To overcome the computational collapse caused by ultra-long prompts in large-scale scenarios, we develop a hierarchical state serialization mechanism that synthesizes global environment statistics with localized critical constraints, enabling the LLM to perform high-dimensional reasoning within a bounded context window. Simulation results under strictly time-bounded inference protocols reveal that the code-driven paradigm eliminates the SFT cold-start bottleneck and leverages direct execution feedback to achieve superior scaling laws. The framework maintains robust spectral utility and generalization across varying network scales, yielding consistent and empirically superior performance over non-deterministic heuristics, and surpassing partitioned classical solvers in ultra-dense regimes under matched compute budgets.