56.0NIJun 2
Throughput Optimization for Multi-AP IEEE P802.11bq Networks Based on Combinatorial Multi-Armed BanditsAnshan Yuan, Mingqi Han, Xinghua Sun
This paper addresses distributed throughput optimization for dense multi-AP IEEE P802.11bq networks. We develop a packet-level model that jointly captures cross-link carrier-sense multiple access with collision avoidance (CSMA/CA), sub-7GHz RTS/CTS exchange, beam-training overhead, directional mmWave interference, signal-to-interference-plus-noise-ratio (SINR)-based MCS selection, and retransmissions. The resulting configuration problem is formulated as a multi-group combinatorial multi-armed bandit (CMAB), where each AP selects its contention window, clear-channel assessment threshold, beamwidth, and MCS reservation margin from finite candidate sets. Inspired by combinatorial successive accept-reject methods, we propose a group-wise feasible CSAR variant that uses Hadamard-guided feasible exploration to estimate empirical ranking scores and eliminate low-performing candidates within each parameter group. Simulations show that the proposed scheme improves aggregate and per-AP throughput over the considered Thompson-sampling baseline across most AP densities and reduces throughput stabilization time by approximately 49$\%$ under the evaluated settings. The learned configurations reveal that high throughput requires a balance among control-channel aggressiveness, mmWave spatial reuse, beam-training cost, and MCS robustness, rather than simply minimizing collisions or maximizing the PHY rate.
42.6DCApr 23
A Task Decomposition and Planning Framework for Efficient LLM Inference in AI-Enabled WiFi-Offload NetworksMingqi Han, Xinghua Sun
AI WiFi offload is emerging as a promising approach for providing large language model (LLM) services to resource-constrained wireless devices. However, unlike conventional edge computing, LLM inference over WiFi must jointly address heterogeneous model capabilities, wireless contention, uncertain task complexity, and semantic correlation among reasoning tasks. In this paper, we investigate LLM inference offloading in a multi-user multi-edge WiFi network, where each task can be executed locally, directly offloaded to a nearby edge access point (AP), or decomposed into multiple subtasks for collaborative execution across local and edge nodes. To this end, we propose a user-edge collaborative framework with an LLM-based planner that not only performs task decomposition but also infers subtask difficulty and expected output token length, enabling more accurate estimation of execution quality and latency on heterogeneous nodes. Based on these estimates, we further design a decomposition-aware scheduling strategy that jointly optimizes subtask assignment, execution, and aggregation under communication, queuing, and computation constraints. Simulation results show that the proposed framework achieves a better latency-accuracy tradeoff than local-only and nearest-edge baselines, reducing the average latency by $20\%$ and improving the overall reward by $80\%$. Moreover, the distilled lightweight planner approaches the performance of the large teacher model while remaining more suitable for practical edge deployment.