83.3ITApr 15
Scalable Design for RIS-Assisted Multi-User Downlink System Empowered by RSMA under Partial CSIYifan Fang, Bile Peng, Yingyang Chen et al.
In large-scale reconfigurable intelligent surface (RIS) communication systems, the precise acquisition of channel state information (CSI) is challenging. Consider a practical RIS configuration where only a few reflective elements serve as anchors to estimate CSI, which are termed partial CSI. To improve the robustness against partial CSI and the scalability of RIS networks, this paper proposes an unsupervised learning-based rate-splitting multiple access (RSMA) scheme for RIS-assisted multi-user systems. Specifically, RISnet, a neural network architecture designed to infer full CSI from partial observations, is employed and integrated with a low-complexity RSMA precoder. Effective channel features are constituted from partial CSI, and the original elements with channel information contribute to new anchors after expansion in RISnet. Numerical results demonstrate that the proposed scheme approximates the performance with a full CSI of RIS under deterministic raytracing channel conditions. When channel uncertainty increases during training, RSMA has been shown to enhance RISnet robustness, significantly mitigating performance loss.
CVDec 6, 2024
Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language ModelsZehao Wang, Xinpeng Liu, Xiaoqian Wu et al.
Multimodal Large Language Models (MLLMs) have garnered significant attention recently and demonstrate outstanding capabilities in various tasks such as OCR, VQA, captioning, $\textit{etc}$. However, hallucination remains a persistent issue. While numerous methods have been proposed to mitigate hallucinations, achieving notable improvements, these methods primarily focus on mitigating hallucinations about $\textbf{object/noun-related}$ concepts. Verb concepts, crucial for understanding human actions, have been largely overlooked. In this paper, to the best of our knowledge, we are the $\textbf{first}$ to investigate the $\textbf{verb hallucination}$ phenomenon of MLLMs from various perspectives. Our findings reveal that most state-of-the-art MLLMs suffer from severe verb hallucination. To assess the effectiveness of existing mitigation methods for object concept hallucination on verb hallucination, we evaluated these methods and found that they do not effectively address verb hallucination. To address this issue, we propose a novel rich verb knowledge-based tuning method to mitigate verb hallucination. The experiment results demonstrate that our method significantly reduces hallucinations related to verbs.