Wei Li

h-index21

3papers

38citations

Novelty45%

AI Score47

Ranked #33,337 of 194,257 authors (top 17%)#7,863 in LG (top 20%)

3 Papers

16.1SIMay 21

Fostering cultural change in research through innovative knowledge sharing, evaluation, and community engagement strategies

Junsuk Rho, Jinn-Kong Sheu, Andrew Forbes et al.

Scientific research needs a system that better values rigorous, reusable contributions. Although open knowledge and FAIR (findable, accessible, interoperable, and reusable) principles, along with coalitions and infrastructures, are accelerating reform, evaluation still often defaults to standardized metrics such as the h-index and journal impact factor. This misalignment still incentivizes quantity over quality, undermining integrity and reproducibility, and making it harder for communities to learn from and build on existing work. In this perspective, we bring together a global community of researchers, funding institutions, industrial partners, and publishers from 14 different countries across the 5 continents to advance ongoing debates on open science and research evaluation. Our contribution to the research practice is to offer an integrative conceptual framework, an open knowledge system, that links knowledge production, validation, assessment, and reuse into a single ecosystem view, and to translate into practical recommendations across key stakeholder roles (researchers, institutions/evaluators, funders, and publishers). By shifting attention from papers and bibliometrics toward reusable knowledge contributions and their validation, the framework highlights concrete levers for cultural change (what to share, when/how to validate, how to support reuse, and what to reward) and offers a practical lens that stakeholders can use to diagnose misaligned incentives and to design reforms that make high-quality, cumulative contributions visible and valued.

27.7LGFeb 24, 2025Code

Delta Decompression for MoE-based LLMs Compression

Hao Gu, Wei Li, Lujun Li et al.

Mixture-of-Experts (MoE) architectures in large language models (LLMs) achieve exceptional performance, but face prohibitive storage and memory requirements. To address these challenges, we present $D^2$-MoE, a new delta decompression compressor for reducing the parameters of MoE LLMs. Based on observations of expert diversity, we decompose their weights into a shared base weight and unique delta weights. Specifically, our method first merges each expert's weight into the base weight using the Fisher information matrix to capture shared components. Then, we compress delta weights through Singular Value Decomposition (SVD) by exploiting their low-rank properties. Finally, we introduce a semi-dynamical structured pruning strategy for the base weights, combining static and dynamic redundancy analysis to achieve further parameter reduction while maintaining input adaptivity. In this way, our $D^2$-MoE successfully compact MoE LLMs to high compression ratios without additional training. Extensive experiments highlight the superiority of our approach, with over 13% performance gains than other compressors on Mixtral|Phi-3.5|DeepSeek|Qwen2 MoE LLMs at 40$\sim$60% compression rates. Codes are available in https://github.com/lliai/D2MoE.

2.0LGMay 28, 2023

On the Value of Myopic Behavior in Policy Reuse

Kang Xu, Chenjia Bai, Shuang Qiu et al.

Leveraging learned strategies in unfamiliar scenarios is fundamental to human intelligence. In reinforcement learning, rationally reusing the policies acquired from other tasks or human experts is critical for tackling problems that are difficult to learn from scratch. In this work, we present a framework called Selective Myopic bEhavior Control~(SMEC), which results from the insight that the short-term behaviors of prior policies are sharable across tasks. By evaluating the behaviors of prior policies via a hybrid value function architecture, SMEC adaptively aggregates the sharable short-term behaviors of prior policies and the long-term behaviors of the task policy, leading to coordinated decisions. Empirical results on a collection of manipulation and locomotion tasks demonstrate that SMEC outperforms existing methods, and validate the ability of SMEC to leverage related prior policies.