62.5GTMar 21
Mechanism for Collaborative Federated Learning: Pitfalls of Shapley ValuesMeng Qi, Mingxi Zhu
This paper investigates the impact of mechanism design on collaborative learning systems enabled by federated learning (FL). We propose a multi-action collaborative federated learning (MCFL) framework, capturing the interplay between agent strategies, platform mechanisms, and FL algorithms--a "three-body problem" in collaborative learning. This work demonstrates how the convergence rate and computational efficiency of FL are endogenously determined by the agent participation equilibrium that is induced by the mechanism. By doing so, we establish a direct link between incentive design in collaborative learning systems and the performance of the underlying optimization algorithms, a connection that has been largely overlooked in the existing literature. Specifically, we characterize the equilibrium of agent participation under two prominent mechanisms: the Shapley Value (SV) and Marginal Contribution (MC) mechanisms. Although SV is fair in surplus allocation and budget balanced, it has a vital pitfall: agents are incentivized to split their data across newly created fake identities. This is critical especially in the MCFL setting as it leads to slow convergence of FL optimization, which increases the number of required synchronization/communication rounds even when the per-round cost is fixed. In contrast, while MC is not budget-balanced, it is robust to such strategic manipulation and is able to induce an equilibrium that maximizes the MCFL system efficiency. Overall, our study lays a foundation for jointly designing incentives and algorithms in MCFL systems. We provide insights on pitfalls of SV: it induces a system equilibrium that leads to tremendous training cost and slower convergence, ultimately undermining the effectiveness of collaborative learning.
IVMay 28, 2025
Subspecialty-Specific Foundation Model for Intelligent Gastrointestinal PathologyLianghui Zhu, Xitong Ling, Minxi Ouyang et al.
Gastrointestinal (GI) diseases represent a clinically significant burden, necessitating precise diagnostic approaches to optimize patient outcomes. Conventional histopathological diagnosis suffers from limited reproducibility and diagnostic variability. To overcome these limitations, we develop Digepath, a specialized foundation model for GI pathology. Our framework introduces a dual-phase iterative optimization strategy combining pretraining with fine-screening, specifically designed to address the detection of sparsely distributed lesion areas in whole-slide images. Digepath is pretrained on over 353 million multi-scale images from 210,043 H&E-stained slides of GI diseases. It attains state-of-the-art performance on 33 out of 34 tasks related to GI pathology, including pathological diagnosis, protein expression status prediction, gene mutation prediction, and prognosis evaluation. We further translate the intelligent screening module for early GI cancer and achieve near-perfect 99.70% sensitivity across nine independent medical institutions. This work not only advances AI-driven precision pathology for GI diseases but also bridge critical gaps in histopathological practice.
OCJul 3, 2019
On a Randomized Multi-Block ADMM for Solving Selected Machine Learning ProblemsMingxi Zhu, Kresimir Mihic, Yinyu Ye
The Alternating Direction Method of Multipliers (ADMM) has now days gained tremendous attentions for solving large-scale machine learning and signal processing problems due to the relative simplicity. However, the two-block structure of the classical ADMM still limits the size of the real problems being solved. When one forces a more-than-two-block structure by variable-splitting, the convergence speed slows down greatly as observed in practice. Recently, a randomly assembled cyclic multi-block ADMM (RAC-MBADMM) was developed by the authors for solving general convex and nonconvex quadratic optimization problems where the number of blocks can go greater than two so that each sub-problem has a smaller size and can be solved much more efficiently. In this paper, we apply this method to solving few selected machine learning problems related to convex quadratic optimization, such as Linear Regression, LASSO, Elastic-Net, and SVM. We prove that the algorithm would converge in expectation linearly under the standard statistical data assumptions. We use our general-purpose solver to conduct multiple numerical tests, solving both synthetic and large-scale bench-mark problems. Our results show that RAC-MBADMM could significantly outperform, in both solution time and quality, other optimization algorithms/codes for solving these machine learning problems, and match up the performance of the best tailored methods such as Glmnet or LIBSVM. In certain problem regions RAC-MBADMM even achieves a superior performance than that of the tailored methods.