cs.DC · Computer Science

Distributed Computing

Distributed systems, parallel computing, cloud

99.2 · LG · May 29
PithTrain: A Compact and Agent-Native MoE Training System

Ruihang Lai, Hao Kang, Haozhan Tang et al.

This work addresses how costly and complex it is for AI coding agents to understand and extend existing MoE training frameworks, offering framework engineers and researchers a more efficient development path.

98.6 · DC · May 24 · Code
Efficient Distributed MLLM Training with Cornstarch

Insu Jang, Runyu Lu, Nikhil Bansal et al.

For researchers and engineers training large multimodal models, Cornstarch provides a more efficient distributed training approach tailored to the heterogeneity of MLLMs.