LGAICECLMar 30

OneComp: One-Line Revolution for Generative AI Model Compression

arXiv:2603.2884574.91 citationsh-index: 5Has Code
AI Analysis

For practitioners deploying large models, OneComp simplifies the complex process of model compression, making it reproducible and hardware-aware.

OneComp is an open-source framework that automates post-training compression of foundation models, converting a fragmented workflow into a reproducible pipeline that plans mixed-precision assignments and executes progressive quantization stages. It bridges the gap between algorithmic innovation and production deployment by ensuring deployable checkpoints at each stage.

Deploying foundation models is increasingly constrained by memory footprint, latency, and hardware costs. Post-training compression can mitigate these bottlenecks by reducing the precision of model parameters without significantly degrading performance; however, its practical implementation remains challenging as practitioners navigate a fragmented landscape of quantization algorithms, precision budgets, data-driven calibration strategies, and hardware-dependent execution regimes. We present OneComp, an open-source compression framework that transforms this expert workflow into a reproducible, resource-adaptive pipeline. Given a model identifier and available hardware, OneComp automatically inspects the model, plans mixed-precision assignments, and executes progressive quantization stages, ranging from layer-wise compression to block-wise refinement and global refinement. A key architectural choice is treating the first quantized checkpoint as a deployable pivot, ensuring that each subsequent stage improves the same model and that quality increases as more compute is invested. By converting state-of-the-art compression research into an extensible, open-source, hardware-aware pipeline, OneComp bridges the gap between algorithmic innovation and production-grade model deployment.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes