PL · May 9

Quantitative Comparison of Credible Compilation and Verification In Coding Agent Compiler Development

arXiv:2605.08927 · 9.9
Predicted impact: top 17% in PL · last 90 days
Originality: Incremental advance
AI Analysis

For compiler developers and verification researchers, this provides concrete evidence of the trade-offs between credible compilation and full verification in an agent-assisted setting.

The paper presents the first quantitative comparison of credible compilation and full verification in compiler development, using a coding agent. The results show that verification requires roughly 10x more development effort and leads the agent to choose less efficient algorithms, and that for credible compilation, certificate checking time dominates optimization and certificate generation time.

Formal program verification is a longstanding goal in the field. We present the first quantitative comparison of the two primary compiler verification approaches, credible compilation/translation validation and full verification. Working with the first verified compiler developed by a coding agent (operating under human supervision), we present quantitative results from a coding agent implementing several optimizations using these two approaches. The results indicate that 1) verification requires roughly an order of magnitude more development effort than credible compilation, 2) to enhance provability, the coding agent chooses less efficient algorithms and data structures for verified optimizations, 3) in an attempt to minimize proof effort, the coding agent repeatedly implemented optimization scope reductions for verified optimizations, and 4) certificate checking time dominates optimization and certificate generation time for the considered optimizations. Because of the increased proof overhead, verified optimizations required substantially more supervision and coding sessions than credible compilation optimizations. Given the capabilities of a modern coding agent working in this context, implementation efforts for both credible compilation and verified versions remained feasible for the considered optimizations (unreachable code elimination, dead assignment elimination, and constant propagation/folding).
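
To make the distinction between the two approaches concrete, here is a minimal sketch of credible compilation for unreachable code elimination. It is an illustration only and is not taken from the paper: the CFG representation, the function names `eliminate_unreachable` and `check_certificate`, and the choice of certificate (the set of retained block labels) are all assumptions. The optimizer is untrusted and emits a certificate with each run; a small trusted checker accepts the result only if the certificate justifies it.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical CFG encoding: label -> (instructions, successor labels).
CFG = Dict[str, Tuple[List[str], List[str]]]


def eliminate_unreachable(cfg: CFG, entry: str) -> Tuple[CFG, Set[str]]:
    """Untrusted optimizer: prune unreachable blocks and emit a certificate.

    The certificate here is simply the set of labels the optimizer claims
    are reachable (an assumption made for this sketch).
    """
    reachable: Set[str] = set()
    worklist = [entry]
    while worklist:
        label = worklist.pop()
        if label in reachable:
            continue
        reachable.add(label)
        worklist.extend(cfg[label][1])
    pruned = {label: cfg[label] for label in cfg if label in reachable}
    return pruned, reachable


def check_certificate(original: CFG, optimized: CFG,
                      entry: str, kept: Set[str]) -> bool:
    """Trusted checker: validate one optimizer run without redoing the search.

    If `kept` contains the entry and is closed under successors, every
    reachable block is in `kept`, so dropping the other blocks is safe.
    """
    if entry not in kept or not all(label in original for label in kept):
        return False
    for label in kept:
        if any(succ not in kept for succ in original[label][1]):
            return False
    # The output must be exactly the original restricted to `kept`.
    return optimized == {label: original[label] for label in kept}


cfg: CFG = {
    "entry": (["x = 1"], ["exit"]),
    "dead": (["x = 2"], ["exit"]),
    "exit": (["ret x"], []),
}
optimized, certificate = eliminate_unreachable(cfg, "entry")
accepted = check_certificate(cfg, optimized, "entry", certificate)
```

Under this scheme only `check_certificate` needs to be trusted, and it runs on every compilation. Full verification would instead prove once, for all inputs, that `eliminate_unreachable` preserves program behavior, so no per-run check is needed.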
