LOAIARDec 7, 2025

Formal that "Floats" High: Formal Verification of Floating Point Arithmetic

arXiv:2512.06850v1h-index: 4ICM
Originality Incremental advance
AI Analysis

This addresses verification challenges for hardware designers working with floating-point arithmetic, representing an incremental improvement through hybrid methods.

The paper tackles the challenge of formally verifying floating-point arithmetic by proposing a scalable methodology using direct RTL-to-RTL model checking with a divide-and-conquer strategy and AI-assisted property generation. Results show this approach achieves higher coverage efficiency and requires fewer assertions than standalone verification, particularly when combining AI-generated properties with human refinement.

Formal verification of floating-point arithmetic remains challenging due to non-linear arithmetic behavior and the tight coupling between control and datapath logic. Existing approaches often rely on high-level C models for equivalence checking against Register Transfer Level (RTL) designs, but this introduces abstraction gaps, translation overhead, and limits scalability at the RTL level. To address these challenges, this paper presents a scalable methodology for verifying floating-point arithmetic using direct RTL-to-RTL model checking against a golden reference model. The approach adopts a divide-and conquer strategy that decomposes verification into modular stages, each captured by helper assertions and lemmas that collectively prove a main correctness theorem. Counterexample (CEX)-guided refinement is used to iteratively localize and resolve implementation defects, while targeted fault injection validates the robustness of the verification process against precision-critical datapath errors. To assess scalability and practicality, the methodology is extended with agentic AI-based formal property generation, integrating large language model (LLM)-driven automation with Human-in-the-Loop (HITL) refinement. Coverage analysis evaluates the effectiveness of the approach by comparing handwritten and AI-generated properties in both RTL-to-RTL model checking and standalone RTL verification settings. Results show that direct RTL-to-RTL model checking achieves higher coverage efficiency and requires fewer assertions than standalone verification, especially when combined with AI-generated properties refined through HITL guidance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes