ARLGDec 10, 2024

MAGE: A Multi-Agent Engine for Automated RTL Code Generation

arXiv:2412.07822v126 citationsh-index: 10Has Code
Originality Highly original
AI Analysis

This addresses the problem of reliable automated RTL code generation for hardware design engineers, representing a strong specific gain rather than an incremental improvement.

The paper tackles the challenge of generating syntactically and functionally correct RTL code from natural language instructions by introducing MAGE, a multi-agent AI system that achieves a 95.7% correctness rate on the VerilogEval-Human 2 benchmark, surpassing the state-of-the-art by 23.3%.

The automatic generation of RTL code (e.g., Verilog) through natural language instructions has emerged as a promising direction with the advancement of large language models (LLMs). However, producing RTL code that is both syntactically and functionally correct remains a significant challenge. Existing single-LLM-agent approaches face substantial limitations because they must navigate between various programming languages and handle intricate generation, verification, and modification tasks. To address these challenges, this paper introduces MAGE, the first open-source multi-agent AI system designed for robust and accurate Verilog RTL code generation. We propose a novel high-temperature RTL candidate sampling and debugging system that effectively explores the space of code candidates and significantly improves the quality of the candidates. Furthermore, we design a novel Verilog-state checkpoint checking mechanism that enables early detection of functional errors and delivers precise feedback for targeted fixes, significantly enhancing the functional correctness of the generated RTL code. MAGE achieves a 95.7% rate of syntactic and functional correctness code generation on VerilogEval-Human 2 benchmark, surpassing the state-of-the-art Claude-3.5-sonnet by 23.3 %, demonstrating a robust and reliable approach for AI-driven RTL design workflows.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes