ARAIJul 11, 2024

Natural language is not enough: Benchmarking multi-modal generative AI for Verilog generation

arXiv:2407.08473v117 citationsh-index: 30Has Code
AI Analysis

This addresses the need for more effective hardware design automation, particularly for complex spatial architectures, though it is incremental as it builds on existing generative AI methods.

The paper tackled the problem of generating Verilog code from high-level specifications by showing that multi-modal inputs combining visual and natural language outperform natural-language-only approaches, achieving a significant accuracy improvement.

Natural language interfaces have exhibited considerable potential in the automation of Verilog generation derived from high-level specifications through the utilization of large language models, garnering significant attention. Nevertheless, this paper elucidates that visual representations contribute essential contextual information critical to design intent for hardware architectures possessing spatial complexity, potentially surpassing the efficacy of natural-language-only inputs. Expanding upon this premise, our paper introduces an open-source benchmark for multi-modal generative models tailored for Verilog synthesis from visual-linguistic inputs, addressing both singular and complex modules. Additionally, we introduce an open-source visual and natural language Verilog query language framework to facilitate efficient and user-friendly multi-modal queries. To evaluate the performance of the proposed multi-modal hardware generative AI in Verilog generation tasks, we compare it with a popular method that relies solely on natural language. Our results demonstrate a significant accuracy improvement in the multi-modal generated Verilog compared to queries based solely on natural language. We hope to reveal a new approach to hardware design in the large-hardware-design-model era, thereby fostering a more diversified and productive approach to hardware design.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes