SEAIJun 1, 2025

Legal Compliance Evaluation of Smart Contracts Generated By Large Language Models

arXiv:2506.00943v12 citationsh-index: 6ICBC
Originality Incremental advance
AI Analysis

This addresses the challenge of legal compliance in smart contract development for legal and software professionals, though it is incremental as it builds on existing LLM capabilities.

The paper tackled the problem of generating legally compliant smart contracts from natural language legal contracts using Large Language Models (LLMs), finding that while all LLMs produced syntactically correct code, larger models generally achieved higher compliance levels with significant variance.

Smart contracts can implement and automate parts of legal contracts, but ensuring their legal compliance remains challenging. Existing approaches such as formal specification, verification, and model-based development require expertise in both legal and software development domains, as well as extensive manual effort. Given the recent advances of Large Language Models (LLMs) in code generation, we investigate their ability to generate legally compliant smart contracts directly from natural language legal contracts, addressing these challenges. We propose a novel suite of metrics to quantify legal compliance based on modeling both legal and smart contracts as processes and comparing their behaviors. We select four LLMs, generate 20 smart contracts based on five legal contracts, and analyze their legal compliance. We find that while all LLMs generate syntactically correct code, there is significant variance in their legal compliance with larger models generally showing higher levels of compliance. We also evaluate the proposed metrics against properties of software metrics, showing they provide fine-grained distinctions, enable nuanced comparisons, and are applicable across domains for code from any source, LLM or developer. Our results suggest that LLMs can assist in generating starter code for legally compliant smart contracts with strict reviews, and the proposed metrics provide a foundation for automated and self-improving development workflows.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes