SEAIPLApr 22, 2024

Assessing GPT-4-Vision's Capabilities in UML-Based Code Generation

arXiv:2404.14370v112 citationsh-index: 82024 IEEE/ACM International Workshop on Large Language Models for Code (LLM4Code)
Originality Synthesis-oriented
AI Analysis

This addresses software developers seeking automated code generation tools, though it's an incremental evaluation of an existing model on new data.

This paper evaluated GPT-4-Vision's ability to generate Java code from UML class diagrams, finding it successfully transformed 88% of diagram elements on average, with strong performance on single-class diagrams but weaker results on multi-class ones.

The emergence of advanced neural networks has opened up new ways in automated code generation from conceptual models, promising to enhance software development processes. This paper presents a preliminary evaluation of GPT-4-Vision, a state-of-the-art deep learning model, and its capabilities in transforming Unified Modeling Language (UML) class diagrams into fully operating Java class files. In our study, we used exported images of 18 class diagrams comprising 10 single-class and 8 multi-class diagrams. We used 3 different prompts for each input, and we manually evaluated the results. We created a scoring system in which we scored the occurrence of elements found in the diagram within the source code. On average, the model was able to generate source code for 88% of the elements shown in the diagrams. Our results indicate that GPT-4-Vision exhibits proficiency in handling single-class UML diagrams, successfully transforming them into syntactically correct class files. However, for multi-class UML diagrams, the model's performance is weaker compared to single-class diagrams. In summary, further investigations are necessary to exploit the model's potential completely.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes