CVAISEApr 9, 2024

WebCode2M: A Real-World Dataset for Code Generation from Webpage Designs

arXiv:2404.06369v241 citationsh-index: 14WWW
Originality Synthesis-oriented
AI Analysis

This addresses the workload reduction for front-end developers by providing a large-scale, real-world dataset, though it is incremental as it builds on existing MLLM approaches.

The paper tackles the problem of inadequate performance in automated webpage code generation from designs due to a lack of high-quality datasets by introducing WebCode2M, a dataset of 2.56 million real-world instances, which significantly improves MLLMs' code generation abilities as validated by benchmarking results.

Automatically generating webpage code from webpage designs can significantly reduce the workload of front-end developers, and recent Multimodal Large Language Models (MLLMs) have shown promising potential in this area. However, our investigation reveals that most existing MLLMs are constrained by the absence of high-quality, large-scale, real-world datasets, resulting in inadequate performance in automated webpage code generation. To fill this gap, this paper introduces WebCode2M, a new dataset comprising 2.56 million instances, each containing a design image along with the corresponding webpage code and layout details. Sourced from real-world web resources, WebCode2M offers a rich and valuable dataset for webpage code generation across a variety of applications. The dataset quality is ensured by a scoring model that filters out instances with aesthetic deficiencies or other incomplete elements. To validate the effectiveness of WebCode2M, we introduce a baseline model based on the Vision Transformer (ViT), named WebCoder, and establish a benchmark for fair comparison. Additionally, we introduce a new metric, TreeBLEU, to measure the structural hierarchy recall. The benchmarking results demonstrate that our dataset significantly improves the ability of MLLMs to generate code from webpage designs, confirming its effectiveness and usability for future applications in front-end design tools. Finally, we highlight several practical challenges introduced by our dataset, calling for further research. The code and dataset are publicly available at our project homepage: https://webcode2m.github.io.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes