DCSEMar 13

LLM-HPC++: Evaluating LLM-Generated Modern C++ and MPI+OpenMP Codes for Scalable Mandelbrot Set Computation

arXiv:2512.1702360.33 citations
Predicted impact top 21% in DC · last 90 daysOriginality Synthesis-oriented
AI Analysis

This addresses the challenge of automating parallel programming for HPC developers, though it is incremental as it evaluates existing LLMs on a specific task.

The paper tackled the problem of evaluating LLMs for generating correct and efficient HPC code, specifically for Mandelbrot set computation using modern C++ and MPI+OpenMP, finding that ChatGPT-4 and ChatGPT-5 achieved strong syntactic precision and scalable performance.

Parallel programming remains one of the most challenging aspects of High-Performance Computing (HPC), requiring deep knowledge of synchronization, communication, and memory models. While modern C++ standards and frameworks like OpenMP and MPI have simplified parallelism, mastering these paradigms is still complex. Recently, Large Language Models (LLMs) have shown promise in automating code generation, but their effectiveness in producing correct and efficient HPC code is not well understood. In this work, we systematically evaluate leading LLMs including ChatGPT 4 and 5, Claude, and LLaMA on the task of generating C++ implementations of the Mandelbrot set using shared-memory, directive-based, and distributed-memory paradigms. Each generated program is compiled and executed with GCC 11.5.0 to assess its correctness, robustness, and scalability. Results show that ChatGPT-4 and ChatGPT-5 achieve strong syntactic precision and scalable performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes