LG AI DC NE PL SESep 5, 2025

Discovering Software Parallelization Points Using Deep Neural Networks

Izavan dos S. Correia, Henrique C. T. Santos, Tiago A. E. Ferreira

arXiv:2509.16215v27.11 citationsh-index: 1

Originality Incremental advance

AI Analysis

This addresses software optimization for developers by automating parallelization point discovery, but it is incremental as it builds on existing deep learning methods for code analysis.

The study tackled the problem of identifying parallelizable loops in programming code by developing deep learning models, achieving a slightly higher mean performance with a CNN compared to a DNN based on 30 independent runs.

This study proposes a deep learning-based approach for discovering loops in programming code according to their potential for parallelization. Two genetic algorithm-based code generators were developed to produce two distinct types of code: (i) independent loops, which are parallelizable, and (ii) ambiguous loops, whose dependencies are unclear, making them impossible to define if the loop is parallelizable or not. The generated code snippets were tokenized and preprocessed to ensure a robust dataset. Two deep learning models - a Deep Neural Network (DNN) and a Convolutional Neural Network (CNN) - were implemented to perform the classification. Based on 30 independent runs, a robust statistical analysis was employed to verify the expected performance of both models, DNN and CNN. The CNN showed a slightly higher mean performance, but the two models had a similar variability. Experiments with varying dataset sizes highlighted the importance of data diversity for model performance. These results demonstrate the feasibility of using deep learning to automate the identification of parallelizable structures in code, offering a promising tool for software optimization and performance improvement.

View on arXiv PDF

Similar