SE AI LG | Jul 11, 2024

DeepCodeProbe: Towards Understanding What Models Trained on Code Learn

Vahid Majdinasab, Amin Nikanjam, Foutse Khomh

arXiv:2407.08890v13.32 citations | h-index: 48 | Has Code

Originality: Incremental advance

AI Analysis

This paper addresses interpretability issues in models used for software maintenance, which matter most in safety-critical applications. The advance is incremental because it builds on existing probing methods.

The paper tackles the interpretability problem of machine learning models trained on code by introducing DeepCodeProbe, a probing approach that examines their syntax and representation learning. It finds that increasing model capacity improves syntax learning but introduces trade-offs such as increased training time and overfitting.

Machine learning models trained on code and related artifacts offer valuable support for software maintenance but suffer from interpretability issues due to their complex internal variables. These concerns are particularly significant in safety-critical applications where the models' decision-making processes must be reliable. The specific features and representations learned by these models remain unclear, adding to the hesitancy in adopting them widely. To address these challenges, we introduce DeepCodeProbe, a probing approach that examines the syntax and representation learning abilities of ML models designed for software maintenance tasks. Our study applies DeepCodeProbe to state-of-the-art models for code clone detection, code summarization, and comment generation. Findings reveal that while small models capture abstract syntactic representations, their ability to fully grasp programming language syntax is limited. Increasing model capacity improves syntax learning but introduces trade-offs such as increased training time and overfitting. DeepCodeProbe also identifies specific code patterns the models learn from their training data. Additionally, we provide best practices for training models on code to enhance performance and interpretability, supported by an open-source replication package for broader application of DeepCodeProbe in interpreting other code-related models.
