SECLIRDec 17, 2024

Selective Shot Learning for Code Explanation

arXiv:2412.12852v13 citationsh-index: 13Has Code
Originality Incremental advance
AI Analysis

This work addresses the need for better code explanation tools for software developers, but it is incremental as it builds on existing selective shot learning approaches.

The paper tackles the problem of improving code explanation by LLMs through selective few-shot example selection, proposing a novel method (SSL_ner) that uses entity information and shows effectiveness over state-of-the-art methods across two datasets.

Code explanation plays a crucial role in the software engineering domain, aiding developers in grasping code functionality efficiently. Recent work shows that the performance of LLMs for code explanation improves in a few-shot setting, especially when the few-shot examples are selected intelligently. State-of-the-art approaches for such Selective Shot Learning (SSL) include token-based and embedding-based methods. However, these SSL approaches have been evaluated on proprietary LLMs, without much exploration on open-source Code-LLMs. Additionally, these methods lack consideration for programming language syntax. To bridge these gaps, we present a comparative study and propose a novel SSL method (SSL_ner) that utilizes entity information for few-shot example selection. We present several insights and show the effectiveness of SSL_ner approach over state-of-the-art methods across two datasets. To the best of our knowledge, this is the first systematic benchmarking of open-source Code-LLMs while assessing the performances of the various few-shot examples selection approaches for the code explanation task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes