LGCLSEMLSep 3, 2020

CoNCRA: A Convolutional Neural Network Code Retrieval Approach

arXiv:2009.01959v113 citations
Originality Incremental advance
AI Analysis

This addresses the challenge for software developers who need to find code snippets based on natural language queries, though it appears incremental as it builds on existing methods.

The authors tackled the problem of semantic code search by proposing CoNCRA, a convolutional neural network approach that improved state-of-the-art by 5% on average and retrieved the most relevant code snippets in the top 3 positions nearly 80% of the time.

Software developers routinely search for code using general-purpose search engines. However, these search engines cannot find code semantically unless it has an accompanying description. We propose a technique for semantic code search: A Convolutional Neural Network approach to code retrieval (CoNCRA). Our technique aims to find the code snippet that most closely matches the developer's intent, expressed in natural language. We evaluated our approach's efficacy on a dataset composed of questions and code snippets collected from Stack Overflow. Our preliminary results showed that our technique, which prioritizes local interactions (words nearby), improved the state-of-the-art (SOTA) by 5% on average, retrieving the most relevant code snippets in the top 3 (three) positions by almost 80% of the time. Therefore, our technique is promising and can improve the efficacy of semantic code retrieval.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes