SE CL LGOct 23, 2020

On the Embeddings of Variables in Recurrent Neural Networks for Source Code

arXiv:2010.12693v2727 citations

Originality Incremental advance

AI Analysis

This work addresses the challenge of accurately representing variable semantics in programming languages for developers and tools, though it is incremental as it builds on existing NLP methods.

The paper tackled the problem of variable semantics in source code processing by developing dynamic embeddings that adjust based on variable context, resulting in significant performance improvements in code completion and bug fixing tasks.

Source code processing heavily relies on the methods widely used in natural language processing (NLP), but involves specifics that need to be taken into account to achieve higher quality. An example of this specificity is that the semantics of a variable is defined not only by its name but also by the contexts in which the variable occurs. In this work, we develop dynamic embeddings, a recurrent mechanism that adjusts the learned semantics of the variable when it obtains more information about the variable's role in the program. We show that using the proposed dynamic embeddings significantly improves the performance of the recurrent neural network, in code completion and bug fixing tasks.

View on arXiv PDF

Similar