CLAIMar 15, 2024

Read between the lines -- Functionality Extraction From READMEs

arXiv:2403.10205v1h-index: 11NAACL-HLT
Originality Incremental advance
AI Analysis

This addresses a specific need for developers and researchers working with code-related tasks by providing a new dataset and method for functionality extraction, though it is incremental as it builds on existing text summarization and LLM techniques.

The paper tackles the problem of extracting functionality from Git README files, a novel text-to-text generation task, by introducing a human-annotated dataset called FuncRead and developing models for it. The result shows that fine-tuned small models outperform large language models like ChatGPT and Bard, with a 7 Billion CodeLlama model achieving 70% and 20% gains in F1 score against them.

While text summarization is a well-known NLP task, in this paper, we introduce a novel and useful variant of it called functionality extraction from Git README files. Though this task is a text2text generation at an abstract level, it involves its own peculiarities and challenges making existing text2text generation systems not very useful. The motivation behind this task stems from a recent surge in research and development activities around the use of large language models for code-related tasks, such as code refactoring, code summarization, etc. We also release a human-annotated dataset called FuncRead, and develop a battery of models for the task. Our exhaustive experimentation shows that small size fine-tuned models beat any baseline models that can be designed using popular black-box or white-box large language models (LLMs) such as ChatGPT and Bard. Our best fine-tuned 7 Billion CodeLlama model exhibit 70% and 20% gain on the F1 score against ChatGPT and Bard respectively.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes