SEApr 28, 2020

Human-Like Summaries from Heterogeneous and Time-Windowed Software Development Artefacts

arXiv:2004.14151v12 citations
AI Analysis

This addresses the challenge of summarizing multi-document software artifacts for developers, but it appears incremental as it builds on existing summarization techniques.

The authors tackled the problem of summarizing heterogeneous software development artifacts within a time frame, presenting the first framework for this task and showing that users find summaries most useful when generated using word similarity and based on the eight most relevant artifacts.

Automatic text summarisation has drawn considerable interest in the area of software engineering. It is challenging to summarise the activities related to a software project, (1) because of the volume and heterogeneity of involved software artefacts, and (2) because it is unclear what information a developer seeks in such a multi-document summary. We present the first framework for summarising multi-document software artefacts containing heterogeneous data within a given time frame. To produce human-like summaries, we employ a range of iterative heuristics to minimise the cosine-similarity between texts and high-dimensional feature vectors. A first study shows that users find the automatically generated summaries the most useful when they are generated using word similarity and based on the eight most relevant software artefacts.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes