SEDSAug 22, 2019

Interactive Duplicate Search in Software Documentation

arXiv:1908.08266v1
Originality Incremental advance
AI Analysis

This addresses documentation maintenance issues for software developers, but appears incremental as it builds on existing duplicate detection concepts with user interaction.

The paper tackles the problem of duplicate detection in software documentation, which hinders maintenance, by presenting an interactive process that involves users for meaningful search, and demonstrates results from experiments on industrial project documents.

Various software features such as classes, methods, requirements, and tests often have similar functionality. This can lead to emergence of duplicates in their descriptive documentation. Uncontrolled duplicates created via copy/paste hinder the process of documentation maintenance. Therefore, the task of duplicate detection in software documentation is of importance. Solving it makes planned reuse possible, as well as creating and using templates for unification and automatic generation of documentation. In this paper, we present an interactive process for duplicate detection that involves the user in order to conduct meaningful search. It includes a new formal definition of a near duplicate, a pattern-based, and the proof of its completeness. Moreover, we demonstrate the results of experimenting on a collection of documents of several industrial projects.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes