Mining Duplicate Questions of Stack Overflow
This work addresses duplicate question detection for Stack Overflow users, but the contribution is incremental: it builds on existing neural methods, adding a focus on code modeling.
The paper tackles the problem of duplicate questions on Stack Overflow, which degrade site quality, by proposing two neural network architectures and by explicitly modeling the code in questions, achieving results that surpass the state of the art.
There has been a significant rise in the use of Community Question Answering sites (CQAs) over the last decade, owing primarily to their ability to leverage the wisdom of the crowd. Duplicate questions have a crippling effect on the quality of these sites. Tackling duplicate questions is therefore an important step towards improving the quality of CQAs. In this regard, we propose two neural network-based architectures for duplicate question detection on Stack Overflow. We also propose explicitly modeling the code present in questions to achieve results that surpass the state of the art.