SENov 18, 2017

Automatic link extraction: The good, the bad and the ugly in software ecosystem mining

arXiv:1711.06908v1
Originality Synthesis-oriented
AI Analysis

This work addresses data quality issues for researchers studying multi-platform software evolution, but it is incremental as it builds on existing mining approaches.

The paper tackled the problem of automatic link extraction in software ecosystem mining by identifying pitfalls through manual investigation of RubyGems metadata, with the result being a framework to automate extraction and produce more complete datasets for researchers.

This abstract presents the automatic link extraction pitfalls based on our experience on manually investigating links in the RubyGems package manager metadata. This work can lead in automating the link extraction approach so as to avoid these pitfalls and produce more complete datasets to be used by researchers when they investigate the multi-platform evolution of software ecosystems.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes