CLJan 23, 2021

Reproducibility, Replicability and Beyond: Assessing Production Readiness of Aspect Based Sentiment Analysis in the Wild

arXiv:2101.09449v119 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This work addresses reproducibility and deployment challenges for practitioners in e-commerce and sentiment analysis, but it is incremental as it reviews existing models and provides new datasets.

The authors assessed the production readiness of aspect-based sentiment analysis models, finding a 4-5% average drop in test accuracy due to reproducibility issues and a 12-55% drop on challenging data slices, while showing that using 10-25% of domain-specific data with cross-domain datasets can close performance gaps.

With the exponential growth of online marketplaces and user-generated content therein, aspect-based sentiment analysis has become more important than ever. In this work, we critically review a representative sample of the models published during the past six years through the lens of a practitioner, with an eye towards deployment in production. First, our rigorous empirical evaluation reveals poor reproducibility: an average 4-5% drop in test accuracy across the sample. Second, to further bolster our confidence in empirical evaluation, we report experiments on two challenging data slices, and observe a consistent 12-55% drop in accuracy. Third, we study the possibility of transfer across domains and observe that as little as 10-25% of the domain-specific training dataset, when used in conjunction with datasets from other domains within the same locale, largely closes the gap between complete cross-domain and complete in-domain predictive performance. Lastly, we open-source two large-scale annotated review corpora from a large e-commerce portal in India in order to aid the study of replicability and transfer, with the hope that it will fuel further growth of the field.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes