LGCLIRMLApr 27, 2018

Auto-Detection of Safety Issues in Baby Products

arXiv:1805.09772v2
Originality Synthesis-oriented
AI Analysis

This work addresses consumer safety by enabling earlier recalls to reduce injuries and deaths, though it is incremental as it applies existing methods to a new dataset.

The paper tackled the problem of detecting safety issues in baby products by analyzing online customer reviews and official complaints, achieving a logistic regression model with 66% precision in identifying top safety concerns.

Every year, thousands of people receive consumer product related injuries. Research indicates that online customer reviews can be processed to autonomously identify product safety issues. Early identification of safety issues can lead to earlier recalls, and thus fewer injuries and deaths. A dataset of product reviews from Amazon.com was compiled, along with \emph{SaferProducts.gov} complaints and recall descriptions from the Consumer Product Safety Commission (CPSC) and European Commission Rapid Alert system. A system was built to clean the collected text and to extract relevant features. Dimensionality reduction was performed by computing feature relevance through a Random Forest and discarding features with low information gain. Various classifiers were analyzed, including Logistic Regression, SVMs, Na{ï}ve-Bayes, Random Forests, and an Ensemble classifier. Experimentation with various features and classifier combinations resulted in a logistic regression model with 66\% precision in the top 50 reviews surfaced. This classifier outperforms all benchmarks set by related literature and consumer product safety professionals.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes