CL CY LGJul 12, 2021

Lumen: A Machine Learning Framework to Expose Influence Cues in Text

Hanyu Shi, Mirela Silva, Daniel Capecci, Luiz Giovanini, Lauren Czech, Juliana Fernandes, Daniela Oliveira

arXiv:2107.10655v10.72 citations

Originality Incremental advance

AI Analysis

This work addresses the need for automated tools to improve human detection of deceptive online content, though it is incremental as it builds on existing learning models with a new dataset.

The paper tackles the problem of detecting influence cues in text, such as persuasion and emotion, to combat phishing and disinformation, by introducing Lumen, a machine learning framework that achieved competitive F1-micro scores with better interpretability compared to other models like LSTM.

Phishing and disinformation are popular social engineering attacks with attackers invariably applying influence cues in texts to make them more appealing to users. We introduce Lumen, a learning-based framework that exposes influence cues in text: (i) persuasion, (ii) framing, (iii) emotion, (iv) objectivity/subjectivity, (v) guilt/blame, and (vi) use of emphasis. Lumen was trained with a newly developed dataset of 3K texts comprised of disinformation, phishing, hyperpartisan news, and mainstream news. Evaluation of Lumen in comparison to other learning models showed that Lumen and LSTM presented the best F1-micro score, but Lumen yielded better interpretability. Our results highlight the promise of ML to expose influence cues in text, towards the goal of application in automatic labeling tools to improve the accuracy of human-based detection and reduce the likelihood of users falling for deceptive online content.

View on arXiv PDF

Similar