CLMar 30, 2023

Federated Learning Based Multilingual Emoji Prediction In Clean and Attack Scenarios

arXiv:2304.01005v38 citationsh-index: 6
AI Analysis

This addresses privacy-preserving sentiment analysis for social media users, but it is incremental as it applies existing federated learning methods to a specific task.

The paper tackles multilingual emoji prediction using federated learning in clean and attack scenarios, showing that it performs similarly to centralized training across languages and data distributions, with transformers outperforming other techniques on the SemEval dataset.

Federated learning is a growing field in the machine learning community due to its decentralized and private design. Model training in federated learning is distributed over multiple clients giving access to lots of client data while maintaining privacy. Then, a server aggregates the training done on these multiple clients without access to their data, which could be emojis widely used in any social media service and instant messaging platforms to express users' sentiments. This paper proposes federated learning-based multilingual emoji prediction in both clean and attack scenarios. Emoji prediction data have been crawled from both Twitter and SemEval emoji datasets. This data is used to train and evaluate different transformer model sizes including a sparsely activated transformer with either the assumption of clean data in all clients or poisoned data via label flipping attack in some clients. Experimental results on these models show that federated learning in either clean or attacked scenarios performs similarly to centralized training in multilingual emoji prediction on seen and unseen languages under different data sources and distributions. Our trained transformers perform better than other techniques on the SemEval emoji dataset in addition to the privacy as well as distributed benefits of federated learning.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes