IR AI CR LGJul 19, 2022

Defending Substitution-Based Profile Pollution Attacks on Sequential Recommenders

Zhenrui Yue, Huimin Zeng, Ziyi Kou, Lanyu Shang, Dong Wang

arXiv:2207.11237v113.636 citationsh-index: 22Has Code

Originality Incremental advance

AI Analysis

This addresses security vulnerabilities in sequential recommender systems, which are widely used in e-commerce and content platforms, by introducing both an attack to expose weaknesses and a defense method, though it is incremental as it builds on existing adversarial attack and defense techniques in machine learning.

The paper demonstrates that sequential recommender systems are vulnerable to substitution-based profile pollution attacks, which significantly degrade performance, and proposes a defense method called Dirichlet neighborhood sampling that outperforms baselines across various models and datasets.

While sequential recommender systems achieve significant improvements on capturing user dynamics, we argue that sequential recommenders are vulnerable against substitution-based profile pollution attacks. To demonstrate our hypothesis, we propose a substitution-based adversarial attack algorithm, which modifies the input sequence by selecting certain vulnerable elements and substituting them with adversarial items. In both untargeted and targeted attack scenarios, we observe significant performance deterioration using the proposed profile pollution algorithm. Motivated by such observations, we design an efficient adversarial defense method called Dirichlet neighborhood sampling. Specifically, we sample item embeddings from a convex hull constructed by multi-hop neighbors to replace the original items in input sequences. During sampling, a Dirichlet distribution is used to approximate the probability distribution in the neighborhood such that the recommender learns to combat local perturbations. Additionally, we design an adversarial training method tailored for sequential recommender systems. In particular, we represent selected items with one-hot encodings and perform gradient ascent on the encodings to search for the worst case linear combination of item embeddings in training. As such, the embedding function learns robust item representations and the trained recommender is resistant to test-time adversarial examples. Extensive experiments show the effectiveness of both our attack and defense methods, which consistently outperform baselines by a significant margin across model architectures and datasets.

View on arXiv PDF Code

Similar