IRAICRLGJul 19, 2022

Defending Substitution-Based Profile Pollution Attacks on Sequential Recommenders

arXiv:2207.11237v136 citationsh-index: 22
Originality Incremental advance
AI Analysis

This addresses security vulnerabilities in sequential recommender systems, which are widely used in e-commerce and content platforms, by introducing both an attack to expose weaknesses and a defense method, though it is incremental as it builds on existing adversarial attack and defense techniques in machine learning.

The paper demonstrates that sequential recommender systems are vulnerable to substitution-based profile pollution attacks, which significantly degrade performance, and proposes a defense method called Dirichlet neighborhood sampling that outperforms baselines across various models and datasets.

While sequential recommender systems achieve significant improvements on capturing user dynamics, we argue that sequential recommenders are vulnerable against substitution-based profile pollution attacks. To demonstrate our hypothesis, we propose a substitution-based adversarial attack algorithm, which modifies the input sequence by selecting certain vulnerable elements and substituting them with adversarial items. In both untargeted and targeted attack scenarios, we observe significant performance deterioration using the proposed profile pollution algorithm. Motivated by such observations, we design an efficient adversarial defense method called Dirichlet neighborhood sampling. Specifically, we sample item embeddings from a convex hull constructed by multi-hop neighbors to replace the original items in input sequences. During sampling, a Dirichlet distribution is used to approximate the probability distribution in the neighborhood such that the recommender learns to combat local perturbations. Additionally, we design an adversarial training method tailored for sequential recommender systems. In particular, we represent selected items with one-hot encodings and perform gradient ascent on the encodings to search for the worst case linear combination of item embeddings in training. As such, the embedding function learns robust item representations and the trained recommender is resistant to test-time adversarial examples. Extensive experiments show the effectiveness of both our attack and defense methods, which consistently outperform baselines by a significant margin across model architectures and datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes