CLCYApr 21, 2025

Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection

arXiv:2504.15392v111 citationsh-index: 4NAACL
Originality Incremental advance
AI Analysis

This work addresses the challenge of defining sexism for automated detection, offering a hybrid approach that could enhance AI fairness tools, though it is incremental in combining expert knowledge with LLMs.

The paper tackled the problem of zero-shot sexism detection by investigating collaboration between sexism researchers and LLMs to co-create definitions, finding that LLM-generated definitions generally outperformed expert-written ones, with some experts improving performance through co-created definitions.

This paper investigates hybrid intelligence and collaboration between researchers of sexism and Large Language Models (LLMs), with a four-component pipeline. First, nine sexism researchers answer questions about their knowledge of sexism and of LLMs. They then participate in two interactive experiments involving an LLM (GPT3.5). The first experiment has experts assessing the model's knowledge about sexism and suitability for use in research. The second experiment tasks them with creating three different definitions of sexism: an expert-written definition, an LLM-written one, and a co-created definition. Lastly, zero-shot classification experiments use the three definitions from each expert in a prompt template for sexism detection, evaluating GPT4o on 2.500 texts sampled from five sexism benchmarks. We then analyze the resulting 67.500 classification decisions. The LLM interactions lead to longer and more complex definitions of sexism. Expert-written definitions on average perform poorly compared to LLM-generated definitions. However, some experts do improve classification performance with their co-created definitions of sexism, also experts who are inexperienced in using LLMs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes