CLED-PHAug 27, 2025

Scalable and consistent few-shot classification of survey responses using text embeddings

arXiv:2508.19836v11 citationsh-index: 16
Originality Incremental advance
AI Analysis

This addresses the challenge for social science researchers by enabling scalable and consistent few-shot classification, though it is incremental as it builds on existing text embedding methods.

The paper tackles the problem of time-consuming and inconsistent qualitative analysis of open-ended survey responses by introducing a text embedding-based classification framework that requires only a few examples per category, achieving a Cohen's Kappa of 0.74 to 0.83 compared to expert human coders on a dataset of 2899 responses.

Qualitative analysis of open-ended survey responses is a commonly-used research method in the social sciences, but traditional coding approaches are often time-consuming and prone to inconsistency. Existing solutions from Natural Language Processing such as supervised classifiers, topic modeling techniques, and generative large language models have limited applicability in qualitative analysis, since they demand extensive labeled data, disrupt established qualitative workflows, and/or yield variable results. In this paper, we introduce a text embedding-based classification framework that requires only a handful of examples per category and fits well with standard qualitative workflows. When benchmarked against human analysis of a conceptual physics survey consisting of 2899 open-ended responses, our framework achieves a Cohen's Kappa ranging from 0.74 to 0.83 as compared to expert human coders in an exhaustive coding scheme. We further show how performance of this framework improves with fine-tuning of the text embedding model, and how the method can be used to audit previously-analyzed datasets. These findings demonstrate that text embedding-assisted coding can flexibly scale to thousands of responses without sacrificing interpretability, opening avenues for deductive qualitative analysis at scale.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes