CY Mar 21

A Methodological Guide on Using Large Language Models for Text Annotation in the Social Sciences and Humanities with Python and R

Qixiang Fang, Javier Garcia Bernardo, Erik-Jan van Kesteren

arXiv:2604.09638 76.5 h-index: 8

AI Analysis

The paper offers practical guidance for social science and humanities (SSH) researchers who want to incorporate LLM-based annotation into their workflows. Its contribution is incremental, since it focuses on application rather than on new technical breakthroughs.

This paper provides a comprehensive methodological guide for social science and humanities researchers to use large language models (LLMs) for automating text annotation, addressing challenges like accessibility and annotation errors that can bias downstream statistical analyses.

Large language models (LLMs) have become an essential tool for social science and humanities (SSH) researchers who work with text. One particularly valuable application is automating text annotation, a traditionally time-consuming step in preparing data for empirical analysis. Yet many SSH researchers face two challenges: getting started with LLMs and understanding how to address their limitations. Practically, the rapid pace of model development can make LLMs seem inaccessible or intimidating, while even experienced users may overlook how annotation errors can bias downstream statistical analyses (e.g., regression estimates and $p$-values), even when annotation accuracy appears high. This paper provides a comprehensive, step-by-step methodological guide for using LLMs for text annotation in SSH research, with clear Python and R code snippets. We cover (1) how LLMs work and what they can and cannot do; (2) how to identify an LLM-suitable research project and establish minimum data and computational requirements; (3) how to design prompts and run annotation tasks; (4) how to evaluate annotation quality and iteratively refine prompts without overfitting; (5) how to integrate LLM annotations into downstream statistical analyses while accounting for annotation error; and (6) how to manage cost, efficiency, and reproducibility when scaling up annotation. Throughout, we provide intuitive methodological reasoning, concrete examples, code snippets, and best-practice guidance to help researchers confidently and transparently incorporate LLM-based annotation into their scientific workflows.
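The abstract's claim that annotation errors can bias regression estimates even when accuracy appears high can be illustrated with a small simulation. The sketch below is not taken from the paper: the function names (`ols_slope`, `simulate`), the 90% accuracy figure, the balanced classes, and the symmetric error pattern are all assumptions chosen for illustration, and the data are simulated. It regresses an outcome on a binary label twice, once using the true labels and once using labels that have been flipped at random to mimic an imperfect annotator.

```python
import random


def ols_slope(x, y):
    """Slope of a simple least-squares regression of y on x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def simulate(n=20000, accuracy=0.9, true_effect=1.0, seed=0):
    """Hypothetical setup: compare slopes from true vs. noisy binary labels.

    Assumptions (not from the paper): balanced classes, errors that are
    symmetric and independent of the outcome, Gaussian noise in the outcome.
    """
    rng = random.Random(seed)
    # Ground-truth binary annotation, e.g. "text mentions topic X".
    true_labels = [1 if rng.random() < 0.5 else 0 for _ in range(n)]
    # The outcome depends on the TRUE label plus noise.
    outcome = [true_effect * t + rng.gauss(0, 1) for t in true_labels]
    # Simulated annotator: keeps the true label with probability `accuracy`.
    noisy_labels = [t if rng.random() < accuracy else 1 - t for t in true_labels]
    return ols_slope(true_labels, outcome), ols_slope(noisy_labels, outcome)


if __name__ == "__main__":
    slope_true, slope_noisy = simulate()
    print(f"slope with true labels:  {slope_true:.3f}")
    print(f"slope with noisy labels: {slope_noisy:.3f}")
```

Under these particular assumptions, the slope estimated from the noisy labels shrinks toward zero by a factor of about 1 - 2(1 - accuracy), so labels that are 90% accurate recover only about 80% of the true effect. Real annotation errors are rarely this well behaved: when they correlate with the outcome or with other covariates, the bias can go in either direction, which is why error-aware downstream analysis matters.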
