CLAIHCMay 14, 2025

An AI-Powered Research Assistant in the Lab: A Practical Guide for Text Analysis Through Iterative Collaboration with LLMs

arXiv:2505.09724v21 citationsh-index: 6
Originality Synthesis-oriented
AI Analysis

This offers a practical guide for researchers in social sciences or similar fields to automate text analysis, though it is incremental as it builds on existing LLM methods.

The paper tackles the problem of time-consuming and biased text analysis by providing a step-by-step tutorial for using LLMs to develop and apply taxonomies through iterative collaboration with researchers, demonstrating high intercoder reliability in categorizing datasets like personal goals.

Analyzing texts such as open-ended responses, headlines, or social media posts is a time- and labor-intensive process highly susceptible to bias. LLMs are promising tools for text analysis, using either a predefined (top-down) or a data-driven (bottom-up) taxonomy, without sacrificing quality. Here we present a step-by-step tutorial to efficiently develop, test, and apply taxonomies for analyzing unstructured data through an iterative and collaborative process between researchers and LLMs. Using personal goals provided by participants as an example, we demonstrate how to write prompts to review datasets and generate a taxonomy of life domains, evaluate and refine the taxonomy through prompt and direct modifications, test the taxonomy and assess intercoder agreements, and apply the taxonomy to categorize an entire dataset with high intercoder reliability. We discuss the possibilities and limitations of using LLMs for text analysis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes