CLJun 11, 2025

Classifying Unreliable Narrators with Large Language Models

arXiv:2506.10231v12 citationsh-index: 3ACL
Originality Incremental advance
AI Analysis

This addresses the need for automated reliability assessment in narratives for applications like content moderation or literary analysis, but it is incremental as it builds on existing LLM methods.

The paper tackled the problem of identifying unreliable narrators in text by creating a dataset and evaluating LLMs on classification tasks, finding the task challenging but showing potential for LLMs to detect unreliability.

Often when we interact with a first-person account of events, we consider whether or not the narrator, the primary speaker of the text, is reliable. In this paper, we propose using computational methods to identify unreliable narrators, i.e. those who unintentionally misrepresent information. Borrowing literary theory from narratology to define different types of unreliable narrators based on a variety of textual phenomena, we present TUNa, a human-annotated dataset of narratives from multiple domains, including blog posts, subreddit posts, hotel reviews, and works of literature. We define classification tasks for intra-narrational, inter-narrational, and inter-textual unreliabilities and analyze the performance of popular open-weight and proprietary LLMs for each. We propose learning from literature to perform unreliable narrator classification on real-world text data. To this end, we experiment with few-shot, fine-tuning, and curriculum learning settings. Our results show that this task is very challenging, and there is potential for using LLMs to identify unreliable narrators. We release our expert-annotated dataset and code and invite future research in this area.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes