AIOct 24, 2025

String Seed of Thought: Prompting LLMs for Distribution-Faithful and Diverse Generation

arXiv:2510.21150v23 citationsh-index: 2
Originality Incremental advance
AI Analysis

This addresses a critical issue for applications requiring non-deterministic behaviors, such as human-behavior simulation and content diversification, though it is an incremental improvement in prompting techniques.

The paper tackles the problem of LLMs failing at Probabilistic Instruction Following (PIF), where they struggle to generate answers that align with target distributions, by introducing String Seed of Thought (SSoT), a prompting method that improves PIF performance to approach that of a pseudo-random number generator and enhances response diversity in open-ended tasks.

We introduce String Seed of Thought (SSoT), a novel prompting method for LLMs that improves Probabilistic Instruction Following (PIF). We define PIF as a task requiring an LLM to select its answer from a predefined set of options, each associated with a specific probability, such that the empirical distribution of the generated answers aligns with the target distribution when prompted multiple times. While LLMs excel at tasks with single, deterministic answers, they often fail at PIF, exhibiting biases problematic for applications requiring non-deterministic behaviors, such as human-behavior simulation, content diversification, and multiplayer games. It also harms the diversity of generated responses, a crucial factor in test-time scaling, by causing the outputs to collapse into a limited set of answers. To address this, we propose SSoT, a simple prompting method that instructs an LLM to first output a random string to generate sufficient entropy. SSoT also instructs the LLM to extract randomness by manipulating this string to derive a final answer, thereby preserving diversity while adhering to specific constraints. We demonstrate that SSoT significantly improves the PIF performance of LLMs, approaching the ideal performance of a pseudo-random number generator. Furthermore, our experiments on NoveltyBench show SSoT's benefits extend beyond closed-set tasks to open-ended tasks by enhancing response diversity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes