AI LGFeb 2

What LLMs Think When You Don't Tell Them What to Think About?

arXiv:2602.01689v14.41 citationsh-index: 3

Originality Incremental advance

AI Analysis

This work addresses the problem of characterizing LLM behavior for AI safety by identifying biases and limitations in near-unconstrained generation, which is incremental but provides new insights into model-specific tendencies.

The study analyzed what large language models generate from minimal, topic-neutral inputs, revealing that each model family exhibits strong and systematic topical preferences, such as GPT-OSS generating programming (27.1%) and mathematical content (24.6%), and differences in content specialization and depth.

Characterizing the behavior of large language models (LLMs) across diverse settings is critical for reliable monitoring and AI safety. However, most existing analyses rely on topic- or task-specific prompts, which can substantially limit what can be observed. In this work, we study what LLMs generate from minimal, topic-neutral inputs and probe their near-unconstrained generative behavior. Despite the absence of explicit topics, model outputs cover a broad semantic space, and surprisingly, each model family exhibits strong and systematic topical preferences. GPT-OSS predominantly generates programming (27.1%) and mathematical content (24.6%), whereas Llama most frequently generates literary content (9.1%). DeepSeek often generates religious content, while Qwen frequently generates multiple-choice questions. Beyond topical preferences, we also observe differences in content specialization and depth: GPT-OSS often generates more technically advanced content (e.g., dynamic programming) compared with other models (e.g., basic Python). Furthermore, we find that the near-unconstrained generation often degenerates into repetitive phrases, revealing interesting behaviors unique to each model family. For instance, degenerate outputs from Llama include multiple URLs pointing to personal Facebook and Instagram accounts. We release the complete dataset of 256,000 samples from 16 LLMs, along with a reproducible codebase.

View on arXiv PDF

Similar