MA AI CYAug 29, 2025

Synthetic Founders: AI-Generated Social Simulations for Startup Validation Research in Computational Social Science

arXiv:2509.02605v1

Originality Synthesis-oriented

AI Analysis

This work provides a comparative framework for using AI-generated personas as complementary tools in computational social science, particularly for startup validation research, though it is incremental in nature.

The study compared human startup founders' interview data with AI-generated synthetic personas to assess fidelity and gaps in AI-enabled social simulations, finding both convergent themes like efficiency gains and divergent ones such as human-only relational value and synthetic-only amplified false positives.

We present a comparative docking experiment that aligns human-subject interview data with large language model (LLM)-driven synthetic personas to evaluate fidelity, divergence, and blind spots in AI-enabled simulation. Fifteen early-stage startup founders were interviewed about their hopes and concerns regarding AI-powered validation, and the same protocol was replicated with AI-generated founder and investor personas. A structured thematic synthesis revealed four categories of outcomes: (1) Convergent themes - commitment-based demand signals, black-box trust barriers, and efficiency gains were consistently emphasized across both datasets; (2) Partial overlaps - founders worried about outliers being averaged away and the stress of real customer validation, while synthetic personas highlighted irrational blind spots and framed AI as a psychological buffer; (3) Human-only themes - relational and advocacy value from early customer engagement and skepticism toward moonshot markets; and (4) Synthetic-only themes - amplified false positives and trauma blind spots, where AI may overstate adoption potential by missing negative historical experiences. We interpret this comparative framework as evidence that LLM-driven personas constitute a form of hybrid social simulation: more linguistically expressive and adaptable than traditional rule-based agents, yet bounded by the absence of lived history and relational consequence. Rather than replacing empirical studies, we argue they function as a complementary simulation category - capable of extending hypothesis space, accelerating exploratory validation, and clarifying the boundaries of cognitive realism in computational social science.

View on arXiv PDF

Similar