IRAIJan 1

The Discovery Gap: How Product Hunt Startups Vanish in LLM Organic Discovery Queries

arXiv:2601.00912v11 citationsh-index: 3
Originality Incremental advance
AI Analysis

This addresses the problem for startup founders and marketers seeking visibility in AI-driven search, revealing that Generative Engine Optimization is ineffective and traditional SEO is more impactful, though the findings are incremental as they build on existing SEO and LLM research.

The study investigated how startups from Product Hunt appear in LLM responses to discovery queries, finding that while name-based queries had high recognition rates (99.4% for ChatGPT, 94.3% for Perplexity), discovery-style queries had much lower success (3.32% for ChatGPT, 8.29% for Perplexity), with a 30-to-1 gap for ChatGPT.

When someone asks ChatGPT to recommend a project management tool, which products show up in the response? And more importantly for startup founders: will their newly launched product ever appear? This research set out to answer these questions. I randomly selected 112 startups from the top 500 products featured on the 2025 Product Hunt leaderboard and tested each one across 2,240 queries to two different large language models: ChatGPT (gpt-4o-mini) and Perplexity (sonar with web search). The results were striking. When users asked about products by name, both LLMs recognized them almost perfectly: 99.4% for ChatGPT and 94.3% for Perplexity. But when users asked discovery-style questions like "What are the best AI tools launched this year?" the success rates collapsed to 3.32% and 8.29% respectively. That's a gap of 30-to-1 for ChatGPT. Perhaps the most surprising finding was that Generative Engine Optimization (GEO), the practice of optimizing website content for AI visibility, showed no correlation with actual discovery rates. Products with high GEO scores were no more likely to appear in organic queries than products with low scores. What did matter? For Perplexity, traditional SEO signals like referring domains (r = +0.319, p < 0.001) and Product Hunt ranking (r = -0.286, p = 0.002) predicted visibility. After cleaning the Reddit data for false positives, community presence also emerged as significant (r = +0.395, p = 0.002). The practical takeaway is counterintuitive: don't optimize for AI discovery directly. Instead, build the SEO foundation first and LLM visibility will follow.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes