Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations
This research addresses the challenge of computationally guiding scientific ideation for researchers and innovators, offering a structured approach to discovery, though it appears incremental by building on existing methods like contrastive learning and LLMs.
The paper tackles the problem of modeling and discovering impactful scientific knowledge combinations by proposing a framework that uses contrastive learning and reasoning-guided Monte Carlo search with LLMs to identify disruptive method combinations, showing it can model innovation dynamics and highlight high-potential combinations across multiple domains.
The emergence of large language models offers new possibilities for structured exploration of scientific knowledge. Rather than viewing scientific discovery as isolated ideas or content, we propose a structured approach that emphasizes the role of method combinations in shaping disruptive insights. Specifically, we investigate how knowledge unit--especially those tied to methodological design--can be modeled and recombined to yield research breakthroughs. Our proposed framework addresses two key challenges. First, we introduce a contrastive learning-based mechanism to identify distinguishing features of historically disruptive method combinations within problem-driven contexts. Second, we propose a reasoning-guided Monte Carlo search algorithm that leverages the chain-of-thought capability of LLMs to identify promising knowledge recombinations for new problem statements.Empirical studies across multiple domains show that the framework is capable of modeling the structural dynamics of innovation and successfully highlights combinations with high disruptive potential. This research provides a new path for computationally guided scientific ideation grounded in structured reasoning and historical data modeling.