MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning
This addresses the problem of enhancing AI systems for knowledge and reasoning tasks beyond linguistic processing, representing a novel architectural approach rather than an incremental improvement.
The paper tackles the limitations of large language models by proposing a modular neuro-symbolic architecture called MRKL that integrates neural models with discrete knowledge and reasoning modules, resulting in a flexible system design as demonstrated by the Jurassic-X implementation.
Huge language models (LMs) have ushered in a new era for AI, serving as a gateway to natural-language-based knowledge tasks. Although an essential element of modern AI, LMs are also inherently limited in a number of ways. We discuss these limitations and how they can be avoided by adopting a systems approach. Conceptualizing the challenge as one that involves knowledge and reasoning in addition to linguistic processing, we define a flexible architecture with multiple neural models, complemented by discrete knowledge and reasoning modules. We describe this neuro-symbolic architecture, dubbed the Modular Reasoning, Knowledge and Language (MRKL, pronounced "miracle") system, some of the technical challenges in implementing it, and Jurassic-X, AI21 Labs' MRKL system implementation.