Moving Beyond the Turing Test with the Allen AI Science Challenge
This addresses the need for better benchmarks in AI evaluation, though it is incremental as it builds on existing competition frameworks.
The paper tackles the problem of assessing AI's proximity to human-level intelligence by introducing the Allen AI Science Challenge, which resulted in a Kaggle competition that provided insights and next steps for evaluation.
Given recent successes in AI (e.g., AlphaGo's victory against Lee Sedol in the game of GO), it's become increasingly important to assess: how close are AI systems to human-level intelligence? This paper describes the Allen AI Science Challenge---an approach towards that goal which led to a unique Kaggle Competition, its results, the lessons learned, and our next steps.