Apr 3

Million Tutoring Moves (MTM): An Open Multimodal Dataset for the Science of Tutoring

arXiv:2605.08092
AI Analysis

For researchers and developers in AI and education, MTM provides an open dataset for studying tutoring processes and building AI systems, though the initial release (v1) is limited to 4,654 math tutoring transcripts.

The paper introduces the Million Tutoring Moves (MTM) dataset, an open multimodal resource containing 4,654 math tutoring transcripts from an online platform, aiming to advance research on tutoring interactions and AI educational tools.

We introduce the Million Tutoring Moves (MTM) project, an open dataset initiative aimed at advancing the science of tutoring through large-scale, reusable, and multimodal interaction data. MTM is developed within the National Tutoring Observatory (NTO), a research infrastructure designed to study authentic tutoring interactions and translate them into actionable insights for research, practice, and AI-powered educational technology development. In this paper, we present the vision behind MTM and describe MTM v1, an initial release consisting of 4,654 math tutoring transcripts from a U.S.-based nonprofit online tutoring platform. MTM v1 serves as a first step toward a broader repository that is safe, open, large-scale, broad-coverage, and multimodal. By making tutoring interactions systematically observable and analyzable, MTM aims to support research on instructional processes, improve tutoring practice, and enable the development of AI systems grounded in real educational interactions.

