Classroom AI: Large Language Models as Grade-Specific Teachers
This addresses teacher shortages by enabling AI-assisted learning tailored to different educational levels, though it is an incremental improvement using existing methods on new data.
The paper tackles the problem of large language models failing to provide grade-appropriate educational responses by introducing a framework for finetuning them to generate age-appropriate content across six grade levels, achieving a 35.64 percentage point improvement in grade-level alignment compared to prompt-based methods.
Large Language Models (LLMs) offer a promising solution to complement traditional teaching and address global teacher shortages that affect hundreds of millions of children, but they fail to provide grade-appropriate responses for students at different educational levels. We introduce a framework for finetuning LLMs to generate age-appropriate educational content across six grade levels, from lower elementary to adult education. Our framework successfully adapts explanations to match students' comprehension capacities without sacrificing factual correctness. This approach integrates seven established readability metrics through a clustering method and builds a comprehensive dataset for grade-specific content generation. Evaluations across multiple datasets with 208 human participants demonstrate substantial improvements in grade-level alignment, achieving a 35.64 percentage point increase compared to prompt-based methods while maintaining response accuracy. AI-assisted learning tailored to different grade levels has the potential to advance educational engagement and equity.