LGMLApr 17, 2020

Deep Reinforcement Learning for Adaptive Learning Systems

arXiv:2004.08410v153 citations
Originality Incremental advance
AI Analysis

This addresses the problem of personalized education for learners by providing an incremental improvement in adaptive learning systems through reinforcement learning techniques.

The paper tackles the adaptive learning problem by formulating it as a Markov decision process and applying deep Q-learning to find optimal individualized learning policies, with numerical simulations showing efficiency, especially when using a transition model estimator to achieve optimal policies with a small number of learners.

In this paper, we formulate the adaptive learning problem---the problem of how to find an individualized learning plan (called policy) that chooses the most appropriate learning materials based on learner's latent traits---faced in adaptive learning systems as a Markov decision process (MDP). We assume latent traits to be continuous with an unknown transition model. We apply a model-free deep reinforcement learning algorithm---the deep Q-learning algorithm---that can effectively find the optimal learning policy from data on learners' learning process without knowing the actual transition model of the learners' continuous latent traits. To efficiently utilize available data, we also develop a transition model estimator that emulates the learner's learning process using neural networks. The transition model estimator can be used in the deep Q-learning algorithm so that it can more efficiently discover the optimal learning policy for a learner. Numerical simulation studies verify that the proposed algorithm is very efficient in finding a good learning policy, especially with the aid of a transition model estimator, it can find the optimal learning policy after training using a small number of learners.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes