TY - JOUR
T1 - Empirically evaluating the application of reinforcement learning to the induction of effective and adaptive pedagogical strategies
AU - Chi, Min
AU - VanLehn, Kurt
AU - Litman, Diane
AU - Jordan, Pamela
N1 - Funding Information:
Acknowledgments NSF (#0325054) supported this work. We also thank the Learning Research and Development Center at the University of Pittsburgh for providing all the facilities used in this work.
PY - 2011/4
Y1 - 2011/4
N2 - For many forms of e-learning environments, the system's behavior can be viewed as a sequential decision process wherein, at each discrete step, the system is responsible for selecting the next action to take. Pedagogical strategies are policies to decide the next system action when there are multiple ones available. In this project we present a Reinforcement Learning (RL) approach for inducing effective pedagogical strategies and empirical evaluations of the induced strategies. This paper addresses the technical challenges in applying RL to Cordillera, a Natural Language Tutoring System teaching students introductory college physics. The algorithm chosen for this project is a model-based RL approach, Policy Iteration, and the training corpus for the RL approach is an exploratory corpus, which was collected by letting the system make random decisions when interacting with real students. Overall, our results show that by using a rather small training corpus, the RL-induced strategies indeed measurably improved the effectiveness of Cordillera in that the RL-induced policies improved students' learning gains significantly.
AB - For many forms of e-learning environments, the system's behavior can be viewed as a sequential decision process wherein, at each discrete step, the system is responsible for selecting the next action to take. Pedagogical strategies are policies to decide the next system action when there are multiple ones available. In this project we present a Reinforcement Learning (RL) approach for inducing effective pedagogical strategies and empirical evaluations of the induced strategies. This paper addresses the technical challenges in applying RL to Cordillera, a Natural Language Tutoring System teaching students introductory college physics. The algorithm chosen for this project is a model-based RL approach, Policy Iteration, and the training corpus for the RL approach is an exploratory corpus, which was collected by letting the system make random decisions when interacting with real students. Overall, our results show that by using a rather small training corpus, the RL-induced strategies indeed measurably improved the effectiveness of Cordillera in that the RL-induced policies improved students' learning gains significantly.
KW - Human learning
KW - Machine learning
KW - Pedagogical strategy
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=79955803320&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79955803320&partnerID=8YFLogxK
U2 - 10.1007/s11257-010-9093-1
DO - 10.1007/s11257-010-9093-1
M3 - Article
AN - SCOPUS:79955803320
SN - 0924-1868
VL - 21
SP - 137
EP - 180
JO - User Modeling and User-Adapted Interaction
JF - User Modeling and User-Adapted Interaction
IS - 1-2
ER -