POEM: A Personalized Online Education Scheme Based on Reinforcement Learning

Cited by: 2

Authors:

Wang, Yufeng [1]

Cai, Wenjie [1]

Chen, Meijuan [1]

Shen, Jianhua [1]

Affiliations:

[1] Nanjing Univ Posts & Telecomm, Coll Telecommun & Informat Engn, Nanjing, Peoples R China

Source:

PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON TEACHING, ASSESSMENT, AND LEARNING FOR ENGINEERING (IEEE TALE 2020) | 2020

Keywords:

personalized education; reinforcement learning; Gaussian process; multi-armed bandit; zone of proximal development (ZPD);

DOI:

10.1109/TALE48869.2020.9368369

Chinese Library Classification (CLC) number:

G40 [Education];

Discipline classification code:

040101 ; 120403 ;

Abstract:

As online e-learning systems become more prevalent, there is a growing need for them to accommodate individual differences among students. According to the concept of the zone of proximal development (ZPD), online students should be given educational content that is neither too easy nor too difficult, but slightly beyond their current abilities. However, following the ZPD rule is challenging in an online e-learning system for two reasons: the system does not know a priori the ability of online students, especially newly arrived ones; and the exact relationship between student feedback on teaching and student ability (i.e., the reward/gain function) is extremely complicated, and may be unknown even to the students themselves. To address these issues, this paper proposes POEM, a personalized educational scheme that aims to maximize students' cumulative learning gains over multiple rounds. Specifically, instead of assuming any specific form of reward function, we first estimate the unknown reward function from noisy samples using a Gaussian process (GP) model. Then, a multi-armed bandit based algorithm selects teaching content with an adaptive difficulty level, balancing exploration and exploitation. The simulation results demonstrate the effectiveness of the proposed method.
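The abstract names only the two ingredients (a GP estimate of the reward function and a bandit rule over difficulty levels), so the following is a minimal sketch under stated assumptions, not the authors' algorithm. It assumes a GP-UCB selection rule, a squared-exponential kernel, eleven discrete difficulty levels in [0, 1], and a made-up reward curve that peaks slightly above a hidden ability value. All function names and parameter values are hypothetical.

```python
import math
import random

def rbf(a, b, length=0.2):
    """Squared-exponential kernel between two scalar difficulty levels."""
    return math.exp(-0.5 * ((a - b) / length) ** 2)

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

def gp_posterior(xs, ys, x_new, noise_var=0.01):
    """Posterior mean and variance of a zero-mean GP at x_new."""
    if not xs:
        return 0.0, rbf(x_new, x_new)
    K = [[rbf(a, b) + (noise_var if i == j else 0.0)
          for j, b in enumerate(xs)] for i, a in enumerate(xs)]
    k_star = [rbf(a, x_new) for a in xs]
    alpha = solve(K, ys)       # K^{-1} y
    v = solve(K, k_star)       # K^{-1} k_*
    mean = sum(k * a for k, a in zip(k_star, alpha))
    var = rbf(x_new, x_new) - sum(k * w for k, w in zip(k_star, v))
    return mean, max(var, 0.0)

def gp_ucb_select(levels, xs, ys, beta=2.0):
    """Pick the level maximizing mean + sqrt(beta * variance)."""
    def ucb(x):
        m, v = gp_posterior(xs, ys, x)
        return m + math.sqrt(beta * v)
    return max(levels, key=ucb)

def simulate(rounds=20, seed=0):
    """Run the loop against a hypothetical student; return chosen levels."""
    rng = random.Random(seed)
    levels = [i / 10 for i in range(11)]
    ability = 0.5  # hidden from the selector (assumed value)

    def gain(d):
        # Assumed ZPD-like reward: highest slightly above current ability.
        return math.exp(-((d - (ability + 0.1)) ** 2) / 0.02)

    xs, ys = [], []
    for _ in range(rounds):
        d = gp_ucb_select(levels, xs, ys)
        xs.append(d)
        ys.append(gain(d) + rng.gauss(0, 0.05))  # noisy feedback
    return xs

if __name__ == "__main__":
    print(simulate())
```

The `beta` parameter controls the trade-off: a larger value favors levels with high posterior variance (exploration), a smaller one favors levels with high posterior mean (exploitation).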


Pages: 474 - 481

Page count: 8

50 items in total

[1] Reinforcement Learning Based on Contextual Bandits for Personalized Online Learning Recommendation Systems
Wacharawan Intayoad
Chayapol Kamyod
Punnarumol Temdee
[J]. Wireless Personal Communications, 2020, 115 : 2917 - 2932
[2] Reinforcement Learning Based on Contextual Bandits for Personalized Online Learning Recommendation Systems
Intayoad, Wacharawan
Kamyod, Chayapol
Temdee, Punnarumol
[J]. WIRELESS PERSONAL COMMUNICATIONS, 2020, 115 (04) : 2917 - 2932
[3] eTUTOR: ONLINE LEARNING FOR PERSONALIZED EDUCATION
Tekin, Cem
Braun, Jonas
van der Schaar, Mihaela
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5545 - 5549
[4] Online language education recommendation based on personalized learning and edge computing
Wang, Ziling
[J]. INTERNET TECHNOLOGY LETTERS, 2023,
[5] Student Behavior Simulation in English Online Education Based on Reinforcement Learning
Wang, Wenjing
[J]. International Journal of Interactive Mobile Technologies, 2023, 17 (22) : 136 - 151
[6] Personalized education planner (PIP): An online education ontology construction for personalized learning object annotation
Fak, Apple W. P.
Ip, Horace H. S.
[J]. PROCEEDINGS OF THE SIXTH IASTED INTERNATIONAL CONFERENCE ON WEB-BASED EDUCATION, 2007, : 80 - +
[7] AN EVALUATION OF PERSONALIZED LEARNING BY ONLINE INFORMAL EDUCATION IN CASE OF DESIGN EDUCATION
Guzel, Zehra Tugba
[J]. TURKISH ONLINE JOURNAL OF DISTANCE EDUCATION, 2024, 25 (02) : 248 - 262
[8] An Online Education Course Recommendation Method Based on Knowledge Graphs and Reinforcement Learning
Guan, Honglei
[J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (06)
[9] Quality Evaluation of Online Mental Health Education Based on Reinforcement Learning in the Pandemic
Zhang, Weifeng
[J]. DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2021, 2021
[10] Multi-Modal LA in Personalized Education Using Deep Reinforcement Learning Based Approach
Sharif, Muddsair
Uckelmann, Dieter
[J]. IEEE ACCESS, 2024, 12 : 54049 - 54065
