Adversarial Autoencoder and Multi-Armed Bandit for Dynamic Difficulty Adjustment in Immersive Virtual Reality for Rehabilitation: Application to Hand Movement

被引：6

作者：

Kamikokuryo, Kenta ^{[1
]}

Haga, Takumi ^{[1
]}

Venture, Gentiane ^{[2
]}

Hernandez, Vincent ^{[1
,3
]}

机构：

[1] Tokyo Univ Agr & Technol, Dept Mech Syst Engn, Tokyo 1840012, Japan

[2] Univ Tokyo, Dept Mech Engn, Tokyo 1138654, Japan

[3] Surfclean Inc, Sagamihara, Kanagawa 2520131, Japan

来源：

SENSORS | 2022年 / 22卷 / 12期

关键词：

machine learning; reinforcement learning; multi-armed bandit; immersive virtual reality; dynamic difficulty adjustment; end effector;

D O I：

10.3390/s22124499

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Motor rehabilitation is used to improve motor control skills to improve the patient's quality of life. Regular adjustments based on the effect of therapy are necessary, but this can be time-consuming for the clinician. This study proposes to use an efficient tool for high-dimensional data by considering a deep learning approach for dimensionality reduction of hand movement recorded using a wireless remote control embedded with the Oculus Rift S. This latent space is created as a visualization tool also for use in a reinforcement learning (RL) algorithm employed to provide a decision-making framework. The data collected consists of motions drawn with wireless remote control in an immersive VR environment for six different motions called "Cube", "Cylinder", "Heart", "Infinity", "Sphere", and "Triangle". From these collected data, different artificial databases were created to simulate variations of the data. A latent space representation is created using an adversarial autoencoder (AAE), taking into account unsupervised (UAAE) and semi-supervised (SSAAE) training. Then, each test point is represented by a distance metric and used as a reward for two classes of Multi-Armed Bandit (MAB) algorithms, namely Boltzmann and Sibling Kalman filters. The results showed that AAE models can represent high-dimensional data in a two-dimensional latent space and that MAB agents can efficiently and quickly learn the distance evolution in the latent space. The results show that Sibling Kalman filter exploration outperforms Boltzmann exploration with an average cumulative weighted probability error of 7.9 versus 19.9 using the UAAE latent space representation and 8.0 versus 20.0 using SSAAE. In conclusion, this approach provides an effective approach to visualize and track current motor control capabilities regarding a target in order to reflect the patient's abilities in VR games in the context of DDA.

引用

页数：22

共 3 条

[1] Dynamic Difficulty Adjustment in Virtual Reality Applications for Upper Limb Rehabilitation
Valencia, Yessica
Majin, Jhon
Guzman, Diego
Londono, Jeronimo
[J]. 2018 IEEE 2ND COLOMBIAN CONFERENCE ON ROBOTICS AND AUTOMATION (CCRA), 2018,
[2] Behavioral and Psychophysiological Measures of Engagement During Dynamic Difficulty Adjustment in Immersive Virtual Reality
Caldas, Oscar I.
Mauledoux, Mauricio
Aviles, Oscar F.
Rodriguez-Guerrero, Carlos
[J]. JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2023, 29 (01) : 16 - 33
[3] Dynamic Unknown Worker Recruitment for Heterogeneous Contextual Labeling Tasks Using Adversarial Multi-Armed Bandit
Xiao, Wucheng
Xiao, Mingjun
Xu, Yin
[J]. 2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN, 2022, : 518 - 525

← 1 →