A Deep Reinforcement Learning Method For Multimodal Data Fusion in Action Recognition

被引：18

作者：

Guo, Jiale ^{[1
]}

Liu, Qiang ^{[1
]}

Chen, Enqing ^{[1
]}

机构：

[1] Zhengzhou Univ, Sch Informat Engn, Zhengzhou 450001, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2022年 / 29卷

关键词：

Reinforcement learning; Resource management; Data models; Signal processing algorithms; Task analysis; Neural networks; Decision making; Multimodal action recognition; TD3; fusion weight allocation; deep reinforcement learning;

D O I：

10.1109/LSP.2021.3128379

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

At present, in the research of multimodal human action recognition, the weighted fusion method with fixed weight is widely applied in the decision level fusion of most models. In this way, the weight is usually obtained from the original experience or traversal search, which is inaccurate or has a large amount of calculation, and ignores the different representation ability of various modal data for various classes of action information. With the help of the powerful decision-making ability of deep reinforcement learning, we propose a multimodal decision-making fusion weight allocation network based on deep reinforcement learning. This letter mainly discusses the design of the model, which involves the modeling of reinforcement learning problem in action recognition, the design of neural network and the selection of problem-solving scheme. Experimental results on NTU RGB + D and HMDB51 datasets show the effectiveness of the proposed method.

引用

页码：120 / 124

页数：5

共 50 条

[11] Early, intermediate and late fusion strategies for robust deep learning-based multimodal action recognition
Boulahia, Said Yacine
Amamra, Abdenour
Madi, Mohamed Ridha
Daikh, Said
MACHINE VISION AND APPLICATIONS, 2021, 32 (06)
[12] Sensor Data Acquisition and Multimodal Sensor Fusion for Human Activity Recognition Using Deep Learning
Chung, Seungeun
Lim, Jiyoun
Noh, Kyoung Ju
Kim, Gague
Jeong, Hyuntae
SENSORS, 2019, 19 (07)
[13] Better Deep Visual Attention with Reinforcement Learning in Action Recognition
Wang, Gang
Wang, Wenmin
Wang, Jingzhuo
Bu, Yaohua
2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017,
[14] PREDICTABILITY ANALYZING: DEEP REINFORCEMENT LEARNING FOR EARLY ACTION RECOGNITION
Chen, Xiaokai
Gao, Ke
Caol, Juan
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 958 - 963
[15] Action recognition method of spatio-temporal feature fusion deep learning network
Pei, Xiaomin
Fan, Huijie
Tang, Yandong
Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2018, 47 (02):
[16] Online Learning for Multimodal Data Fusion With Application to Object Recognition
Shahrampour, Shahin
Noshad, Mohammad
Ding, Jie
Tarokh, Vahid
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2018, 65 (09) : 1259 - 1263
[17] DGTRL: Deep graph transfer reinforcement learning method based on fusion of knowledge and data
Chen, Genxin
Qi, Jin
Gao, Yu
Zhu, Xingjian
Dong, Zhenjiang
Sun, Yanfei
INFORMATION SCIENCES, 2024, 658
[18] Deep Multimodal Data Fusion
Zhao, Fei
Zhang, Chengcui
Geng, Baocheng
ACM COMPUTING SURVEYS, 2024, 56 (09)
[19] Multimodal data fusion for cancer biomarker discovery with deep learning
Steyaert, Sandra
Pizurica, Marija
Nagaraj, Divya
Khandelwal, Priya
Hernandez-Boussard, Tina
Gentles, Andrew J.
Gevaert, Olivier
NATURE MACHINE INTELLIGENCE, 2023, 5 (04) : 351 - 362
[20] Multimodal data fusion for cancer biomarker discovery with deep learning
Sandra Steyaert
Marija Pizurica
Divya Nagaraj
Priya Khandelwal
Tina Hernandez-Boussard
Andrew J. Gentles
Olivier Gevaert
Nature Machine Intelligence, 2023, 5 : 351 - 362

← 1 2 3 4 5 →