A Deep Reinforcement Learning Method For Multimodal Data Fusion in Action Recognition

被引:18
|
作者
Guo, Jiale [1 ]
Liu, Qiang [1 ]
Chen, Enqing [1 ]
机构
[1] Zhengzhou Univ, Sch Informat Engn, Zhengzhou 450001, Peoples R China
关键词
Reinforcement learning; Resource management; Data models; Signal processing algorithms; Task analysis; Neural networks; Decision making; Multimodal action recognition; TD3; fusion weight allocation; deep reinforcement learning;
D O I
10.1109/LSP.2021.3128379
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
At present, in the research of multimodal human action recognition, the weighted fusion method with fixed weight is widely applied in the decision level fusion of most models. In this way, the weight is usually obtained from the original experience or traversal search, which is inaccurate or has a large amount of calculation, and ignores the different representation ability of various modal data for various classes of action information. With the help of the powerful decision-making ability of deep reinforcement learning, we propose a multimodal decision-making fusion weight allocation network based on deep reinforcement learning. This letter mainly discusses the design of the model, which involves the modeling of reinforcement learning problem in action recognition, the design of neural network and the selection of problem-solving scheme. Experimental results on NTU RGB + D and HMDB51 datasets show the effectiveness of the proposed method.
引用
收藏
页码:120 / 124
页数:5
相关论文
共 50 条
  • [11] Early, intermediate and late fusion strategies for robust deep learning-based multimodal action recognition
    Boulahia, Said Yacine
    Amamra, Abdenour
    Madi, Mohamed Ridha
    Daikh, Said
    MACHINE VISION AND APPLICATIONS, 2021, 32 (06)
  • [12] Sensor Data Acquisition and Multimodal Sensor Fusion for Human Activity Recognition Using Deep Learning
    Chung, Seungeun
    Lim, Jiyoun
    Noh, Kyoung Ju
    Kim, Gague
    Jeong, Hyuntae
    SENSORS, 2019, 19 (07)
  • [13] Better Deep Visual Attention with Reinforcement Learning in Action Recognition
    Wang, Gang
    Wang, Wenmin
    Wang, Jingzhuo
    Bu, Yaohua
    2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017,
  • [14] PREDICTABILITY ANALYZING: DEEP REINFORCEMENT LEARNING FOR EARLY ACTION RECOGNITION
    Chen, Xiaokai
    Gao, Ke
    Caol, Juan
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 958 - 963
  • [15] Action recognition method of spatio-temporal feature fusion deep learning network
    Pei, Xiaomin
    Fan, Huijie
    Tang, Yandong
    Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2018, 47 (02):
  • [16] Online Learning for Multimodal Data Fusion With Application to Object Recognition
    Shahrampour, Shahin
    Noshad, Mohammad
    Ding, Jie
    Tarokh, Vahid
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2018, 65 (09) : 1259 - 1263
  • [17] DGTRL: Deep graph transfer reinforcement learning method based on fusion of knowledge and data
    Chen, Genxin
    Qi, Jin
    Gao, Yu
    Zhu, Xingjian
    Dong, Zhenjiang
    Sun, Yanfei
    INFORMATION SCIENCES, 2024, 658
  • [18] Deep Multimodal Data Fusion
    Zhao, Fei
    Zhang, Chengcui
    Geng, Baocheng
    ACM COMPUTING SURVEYS, 2024, 56 (09)
  • [19] Multimodal data fusion for cancer biomarker discovery with deep learning
    Steyaert, Sandra
    Pizurica, Marija
    Nagaraj, Divya
    Khandelwal, Priya
    Hernandez-Boussard, Tina
    Gentles, Andrew J.
    Gevaert, Olivier
    NATURE MACHINE INTELLIGENCE, 2023, 5 (04) : 351 - 362
  • [20] Multimodal data fusion for cancer biomarker discovery with deep learning
    Sandra Steyaert
    Marija Pizurica
    Divya Nagaraj
    Priya Khandelwal
    Tina Hernandez-Boussard
    Andrew J. Gentles
    Olivier Gevaert
    Nature Machine Intelligence, 2023, 5 : 351 - 362