A Deep Reinforcement Learning Method For Multimodal Data Fusion in Action Recognition

Cited by: 18
Authors
Guo, Jiale [1]
Liu, Qiang [1]
Chen, Enqing [1]
Affiliation
[1] Zhengzhou Univ, Sch Informat Engn, Zhengzhou 450001, Peoples R China
Keywords
Reinforcement learning; Resource management; Data models; Signal processing algorithms; Task analysis; Neural networks; Decision making; Multimodal action recognition; TD3; fusion weight allocation; deep reinforcement learning;
DOI
10.1109/LSP.2021.3128379
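Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract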
In multimodal human action recognition, most models perform decision-level fusion with fixed fusion weights. These weights are usually set from prior experience or found by exhaustive search, which is either inaccurate or computationally expensive, and they ignore the fact that each modality represents different action classes with different strength. Leveraging the decision-making ability of deep reinforcement learning, we propose a network that allocates decision-level fusion weights across modalities. This letter focuses on the design of the model: the formulation of action recognition as a reinforcement learning problem, the design of the neural network, and the choice of solution method. Experimental results on the NTU RGB+D and HMDB51 datasets show the effectiveness of the proposed method.
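The decision-level weighted fusion that the abstract refers to can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the two modalities (RGB and skeleton), the class scores, and both weight vectors are hypothetical, and the "adaptive" weights are hand-picked stand-ins rather than the output of the TD3 agent described in the letter.

```python
from typing import List, Sequence


def fuse_scores(scores: Sequence[Sequence[float]],
                weights: Sequence[float]) -> List[float]:
    """Decision-level fusion: weighted sum of per-modality class scores.

    Weights are normalised to sum to 1 before fusing.
    """
    if len(scores) != len(weights):
        raise ValueError("need exactly one weight per modality")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    norm = [w / total for w in weights]
    num_classes = len(scores[0])
    return [sum(w * s[c] for w, s in zip(norm, scores))
            for c in range(num_classes)]


def predict(fused: Sequence[float]) -> int:
    """Return the index of the class with the highest fused score."""
    return max(range(len(fused)), key=lambda c: fused[c])


# Hypothetical softmax outputs for one sample over three action classes.
rgb_scores = [0.50, 0.40, 0.10]
skeleton_scores = [0.20, 0.70, 0.10]

# Fixed weights: the same pair is applied to every sample.
fixed = fuse_scores([rgb_scores, skeleton_scores], [0.5, 0.5])

# Per-sample weights: hand-picked here; in the letter's setting a learned
# policy would supply them instead (that policy is not reproduced here).
adaptive = fuse_scores([rgb_scores, skeleton_scores], [0.9, 0.1])

print(predict(fixed), predict(adaptive))
```

With these made-up scores the two weightings select different classes, which is the sensitivity that motivates learning the weights rather than fixing them.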
Pages: 120-124
Number of pages: 5