Conflux LSTMs Network: A Novel Approach for Multi-View Action Recognition

Cited by: 43
Authors
Ullah, Amin [1]
Muhammad, Khan [2]
Hussain, Tanveer [1]
Baik, Sung Wook [1]
Affiliations
[1] Sejong Univ, Seoul, South Korea
[2] Sejong Univ, Dept Software, Seoul, South Korea
Funding
National Research Foundation of Singapore;
Keywords
Artificial intelligence; Deep learning; Action recognition; Multi-view video analytics; Sequence learning; LSTM; CNN; Multi-view action recognition; NEURAL-NETWORKS; SURVEILLANCE; FEATURES;
DOI
10.1016/j.neucom.2019.12.151
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Multi-view action recognition (MVAR) is an optimal technique for acquiring numerous clues from multi-view data for effective action recognition; however, it has not yet been well explored. The MVAR domain poses several challenges, such as divergence in viewpoints, invisible regions, and different scales of appearance in each view, that require better solutions for real-world applications. In this paper, we present a conflux long short-term memory (LSTMs) network to recognize actions from multi-view cameras. The proposed framework has four major steps: 1) frame-level feature extraction, 2) propagation of these features through the conflux LSTMs network to learn view self-reliant patterns, 3) learning of view inter-reliant patterns and correlation computation, and 4) action classification. First, we extract deep features from a sequence of frames for each view using a pre-trained VGG19 CNN model. Second, we forward the extracted features to the conflux LSTMs network to learn the view self-reliant patterns. In the next step, we compute the inter-view correlations using the pairwise dot product of the LSTMs network outputs corresponding to different views, in order to learn the view inter-reliant patterns. In the final step, we use flatten layers followed by a SoftMax classifier for action recognition. Experimental results on benchmark datasets show an increase of 3% and 2% over the state of the art on the Northwestern-UCLA and MCAD datasets, respectively. (c) 2021 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
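The last two steps of the abstract (pairwise dot-product correlation, then flatten and SoftMax) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give tensor shapes or layer sizes, so the sketch assumes each view yields a short sequence of output vectors, takes the correlation of two views to be the matrix of dot products between their time steps, and uses a single hypothetical linear layer before the softmax. All function names, the toy vectors, and the weights are illustrative assumptions.

```python
import math
from itertools import combinations


def matmul_transpose(a, b):
    """Return a @ b^T for two lists of equal-width row vectors."""
    return [[sum(x * y for x, y in zip(row_a, row_b)) for row_b in b]
            for row_a in a]


def inter_view_correlations(view_outputs):
    """Dot-product correlation matrix for every unordered pair of views."""
    return {
        (i, j): matmul_transpose(view_outputs[i], view_outputs[j])
        for i, j in combinations(range(len(view_outputs)), 2)
    }


def flatten(correlations):
    """Concatenate all correlation matrices into one feature vector."""
    return [value
            for pair in sorted(correlations)
            for row in correlations[pair]
            for value in row]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(features, weights, biases):
    """Hypothetical linear layer followed by softmax (assumed, not from the paper)."""
    logits = [sum(w * f for w, f in zip(row, features)) + b
              for row, b in zip(weights, biases)]
    return softmax(logits)


# Toy stand-ins for per-view LSTM outputs: 3 views, 2 time steps, 2 dimensions.
views = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.5, 0.5], [1.0, 0.0]],
    [[0.0, 2.0], [1.0, 1.0]],
]
corr = inter_view_correlations(views)
features = flatten(corr)

# Two action classes with arbitrary illustrative weights.
weights = [[0.1] * len(features), [-0.1] * len(features)]
biases = [0.0, 0.0]
probs = classify(features, weights, biases)
print(probs)
```

With V views this yields V(V-1)/2 correlation matrices, so the flattened feature length grows quadratically with the number of cameras; a real model would learn the classifier weights rather than fix them as done here.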
Pages: 321-329
Page count: 9