An ensemble approach for still image-based human action recognition

被引:5
|
作者
Banerjee, Avinandan [1 ]
Roy, Sayantan [1 ]
Kundu, Rohit [2 ]
Singh, Pawan Kumar [1 ]
Bhateja, Vikrant [3 ,4 ]
Sarkar, Ram [5 ]
机构
[1] Jadavpur Univ, Dept Informat Technol, Jadavpur Univ Second Campus,Plot 8,LB Block, Kolkata 700106, W Bengal, India
[2] Jadavpur Univ, Dept Elect Engn, 188 Raja SC Mallick Rd, Kolkata 700032, W Bengal, India
[3] Shri Ramswaroop Mem Coll Engn & Management, Dept Elect & Commun Engn, Faizabad Rd, Lucknow 226028, Uttar Pradesh, India
[4] Dr APJ Abdul Kalam Tech Univ, Lucknow, Uttar Pradesh, India
[5] Jadavpur Univ, Dept Comp Sci & Engn, 188 Raja SC Mallick Rd, Kolkata 700032, W Bengal, India
来源
NEURAL COMPUTING & APPLICATIONS | 2022年 / 34卷 / 21期
关键词
Human action recognition; Still images; Choquet integral; CNN; Attention module; Ensemble learning; INFORMATION FUSION; ATTENTION;
D O I
10.1007/s00521-022-07514-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Still-image based human action recognition is a challenging task in the field of computer vision due to the limited information available in a single image. Hence, it is important to efficiently extract visual cues and structural information from the image in the process of classification. To this end, in this work, we utilize the Convolutional neural network for classification, based on the DenseNet 201 architecture. To focus upon informative regions of interest, the spatial attention module has been trained as a feature extractor to emphasize features from selective parts of the input image. We further leverage an effective ensemble approach based upon fuzzy fusion through the Choquet integral, which harnesses the complementary uncertainty of decision scores. This allows for a robust decision-making process on the fly, based upon coalitions of the inputs. Experimental results upon three challenging datasets: PPMI and Stanford 40, known for their confusing action classes, and BU-101, known for its immense scale, support the efficacy of the proposed method.
引用
收藏
页码:19269 / 19282
页数:14
相关论文
共 50 条
  • [1] An ensemble approach for still image-based human action recognition
    Avinandan Banerjee
    Sayantan Roy
    Rohit Kundu
    Pawan Kumar Singh
    Vikrant Bhateja
    Ram Sarkar
    [J]. Neural Computing and Applications, 2022, 34 : 19269 - 19282
  • [2] A survey on still image based human action recognition
    Guo, Guodong
    Lai, Alice
    [J]. PATTERN RECOGNITION, 2014, 47 (10) : 3343 - 3361
  • [3] Still Image-based Human Activity Recognition with Deep Representations and Residual Learning
    Siyal, Ahsan Raza
    Bhutto, Zuhaibuddin
    Shah, Syed Muhammad Shehram
    Iqbal, Azhar
    Mehmood, Faraz
    Hussain, Ayaz
    Ahmed, Saleem
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 471 - 477
  • [4] Human action recognition using an image-based temporal and spatial representation
    Silva, Vinicius
    Soares, Filomena
    Esteves, Joao Sena
    Vercelli, Gianni
    [J]. 2020 12TH INTERNATIONAL CONGRESS ON ULTRA MODERN TELECOMMUNICATIONS AND CONTROL SYSTEMS AND WORKSHOPS (ICUMT 2020), 2020, : 41 - 46
  • [5] Deep Ensemble Learning for Human Action Recognition in Still Images
    Yu, Xiangchun
    Zhang, Zhe
    Wu, Lei
    Pang, Wei
    Chen, Hechang
    Yu, Zhezhou
    Li, Bin
    [J]. COMPLEXITY, 2020, 2020 (2020)
  • [6] Depth Image-based Object Segmentation Scheme for Improving Human Action Recognition
    Park, Sungjoo
    Park, Unsang
    Kim, Dongchil
    [J]. 2018 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2018, : 483 - 485
  • [7] Image-based Pose Representation for Action Recognition and Hand Gesture Recognition
    Lin, Zeyi
    Zhang, Wei
    Deng, Xiaoming
    Ma, Cuixia
    Wang, Hongan
    [J]. 2020 15TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2020), 2020, : 532 - 539
  • [8] Handling Imbalanced Data Using a Cascade Model for Image-Based Human Action Recognition
    Wahyono
    Suprapto
    Rezky, Adam
    Rokhman, Nur
    Jo, Kang-Hyun
    [J]. Journal of Computing Science and Engineering, 2023, 17 (04) : 207 - 215
  • [9] Ensemble classifier and its application to image-based MICR Character Recognition
    Zhang, Ping
    [J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 40 - 45
  • [10] An image-based approach for human communication on the WWW
    Ogawa, T
    Tsukamoto, M
    Loh, YH
    Nakao, T
    Nishio, S
    [J]. PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON COMMUNICATIONS, INTERNET, AND INFORMATION TECHNOLOGY, 2002, : 472 - 478