Deep Learning-Based Human Action Recognition with Key-Frames Sampling Using Ranking Methods

被引:8
|
作者
Tasnim, Nusrat [1 ]
Baek, Joong-Hwan [1 ]
机构
[1] Korea Aerosp Univ, Sch Elect & Informat Engn, Goyang 10540, South Korea
来源
APPLIED SCIENCES-BASEL | 2022年 / 12卷 / 09期
关键词
human-machine or object interaction; human action recognition; deep learning; key frames sampling; ranking method;
D O I
10.3390/app12094165
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Nowadays, the demand for human-machine or object interaction is growing tremendously owing to its diverse applications. The massive advancement in modern technology has greatly influenced researchers to adopt deep learning models in the fields of computer vision and image-processing, particularly human action recognition. Many methods have been developed to recognize human activity, which is limited to effectiveness, efficiency, and use of data modalities. Very few methods have used depth sequences in which they have introduced different encoding techniques to represent an action sequence into the spatial format called dynamic image. Then, they have used a 2D convolutional neural network (CNN) or traditional machine learning algorithms for action recognition. These methods are completely dependent on the effectiveness of the spatial representation. In this article, we propose a novel ranking-based approach to select key frames and adopt a 3D-CNN model for action classification. We directly use the raw sequence instead of generating the dynamic image. We investigate the recognition results with various levels of sampling to show the competency and robustness of the proposed system. We also examine the universality of the proposed method on three benchmark human action datasets: DHA (depth-included human action), MSR-Action3D (Microsoft Action 3D), and UTD-MHAD (University of Texas at Dallas Multimodal Human Action Dataset). The proposed method secures better performance than state-of-the-art techniques using depth sequences.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Key frame and skeleton extraction for deep learning-based human action recognition
    Hai-Hong Phan
    Trung Tin Nguyen
    Ngo Huu Phuc
    Nguyen Huu Nhan
    Do Minh Hieu
    Cao Truong Tran
    Bao Ngoc Vi
    2021 RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF 2021), 2021, : 180 - 185
  • [2] Human Action Recognition Using a Depth Sequence Key-frames Based on Discriminative Collaborative Representation Classifier for Healthcare Analytics
    Wang, Yuhang
    Feng, Tao
    Zheng, Yi
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2022, 19 (03) : 1445 - 1462
  • [3] Deep Learning-Based Human Action Recognition in Videos
    Li, Song
    Shi, Qian
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2025, 34 (01)
  • [4] Color normalization for appearance based recognition of video key-frames
    Sánchez, JM
    Binefa, X
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 815 - 818
  • [5] Human Action Recognition Based on Key Frames
    Hu, Yong
    Zheng, Wei
    ADVANCES IN COMPUTER SCIENCE AND EDUCATION APPLICATIONS, PT II, 2011, 202 : 535 - 542
  • [6] Human Action Recognition Using Deep Learning Methods
    Yu, Zeqi
    Yan, Wei Qi
    2020 35TH INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2020,
  • [7] Human activity recognition: A review of deep learning-based methods
    Dutta, Sanjay Jyoti
    Boongoen, Tossapon
    Zwiggelaar, Reyer
    IET COMPUTER VISION, 2025, 19 (01)
  • [8] Deep Learning-based Fast Hand Gesture Recognition using Representative Frames
    John, Vijay
    Boyali, Ali
    Mita, Seiichi
    Imanishi, Masayuki
    Sanma, Norio
    2016 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2016, : 31 - 38
  • [9] The Deep Learning-based Human Action Recognition System for Competitive Sports
    Wang, Xin
    Guo, Yingqing
    JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2024, 68 (03)
  • [10] An Efficient Method for Extracting Key-Frames from 3D Human Joint Locations for Action Recognition
    Kabir, Md Hasanul
    Ahmed, Ferdous
    Abdullah-Al-Tariq
    IMAGE ANALYSIS AND RECOGNITION (ICIAR 2015), 2015, 9164 : 277 - 284