Enhancing Human Action Recognition through Temporal Saliency

Authors
Adeli, Vida [1]
Fazl-Ersi, Ehsan [1]
Harati, Ahad [1]
Affiliations
[1] Ferdowsi Univ Mashhad, Dept Comp Engn, Mashhad, Razavi Khorasan, Iran
Keywords
Action recognition; Motion; Region proposal; Convolutional Neural Networks; Actionness
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Images and videos have become ubiquitous in every aspect of life owing to the growing number of digital recording devices. This has encouraged the development of algorithms that analyze video content and perform human action recognition. This paper investigates the challenging problem of action recognition by outlining a new approach to representing a video sequence. A novel framework is developed to produce informative features for action labeling in a weakly-supervised learning (WSL) setting, during both training and testing. Using appearance and motion information, the goal is to identify frame regions that are likely to contain actions. A three-stream convolutional neural network is adopted and improved with a method based on extracting actionness regions. This reduces computation, because only some parts of each RGB frame are processed, and it also limits the interpretation of non-activity-related regions, which can mislead the recognition system. We use the UCF Sports dataset, a collection of realistic sports videos, as our evaluation benchmark. We show that the proposed approach can outperform existing state-of-the-art methods.
Pages: 176-181
Number of pages: 6