A component-based video content representation for action recognition

Cited by: 9
Authors
Adeli, Vida [1]
Fazl-Ersi, Ehsan [1]
Harati, Ahad [1]
Affiliations
[1] Ferdowsi Univ Mashhad, Dept Comp Engn, Mashhad 9177948944, Razavi Khorasan, Iran
Keywords
Actionness likelihood; Action recognition; Action components; LSTM; Three-stream convolutional neural network; MOTION REPRESENTATION;
DOI
10.1016/j.imavis.2019.08.009
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper investigates the challenging problem of action recognition in videos and proposes a new component-based approach for video content representation. Although satisfactory action recognition performance has already been obtained in certain scenarios, many existing solutions require fully annotated video datasets in which the region of the activity in each frame is specified by a bounding box. Another group of methods requires auxiliary techniques to extract human-related areas in the video frames before actions can be recognized accurately. In this paper, a Weakly-Supervised Learning (WSL) framework is introduced that eliminates the need for per-frame annotations and learns video representations that improve recognition accuracy and also highlight the activity-related regions within each frame. To this end, two new representation ideas are proposed: one focuses on representing the main components of an action, i.e. actionness regions, and the other focuses on encoding the background context to represent general and holistic cues. A three-stream CNN is developed, which takes the two proposed representations and combines them with a motion-encoding stream. Temporal cues in each of the three streams are modeled through an LSTM, and finally fully-connected neural network layers are used to fuse the streams and produce the final video representation. Experimental results on four challenging datasets demonstrate that the proposed Component-based Multi-stream CNN model (CM-CNN), trained in a WSL setting, outperforms the state of the art in action recognition, including fully-supervised approaches. (C) 2019 Elsevier B.V. All rights reserved.
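The pipeline outlined in the abstract (three streams, per-stream temporal modeling with an LSTM, then fully-connected fusion) can be illustrated with a minimal sketch. This is not the authors' CM-CNN implementation: the per-frame feature vectors stand in for CNN outputs and are random here, the weights are untrained, biases are omitted, a single fusion layer replaces the paper's fully-connected layers, and every name and dimension (`TinyLSTM`, `classify_video`, `feat_dim`, and so on) is hypothetical.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def matvec(W, v):
    # Multiply a matrix (list of rows) by a vector.
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def rand_matrix(rows, cols, rng):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def softmax(v):
    m = max(v)
    e = [math.exp(x - m) for x in v]
    s = sum(e)
    return [x / s for x in e]

class TinyLSTM:
    """Single-layer LSTM without biases (illustrative only)."""

    def __init__(self, in_dim, hid_dim, rng):
        self.hid_dim = hid_dim
        # One weight matrix per gate: input, forget, output, candidate.
        self.W = {g: rand_matrix(hid_dim, in_dim + hid_dim, rng) for g in "ifoc"}

    def run(self, seq):
        h = [0.0] * self.hid_dim
        c = [0.0] * self.hid_dim
        for x in seq:
            z = x + h  # concatenate frame feature and previous hidden state
            i = [sigmoid(v) for v in matvec(self.W["i"], z)]
            f = [sigmoid(v) for v in matvec(self.W["f"], z)]
            o = [sigmoid(v) for v in matvec(self.W["o"], z)]
            g = [math.tanh(v) for v in matvec(self.W["c"], z)]
            c = [fi * ci + ii * gi for fi, ci, ii, gi in zip(f, c, i, g)]
            h = [oi * math.tanh(ci) for oi, ci in zip(o, c)]
        return h  # last hidden state summarizes the stream over time

STREAMS = ("actionness", "context", "motion")

def classify_video(streams, lstms, W_fuse):
    # Late fusion: concatenate each stream's temporal summary, then apply
    # one fully-connected layer followed by softmax over action classes.
    fused = []
    for name in STREAMS:
        fused += lstms[name].run(streams[name])
    return softmax(matvec(W_fuse, fused))

# Assumed toy dimensions; real per-frame features would come from CNNs.
rng = random.Random(0)
feat_dim, hid_dim, n_classes, n_frames = 8, 4, 5, 6
streams = {
    n: [[rng.random() for _ in range(feat_dim)] for _ in range(n_frames)]
    for n in STREAMS
}
lstms = {n: TinyLSTM(feat_dim, hid_dim, rng) for n in STREAMS}
W_fuse = rand_matrix(n_classes, len(STREAMS) * hid_dim, rng)
probs = classify_video(streams, lstms, W_fuse)
```

Here `probs` is a probability distribution over the assumed five action classes. Keeping one LSTM per stream means each cue (actionness regions, background context, motion) is summarized over time independently before the fusion layer combines them.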
Pages: 15
Related papers
50 in total
  • [21] Sparse Low-Rank Component-Based Representation for Face Recognition With Low-Quality Images
    Yang, Shicheng
    Zhang, Le
    He, Lianghua
    Wen, Ying
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2019, 14 (01) : 251 - 261
  • [22] Robust and Effective Component-Based Banknote Recognition for the Blind
    Hasanuzzaman, Faiz M.
    Yang, Xiaodong
    Tian, YingLi
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06) : 1021 - 1030
  • [23] Component-based target recognition inspired by human vision
    Zheng, Yufeng
    Agyepong, Kwabena
    AUTOMATIC TARGET RECOGNITION XIX, 2009, 7335
  • [24] Face recognition: component-based versus global approaches
    Heisele, B
    Ho, P
    Wu, J
    Poggio, T
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2003, 91 (1-2) : 6 - 21
  • [25] Invariant Feature Extraction for Component-based Facial Recognition
    Hassan, Adam
    Viriri, Serestina
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (03) : 695 - 698
  • [26] INCREMENTAL MACHINE LEARNING APPROACH FOR COMPONENT-BASED RECOGNITION
    Elgawi, Osman Hassab
    IMAGAPP 2009: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER IMAGING THEORY AND APPLICATIONS, 2009, : 5 - 12
  • [27] A Component-based Framework for the Internet Content Adaptation Domain
    Forte, Marcos
    Claudino, Renato A. T.
    de Souza, Wanderley Lopes
    do Prado, Antonio Francisco
    Santana, Luiz H. Z.
    APPLIED COMPUTING 2007, VOL 1 AND 2, 2007, : 1450 - +
  • [28] Component based representation for face recognition
    Wang Lijia
    Zhang Hua
    Wang Zhenjie
    PROCEEDINGS OF 2015 IEEE 12TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS (ICEMI), VOL. 3, 2015, : 1275 - 1278
  • [29] Improving knowledge representation, tutoring, and authoring in a component-based ILE
    Hunn, Charles
    Mavrikis, Manolis
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 3220 : 827 - 829
  • [30] Improving knowledge representation, tutoring, and authoring in a component-based ILE
    Hunn, C
    Mavrikis, M
    INTELLIGENT TUTORING SYSTEMS, PROCEEDINGS, 2004, 3220 : 827 - 829