Reassessing Hierarchical Representation for Action Recognition in Still Images

Cited by: 6
Authors
Li, Rui [1]
Liu, Zhenyu [1]
Tan, Jianrong [1]
Affiliations
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310027, Zhejiang, Peoples R China
Source
IEEE ACCESS | 2018 / Vol. 6
Funding
National Natural Science Foundation of China
Keywords
Action recognition; hierarchical representation; Fisher vector; spatial pyramid; CLASSIFICATION; MODEL;
DOI
10.1109/ACCESS.2018.2872798
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Typical still action recognition methods rely on human body part detection and object detection. However, current human body part detectors and object detectors are far from perfect, which negatively affects the subsequent spatial relation learning of human-object interactions (HOIs). Bag-of-features (BoF)-based methods go beyond such modeling paradigms, but they do not achieve state-of-the-art accuracies. In this paper, we propose two still action recognition methods that model HOI layouts by hierarchical image representation, rather than explicitly constructing HOI relations. The first method encodes a dense set of SIFT features using Fisher vectors, where an image is divided into increasingly fine regions with the spatial pyramid. The second method takes recent pretrained deep networks as feature extractors, where an image is divided into overlapped regions. Extensive comparison experiments demonstrate the improvement brought by the hierarchical representation. Our methods are simple and easy to use, yet they remarkably outperform BoF-based methods and complicated human-centric methods. To the best of our knowledge, our methods achieve the highest accuracies to date on the Sports, PPMI, and extended PPMI data sets.
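As a rough illustration of the spatial pyramid idea mentioned in the abstract, the following Python sketch splits an image into increasingly fine grids and pools local descriptors per cell. It is not the authors' implementation: the function names `pyramid_cells` and `pool_features` are hypothetical, the 2^l x 2^l grid layout is an assumed (commonly used) pyramid scheme, and plain mean pooling stands in for the Fisher vector encoding the paper uses.

```python
def pyramid_cells(width, height, levels):
    """Return (x0, y0, x1, y1) boxes for a spatial pyramid.

    Assumption: level l is a regular grid of 2**l x 2**l cells.
    """
    cells = []
    for level in range(levels):
        n = 2 ** level
        for row in range(n):
            for col in range(n):
                cells.append((
                    col * width / n,
                    row * height / n,
                    (col + 1) * width / n,
                    (row + 1) * height / n,
                ))
    return cells


def pool_features(points, descriptors, cells):
    """Average the descriptors located in each cell and concatenate the results.

    Mean pooling is only a stand-in for Fisher vector encoding.
    Cells are half-open, so a point lying exactly on the right or
    bottom image border is not assigned to any cell.
    """
    dim = len(descriptors[0])
    pooled = []
    for x0, y0, x1, y1 in cells:
        inside = [d for (x, y), d in zip(points, descriptors)
                  if x0 <= x < x1 and y0 <= y < y1]
        if inside:
            # Component-wise mean of the descriptors in this cell.
            pooled.extend(sum(col) / len(inside) for col in zip(*inside))
        else:
            # Empty cells contribute a zero block of the same size.
            pooled.extend([0.0] * dim)
    return pooled
```

With two pyramid levels, an image yields 1 + 4 = 5 cells, so the final representation is five times the length of a single pooled descriptor; coarse cells capture global appearance while fine cells retain a loose spatial layout.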
Pages: 61386-61400
Page count: 15