Hollywood 3D: What are the Best 3D Features for Action Recognition?

被引:14
|
作者
Hadfield, Simon [1 ]
Lebeda, Karel [1 ]
Bowden, Richard [1 ]
机构
[1] Univ Surrey, CVSSP, Guildford GU27XH, Surrey, England
基金
瑞士国家科学基金会; 英国工程与自然科学研究理事会;
关键词
Action recognition; In the wild; 3D; Structure; Depth; 3D motion; Hollywood; Benchmark; SCALE;
D O I
10.1007/s11263-016-0917-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Action recognition "in the wild" is extremely challenging, particularly when complex 3D actions are projected down to the image plane, losing a great deal of information. The recent growth of 3D data in broadcast content and commercial depth sensors, makes it possible to overcome this. However, there is little work examining the best way to exploit this new modality. In this paper we introduce the Hollywood 3D benchmark, which is the first dataset containing "in the wild" action footage including 3D data. This dataset consists of 650 stereo video clips across 14 action classes, taken from Hollywood movies. We provide stereo calibrations and depth reconstructions for each clip. We also provide an action recognition pipeline, and propose a number of specialised depth-aware techniques including five interest point detectors and three feature descriptors. Extensive tests allow evaluation of different appearance and depth encoding schemes. Our novel techniques exploiting this depth allow us to reach performance levels more than triple those of the best baseline algorithm using only appearance information. The benchmark data, code and calibrations are all made available to the community.
引用
收藏
页码:95 / 110
页数:16
相关论文
共 50 条
  • [1] Hollywood 3D: What are the Best 3D Features for Action Recognition?
    Simon Hadfield
    Karel Lebeda
    Richard Bowden
    [J]. International Journal of Computer Vision, 2017, 121 : 95 - 110
  • [2] 3D GLOH Features for Human Action Recognition
    Abdulmunem, Ashwan
    Lai, Yu-Kun
    Sun, Xianfang
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 805 - 810
  • [3] 3D Measures and Geometric Facial Features Analysis on 3D Facial Action Unit Recognition
    Ujir, Hamimah
    Assan, Cecilia
    Hipiny, Irwandi
    Iskandar, D. N. F. Awang
    [J]. ADVANCED SCIENCE LETTERS, 2015, 21 (12) : 3703 - 3707
  • [4] Fusing Spatiotemporal Features and Joints for 3D Action Recognition
    Zhu, Yu
    Chen, Wenbin
    Guo, Guodong
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 486 - 491
  • [5] Deep 3D Flow Features for Human Action Recognition
    Psaltis, Athanasios
    Papadopoulos, Georgios Th
    Daras, Petros
    [J]. 2018 16TH INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2018,
  • [6] Robust 3D Local SIFT Features for 3D Face Recognition
    Ming, Yue
    Jin, Yi
    [J]. INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2015), PT III, 2015, 9246 : 352 - 359
  • [7] 3D TRAJECTORIES FOR ACTION RECOGNITION
    Koperski, Michal
    Bilinski, Piotr
    Bremond, Francois
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4176 - 4180
  • [8] 3D RANs: 3D Residual Attention Networks for action recognition
    Cai, Jiahui
    Hu, Jianguo
    [J]. VISUAL COMPUTER, 2020, 36 (06): : 1261 - 1270
  • [9] 3D RANs: 3D Residual Attention Networks for action recognition
    Jiahui Cai
    Jianguo Hu
    [J]. The Visual Computer, 2020, 36 : 1261 - 1270
  • [10] Hollywood 3D: Recognizing Actions in 3D Natural Scenes
    Hadfield, Simon
    Bowden, Richard
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3398 - 3405