Space-Time Tree Ensemble for Action Recognition and Localization

Cited by: 0
Authors
Shugao Ma
Jianming Zhang
Stan Sclaroff
Nazli Ikizler-Cinbis
Leonid Sigal
Affiliations
[1] Boston University, Computer Science
[2] Adobe Research, Computer Engineering
[3] Hacettepe University
[4] Disney Research
Source
International Journal of Computer Vision, 2018, 126 (2-4)
Keywords
Action recognition; Action localization; Space-time tree structure
DOI
Not available
Abstract
Human actions are inherently structured patterns of body movements. We explore ensembles of hierarchical spatio-temporal trees, discovered directly from training data, to model these structures for action recognition and spatial localization. Discovering frequent and discriminative tree structures is challenging because the search space is exponential, particularly if partial matching is allowed. We address this by first building a concise action word vocabulary via discriminative clustering of hierarchical space-time segments, a two-level video representation that captures both static and non-static relevant space-time segments of the video. Using this vocabulary, we then apply tree mining, followed by tree clustering and ranking, to select a compact set of discriminative tree patterns. Our experiments show that these tree patterns, alone or in combination with shorter patterns (action words and pairwise patterns), achieve promising performance on three challenging datasets: UCF Sports, HighFive and Hollywood3D. Moreover, we perform cross-dataset validation, using trees learned on HighFive to recognize the same actions in Hollywood3D, and using trees learned on UCF Sports to recognize and localize similar actions in JHMDB. The results demonstrate the potential for cross-dataset generalization of the trees our approach discovers.
Pages: 314-332
Number of pages: 18
Related papers
  • [1] Space-Time Tree Ensemble for Action Recognition and Localization
    Ma, Shugao
    Zhang, Jianming
    Sclaroff, Stan
    Ikizler-Cinbis, Nazli
    Sigal, Leonid
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2018, 126 (2-4) : 314 - 332
  • [2] Space-Time Tree Ensemble for Action Recognition
    Ma, Shugao
    Sigal, Leonid
    Sclaroff, Stan
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 5024 - 5032
  • [3] Action Recognition and Localization by Hierarchical Space-Time Segments
    Ma, Shugao
    Zhang, Jianming
    Ikizler-Cinbis, Nazli
    Sclaroff, Stan
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2744 - 2751
  • [4] Space-time shapelets for action recognition
    Batra, Dhruv
    Chen, Tsuhan
    Sukthankar, Rahul
    [J]. 2008 IEEE WORKSHOP ON MOTION AND VIDEO COMPUTING, 2008, : 161 - 166
  • [5] Selected Space-Time Based Methods for Action Recognition
    Wojciechowski, Slawomir
    Kulbacki, Marek
    Segen, Jakub
    Wycislok, Rafal
    Bak, Artur
    Wereszczynski, Kamil
    Wojciechowski, Konrad
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT II, 2016, 9622 : 417 - 426
  • [6] Space-Time Robust Video Representation for Action Recognition
    Ballas, Nicolas
    Yang, Yi
    Lan, Zhen-zhong
    Delezoide, Bertrand
    Preteux, Francoise
    Hauptmann, Alex
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2704 - 2711
  • [7] Space-Time Neighborhood Based Hierarchical Descriptor for Action Recognition
    Wang, Haoran
    Yuan, Chunfeng
    Hu, Weiming
    Sun, Changyin
    [J]. 2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 95 - 99
  • [8] Group Action Recognition Using Space-Time Interest Points
    Wei, Qingdi
    Zhang, Xiaoqin
    Kong, Yu
    Hu, Weiming
    Ling, Haibin
    [J]. ADVANCES IN VISUAL COMPUTING, PT 2, PROCEEDINGS, 2009, 5876 : 757 - +
  • [9] Space-Time Localization and Mapping
    Lee, Minhaeng
    Fowlkes, Charless C.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3932 - 3941
  • [10] An ensemble approach to space-time interpolation
    Wentz, Elizabeth A.
    Peuquet, Donna J.
    Anderson, Sharolyn
    [J]. INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2010, 24 (09) : 1309 - 1325