First-Person Action Recognition With Temporal Pooling and Hilbert-Huang Transform

被引:9
|
作者
Purwanto, Didik [1 ]
Chen, Yie-Tarng [1 ]
Fang, Wen-Hsien [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Elect & Comp Engn, Taipei 106, Taiwan
关键词
Hilbert-Huang transform; temporal pooling; video descriptor; temporal aggregation; first-person videos; action recognition; EMPIRICAL MODE DECOMPOSITION; CLASSIFICATION; FEATURES; VISION; DENSE;
D O I
10.1109/TMM.2019.2919434
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a convolutional neural network (CNN)-based approach for first-person action recognition with a combination of temporal pooling and the Hilbert-Huang transform (HHT). The new approach first adaptively performs temporal subaction localization, treats each channel of the extracted trajectory pooled CNN features as a time series, and summarizes the temporal dynamic information in each sub-action by temporal pooling. The temporal evolution across sub-actions is then modeled by rank pooling. Thereafter, to account for the highly dynamic scene changes in first-person videos, the HHT is employed to decompose the ranked pooling features into finite and often few data-dependent functions, called intrinsic mode functions (IMFs), through empirical mode decomposition. Hilbert spectral analysis is then applied to each IMF component, and four salient descriptors are scrutinized and aggregated into the final video descriptor. Such a framework cannot only precisely acquire both long- and short-term tendencies, but also address the cumbersome significant camera motion in first-person videos to render better accuracy. Furthermore, it works well for complex actions for limited training samples. Simulations show that the proposed approach outperforms the main state-of-the-art methods when applied to four publicly available first-person video datasets.
引用
收藏
页码:3122 / 3135
页数:14
相关论文
共 50 条
  • [1] TEMPORAL AGGREGATION FOR FIRST-PERSON ACTION RECOGNITION USING HILBERT-HUANG TRANSFORM
    Purwanto, Didik
    Chen, Yie-Tarng
    Fang, Wen-Hsien
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 895 - 900
  • [2] IRIS RECOGNITION BASED ON HILBERT-HUANG TRANSFORM
    Yang, Zhijing
    Yang, Zhihua
    Yang, Lihua
    [J]. ADVANCES IN DATA SCIENCE AND ADAPTIVE ANALYSIS, 2009, 1 (04) : 623 - 641
  • [3] Hilbert-Huang Transform and the Application
    Liu, Yi
    An, Hao
    Bian, Shuangshuang
    [J]. PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020, : 534 - 539
  • [4] The summary of Hilbert-Huang transform
    Song Shi-De
    Yao Zhi-chao
    Wang Xiao-na
    [J]. INTERNATIONAL SYMPOSIUM ON PHOTOELECTRONIC DETECTION AND IMAGING 2013: INFRARED IMAGING AND APPLICATIONS, 2013, 8907
  • [5] Blasting wave pattern recognition based on Hilbert-Huang transform
    Li, Xuelong
    Wang, Enyuan
    Li, Zhonghui
    Bie, Xiaofei
    Chen, Liang
    Feng, Junjun
    Li, Nan
    [J]. GEOMECHANICS AND ENGINEERING, 2016, 11 (05) : 607 - 624
  • [6] Speech Recognition using Hilbert-Huang Transform Based Features
    Hanna, Samer S.
    Korany, Noha
    Abd-el-Malek, Mina B.
    [J]. 2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 338 - 341
  • [7] Hilbert-Huang Transform in Fault Detection
    German-Sallo, Zoltan
    Grif, Horatiu Stefan
    [J]. 12TH INTERNATIONAL CONFERENCE INTERDISCIPLINARITY IN ENGINEERING (INTER-ENG 2018), 2019, 32 : 591 - 595
  • [8] Modulation Classification By Hilbert-Huang Transform
    Tanc, Yesim Hekim
    Akan, Aydm
    [J]. 2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [9] An algorithm for improving Hilbert-Huang transform
    Guo, Song
    Gu, Guochang
    Li, Changyou
    [J]. COMPUTATIONAL SCIENCE - ICCS 2007, PT 3, PROCEEDINGS, 2007, 4489 : 137 - +
  • [10] Mode Decomposition and the Hilbert-Huang Transform
    Ompokov, V. D.
    Boronoev, V. V.
    [J]. 2019 RUSSIAN OPEN CONFERENCE ON RADIO WAVE PROPAGATION (RWP), VOL 1, 2019, : 222 - 223