LEARNED SPATIO-TEMPORAL TEXTURE DESCRIPTORS FOR RGB-D HUMAN ACTION RECOGNITION

被引:1
|
作者
Zhai, Zhengyuan [1 ]
Fan, Chunxiao [1 ]
Ming, Yue [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing Key Lab Work Safety Intelligent Monitorin, Xitucheng Rd 10, Beijing 100876, Peoples R China
关键词
3D pixel differences vectors; compact binary face descriptor; feature fusion; human action recognition; RGB-depth videos; ENSEMBLE; FEATURES;
D O I
10.4149/cai_2018_6_1339
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the recent arrival of Kinect, action recognition with depth images has attracted researchers' wide attentions and various descriptors have been proposed, where Local Binary Patterns (LBP) texture descriptors possess the properties of appearance invariance. However, the LBP and its variants are most artificially-designed, demanding engineers' strong prior knowledge and not discriminative enough for recognition tasks. To this end, this paper develops compact spatio-temporal texture descriptors, i.e. 3D-compact LBP(3D-CLBP) and local depth patterns (3D-CLDP), for color and depth videos in the light of compact binary face descriptor learning in face recognition. Extensive experiments performed on three standard datasets, 3D Online Action, MSR Action Pairs and MSR Daily Activity 3D, demonstrate that our method is superior to most comparative methods in respects of performance and can capture spatial-temporal texture cues in videos.
引用
收藏
页码:1339 / 1362
页数:24
相关论文
共 50 条
  • [31] Human Action Recognition Based on Temporal Pyramid of Key Poses Using RGB-D Sensors
    Cippitelli, Enea
    Gambi, Ennio
    Spinsante, Susanna
    Florez-Revuelta, Francisco
    [J]. ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2016, 2016, 10016 : 510 - 521
  • [32] Spatio-Temporal Action Localization for Human Action Recognition in Large Dataset
    Megrhi, Sameh
    Jmal, Marwa
    Beghdadi, Azeddine
    Mseddi, Wided
    [J]. VIDEO SURVEILLANCE AND TRANSPORTATION IMAGING APPLICATIONS 2015, 2015, 9407
  • [33] HOG and HOOF Spatio-Temporal Descriptors for Gesture Recognition
    Agab, Salah Eddine
    Chelali, Fatma Zohra
    [J]. 2018 INTERNATIONAL CONFERENCE ON SIGNAL, IMAGE, VISION AND THEIR APPLICATIONS (SIVA), 2018,
  • [34] RGB-D Face Recognition With Texture and Attribute Features
    Goswami, Gaurav
    Vatsa, Mayank
    Singh, Richa
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2014, 9 (10) : 1629 - 1640
  • [35] 3D Texture Recognition for RGB-D Images
    Zhong, Guoqiang
    Mao, Xin
    Shi, Yaxin
    Dong, Junyu
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2015, PT II, 2015, 9257 : 518 - 528
  • [36] ACTION RECOGNITION IN RGB-D EGOCENTRIC VIDEOS
    Tang, Yansong
    Tian, Yi
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    [J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3410 - 3414
  • [37] 3D Gait Recognition Using Spatio-Temporal Motion Descriptors
    Kwolek, Bogdan
    Krzeszowski, Tomasz
    Michalczuk, Agnieszka
    Josinski, Henryk
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, 2014, 8398 : 595 - 604
  • [38] Structured Images for RGB-D Action Recognition
    Wang, Pichao
    Wang, Shuang
    Gao, Zhimin
    Hou, Yonghong
    Li, Wanqing
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 1005 - 1014
  • [39] Temporal cues enhanced multimodal learning for action recognition in RGB-D videos
    Liu, Dan
    Meng, Fanrong
    Xia, Qing
    Ma, Zhiyuan
    Mi, Jinpeng
    Gan, Yan
    Ye, Mao
    Zhang, Jianwei
    [J]. NEUROCOMPUTING, 2024, 594
  • [40] A new framework of action recognition with discriminative parts, spatio-temporal and causal interaction descriptors
    Tong, Ming
    Chen, Yiran
    Zhao, Mengao
    Tian, Weijuan
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 56 : 116 - 130