LEARNED SPATIO-TEMPORAL TEXTURE DESCRIPTORS FOR RGB-D HUMAN ACTION RECOGNITION

Cited by: 1

Authors:

Zhai, Zhengyuan [1]

Fan, Chunxiao [1]

Ming, Yue [1]

Affiliations:

[1] Beijing Univ Posts & Telecommun, Beijing Key Lab Work Safety Intelligent Monitorin, Xitucheng Rd 10, Beijing 100876, Peoples R China

Source:

COMPUTING AND INFORMATICS | 2018 / Vol. 37 / No. 06

Keywords:

3D pixel differences vectors; compact binary face descriptor; feature fusion; human action recognition; RGB-depth videos; ENSEMBLE; FEATURES;

DOI:

10.4149/cai_2018_6_1339

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Since the recent arrival of Kinect, action recognition with depth images has attracted wide attention from researchers, and various descriptors have been proposed, among which Local Binary Patterns (LBP) texture descriptors possess the property of appearance invariance. However, LBP and its variants are mostly hand-crafted: they demand strong prior knowledge from engineers and are not discriminative enough for recognition tasks. To this end, this paper develops compact spatio-temporal texture descriptors, i.e., 3D-compact LBP (3D-CLBP) and local depth patterns (3D-CLDP), for color and depth videos, following compact binary face descriptor learning in face recognition. Extensive experiments on three standard datasets, 3D Online Action, MSR Action Pairs and MSR Daily Activity 3D, demonstrate that the method outperforms most of the compared methods and can capture spatio-temporal texture cues in videos.
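
The keywords mention 3D pixel difference vectors, which are the raw input that a learned binary descriptor would encode. The following is only a minimal sketch of that idea, not the authors' implementation: it assumes a 3x3x3 neighbourhood (26 neighbours) around a voxel of a video volume indexed as `volume[t][y][x]`, and it uses the hand-crafted LBP-style sign test in place of the learned projection described in the paper. The function names `pixel_difference_vector` and `sign_binarize` are hypothetical.

```python
from itertools import product


def pixel_difference_vector(volume, t, y, x):
    """Return the differences between each neighbour of voxel (t, y, x)
    and the voxel itself, over an assumed 3x3x3 neighbourhood."""
    center = volume[t][y][x]
    pdv = []
    for dt, dy, dx in product((-1, 0, 1), repeat=3):
        if (dt, dy, dx) == (0, 0, 0):
            continue  # skip the centre voxel
        pdv.append(volume[t + dt][y + dy][x + dx] - center)
    return pdv


def sign_binarize(pdv):
    """Hand-crafted LBP-style coding (assumption: stands in for the learned
    mapping): 1 where the neighbour is >= the centre, else 0."""
    return [1 if d >= 0 else 0 for d in pdv]


# Toy 3-frame, 3x3 volume with increasing intensities 0..26.
volume = [[[t * 9 + y * 3 + x for x in range(3)] for y in range(3)]
          for t in range(3)]
pdv = pixel_difference_vector(volume, 1, 1, 1)
bits = sign_binarize(pdv)
```

A learned descriptor would replace `sign_binarize` with a projection trained on many such vectors; how that projection is trained is not specified in this record and is left out here.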


Pages: 1339-1362

Page count: 24

50 items in total

[31] Human Action Recognition Based on Temporal Pyramid of Key Poses Using RGB-D Sensors
Cippitelli, Enea
Gambi, Ennio
Spinsante, Susanna
Florez-Revuelta, Francisco
[J]. ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, ACIVS 2016, 2016, 10016 : 510 - 521
[32] Spatio-Temporal Action Localization for Human Action Recognition in Large Dataset
Megrhi, Sameh
Jmal, Marwa
Beghdadi, Azeddine
Mseddi, Wided
[J]. VIDEO SURVEILLANCE AND TRANSPORTATION IMAGING APPLICATIONS 2015, 2015, 9407
[33] HOG and HOOF Spatio-Temporal Descriptors for Gesture Recognition
Agab, Salah Eddine
Chelali, Fatma Zohra
[J]. 2018 INTERNATIONAL CONFERENCE ON SIGNAL, IMAGE, VISION AND THEIR APPLICATIONS (SIVA), 2018,
[34] RGB-D Face Recognition With Texture and Attribute Features
Goswami, Gaurav
Vatsa, Mayank
Singh, Richa
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2014, 9 (10) : 1629 - 1640
[35] 3D Texture Recognition for RGB-D Images
Zhong, Guoqiang
Mao, Xin
Shi, Yaxin
Dong, Junyu
[J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, CAIP 2015, PT II, 2015, 9257 : 518 - 528
[36] ACTION RECOGNITION IN RGB-D EGOCENTRIC VIDEOS
Tang, Yansong
Tian, Yi
Lu, Jiwen
Feng, Jianjiang
Zhou, Jie
[J]. 2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3410 - 3414
[37] 3D Gait Recognition Using Spatio-Temporal Motion Descriptors
Kwolek, Bogdan
Krzeszowski, Tomasz
Michalczuk, Agnieszka
Josinski, Henryk
[J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, 2014, 8398 : 595 - 604
[38] Structured Images for RGB-D Action Recognition
Wang, Pichao
Wang, Shuang
Gao, Zhimin
Hou, Yonghong
Li, Wanqing
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 1005 - 1014
[39] Temporal cues enhanced multimodal learning for action recognition in RGB-D videos
Liu, Dan
Meng, Fanrong
Xia, Qing
Ma, Zhiyuan
Mi, Jinpeng
Gan, Yan
Ye, Mao
Zhang, Jianwei
[J]. NEUROCOMPUTING, 2024, 594
[40] A new framework of action recognition with discriminative parts, spatio-temporal and causal interaction descriptors
Tong, Ming
Chen, Yiran
Zhao, Mengao
Tian, Weijuan
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 56 : 116 - 130
