Binary Hashing CNN Features for Action Recognition

Cited by: 3

Authors:

Li, Weisheng [1]

Feng, Chen [1]

Xiao, Bin [2]

Chen, Yanquan [2]

Affiliations:

[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China

[2] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing 400065, Peoples R China

Source:

KSII Transactions on Internet and Information Systems | 2018, Vol. 12, No. 9

Funding:

National Natural Science Foundation of China

Keywords:

Action Recognition; CNN Feature; Binary Hashing; Feature Normalization;

DOI:

10.3837/tiis.2018.09.016

Chinese Library Classification:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

This work addresses the problem of representing an entire video with Convolutional Neural Network (CNN) features for human action recognition. Because GPU memory is limited, it is difficult to feed a whole video into a CNN for end-to-end learning. A typical workaround is to use sampled video frames as inputs and the corresponding labels as supervision. A major issue with this popular approach is that the local samples may contain neither the information indicated by the global labels nor sufficient motion information. To address this issue, we propose a binary hashing method to enhance the local feature extractors. First, we extract the local features and aggregate them into global features using maximum/minimum pooling. Second, we use the binary hashing method to capture the motion features. Finally, we concatenate the hashing features with the global features, using different normalization methods, to train the classifier. Experimental results on the JHMDB and MPII-Cooking datasets show that, for these new local features, applying the binary hashing mapping to the sparsely sampled features leads to significant performance improvements.
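The three steps in the abstract (pool, hash, normalize and concatenate) can be illustrated with a minimal sketch. It is only an illustration under stated assumptions: the abstract does not specify the hashing function or the normalization, so `temporal_sign_hash` (a sign test on the change between consecutive sampled frames) and the L2 normalization are hypothetical stand-ins, and all function names are invented for this example.

```python
import math
from typing import List, Sequence

Vector = List[float]


def min_max_pool(frames: Sequence[Sequence[float]]) -> Vector:
    """Aggregate per-frame features: per-dimension maxima, then minima."""
    columns = list(zip(*frames))
    return [max(c) for c in columns] + [min(c) for c in columns]


def temporal_sign_hash(frames: Sequence[Sequence[float]]) -> Vector:
    """ASSUMED hash, not the paper's: one bit per dimension and frame pair,
    set when the feature increases; bits are then averaged over time."""
    dims = len(frames[0])
    pairs = list(zip(frames, frames[1:]))
    if not pairs:
        # A single frame carries no motion information.
        return [0.0] * dims
    bits = [[1.0 if b[d] > a[d] else 0.0 for d in range(dims)]
            for a, b in pairs]
    return [sum(col) / len(pairs) for col in zip(*bits)]


def l2_normalize(vec: Sequence[float]) -> Vector:
    """Scale a vector to unit L2 norm (assumed normalization choice)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def video_descriptor(frames: Sequence[Sequence[float]]) -> Vector:
    """Concatenate the normalized pooled block and the normalized hash block."""
    pooled = l2_normalize(min_max_pool(frames))
    hashed = l2_normalize(temporal_sign_hash(frames))
    return pooled + hashed


if __name__ == "__main__":
    # Three sampled frames with two-dimensional toy features.
    toy_frames = [[1.0, 4.0], [3.0, 2.0], [2.0, 5.0]]
    print(video_descriptor(toy_frames))
```

Normalizing each block separately before concatenation keeps either block from dominating the classifier input by scale alone; whether this matches the normalization variants compared in the paper is not stated in the abstract.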

Pages: 4412-4428

Page count: 17
