Binary Hashing CNN Features for Action Recognition

被引:3
|
作者
Li, Weisheng [1 ]
Feng, Chen [1 ]
Xiao, Bin [2 ]
Chen, Yanquan [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
Action Recognition; CNN Feature; Binary Hashing; Feature Normalization;
D O I
10.3837/tiis.2018.09.016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The purpose of this work is to solve the problem of representing an entire video using Convolutional Neural Network (CNN) features for human action recognition. Recently, due to insufficient GPU memory, it has been difficult to take the whole video as the input of the CNN for end-to-end learning. A typical method is to use sampled video frames as inputs and corresponding labels as supervision. One major issue of this popular approach is that the local samples may not contain the information indicated by the global labels and sufficient motion information. To address this issue, we propose a binary hashing method to enhance the local feature extractors. First, we extract the local features and aggregate them into global features using maximum/minimum pooling. Second, we use the binary hashing method to capture the motion features. Finally, we concatenate the hashing features with global features using different normalization methods to train the classifier. Experimental results on the JHMDB and MPII-Cooking datasets show that, for these new local features, binary hashing mapping on the sparsely sampled features led to significant performance improvements.
引用
收藏
页码:4412 / 4428
页数:17
相关论文
共 50 条
  • [1] Jointly Training of Binary 3D CNN Features for Action Recognition
    Cai, Yangang
    Xing, Peiyin
    Wang, Zhenyu
    Wang, Ronggang
    [J]. DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 446 - 446
  • [2] Binary Representation and High Efficient Compression of 3D CNN Features for Action Recognition
    Xing, Peiyin
    Peng, Peixi
    Liang, Yongsheng
    Huang, Tiejun
    Tian, Yonghong
    [J]. 2020 DATA COMPRESSION CONFERENCE (DCC 2020), 2020, : 400 - 400
  • [3] P-CNN: Pose-based CNN Features for Action Recognition
    Cheron, Guilhem
    Laptev, Ivan
    Schmid, Cordelia
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 3218 - 3226
  • [4] Action Recognition Using Multiple Pooling Strategies of CNN Features
    Hu, Haifeng
    Liao, Zhongke
    Xiao, Xiang
    [J]. NEURAL PROCESSING LETTERS, 2019, 50 (01) : 379 - 396
  • [5] Action Recognition Using Multiple Pooling Strategies of CNN Features
    Haifeng Hu
    Zhongke Liao
    Xiang Xiao
    [J]. Neural Processing Letters, 2019, 50 : 379 - 396
  • [6] EFFICIENT POOLING OF IMAGE BASED CNN FEATURES FOR ACTION RECOGNITION IN VIDEOS
    Banerjee, Biplab
    Murino, Vittorio
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2637 - 2641
  • [7] Supervised Hashing Binary Code with Deep CNN for Image Retrieval
    Li, Jun-yi
    Li, Jian-hua
    [J]. 2015 8TH INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS (BMEI), 2015, : 649 - 655
  • [8] Cascading Pose Features with CNN-LSTM for Multiview Human Action Recognition
    Malik, Najeeb ur Rehman
    Abu-Bakar, Syed Abdul Rahman
    Sheikh, Usman Ullah
    Channa, Asma
    Popescu, Nirvana
    [J]. SIGNALS, 2023, 4 (01): : 40 - 55
  • [9] Deep CNN Object Features for Improved Action Recognition in Low Quality Videos
    Rahman, Saimunur
    See, John
    Ho, Chiung Ching
    [J]. ADVANCED SCIENCE LETTERS, 2017, 23 (11) : 11360 - 11364
  • [10] Binary dense sift flow based two stream CNN for human action recognition
    Park, Sang Kyoo
    Chung, Jun Ho
    Kang, Tae Koo
    Lim, Myo Taeg
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (28-29) : 35697 - 35720