Binary dense sift flow based two stream CNN for human action recognition

被引:8
|
作者
Park, Sang Kyoo [1 ]
Chung, Jun Ho [1 ]
Kang, Tae Koo [2 ]
Lim, Myo Taeg [1 ]
机构
[1] Korea Univ, Sch Elect Engn, Seoul, South Korea
[2] Sangmyung Univ, Dept Human Intelligence & Robot Engn, Cheonan, South Korea
基金
新加坡国家研究基金会;
关键词
Action recognition; Binary dense SIFT flow; Binary descriptor; Two-Stream CNN;
D O I
10.1007/s11042-021-10795-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Two-stream CNN is a widely-used network for human action recognition. Two-stream CNN consists of a spatial stream and a temporal stream. The spatial stream, through which the RGB image passes, extracts the shape features of human motion. The temporal stream, through which the optical flow images pass, extracts the sequence features of the listed motions. However, because of the constraints of the optical flow, such as brightness, constancy, and piecewise smoothness, there are limitations to the performance of two-stream CNN. One of the efficient methods to solve this problem is to expand the network model to a three-stream network, fuse it with LSTM, and add a modified pooling layer. This method improves the performance of the model but it increases the computational cost. Besides, the limitations of the optical flow are still present. In this paper, without extending the network model, a binary dense SIFT flow-based two-stream CNN is used instead of the optical flow. Unlike the optical flow, binary dense SIFT flow, which is a feature-based matching flow field is robust in brightness, constancy and piecewise smoothness. To evaluate the binary dense SIFT flow-based two-stream CNN, the UCF-101 dataset was selected for human action recognition. Furthermore, to evaluate the robustness of its brightness constancy and piecewise smoothness, a custom dataset was made up of classes that were extracted from UCF-101. Finally, the proposed method was compared with the state-of-the-art, which uses an optical flow-based two-stream CNN.
引用
收藏
页码:35697 / 35720
页数:24
相关论文
共 50 条
  • [1] Binary dense sift flow based two stream CNN for human action recognition
    Sang Kyoo Park
    Jun Ho Chung
    Tae Koo Kang
    Myo Taeg Lim
    [J]. Multimedia Tools and Applications, 2021, 80 : 35697 - 35720
  • [2] Binary Dense SIFT Flow Based Position-Information Added Two-Stream CNN for Pedestrian Action Recognition
    Park, Sang Kyoo
    Chung, Jun Ho
    Pae, Dong Sung
    Lim, Myo Taeg
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [3] Recognition of Human Action and Identification Based on SIFT and Watermark
    Ali, Khawlah Hussein
    Wang, Tianjiang
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, 2014, 8589 : 298 - 309
  • [4] Enhanced Spatial Stream of Two-Stream Network Using Optical Flow for Human Action Recognition
    Khan, Shahbaz
    Hassan, Ali
    Hussain, Farhan
    Perwaiz, Aqib
    Riaz, Farhan
    Alsabaan, Maazen
    Abdul, Wadood
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (14):
  • [5] Action detection based on tracklets with the two-stream CNN
    Minwen Zhang
    Chenqiang Gao
    Qiang Li
    Lan Wang
    Jiayao Zhang
    [J]. Multimedia Tools and Applications, 2018, 77 : 3303 - 3316
  • [6] Action detection based on tracklets with the two-stream CNN
    Zhang, Minwen
    Gao, Chenqiang
    Li, Qiang
    Wang, Lan
    Zhang, Jiayao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (03) : 3303 - 3316
  • [7] Human Action Recognition Based on Improved Two-Stream Convolution Network
    Wang, Zhongwen
    Lu, Haozhu
    Jin, Junlan
    Hu, Kai
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (12):
  • [8] Human Action Recognition Based on a Two-stream Convolutional Network Classifier
    Silva, Vincius de Oliveira
    Vidal, Flavio de Barros
    Soares Romariz, Alexandre Ricardo
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 774 - 778
  • [9] Hybrid two-stream dynamic CNN for view adaptive human action recognition using ensemble learning
    Muhammad Hafeez Javed
    Zeng Yu
    Tianrui Li
    Taha M. Rajeh
    Fahad Rafique
    Syed Waqar
    [J]. International Journal of Machine Learning and Cybernetics, 2022, 13 : 1157 - 1166
  • [10] Binary Hashing CNN Features for Action Recognition
    Li, Weisheng
    Feng, Chen
    Xiao, Bin
    Chen, Yanquan
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (09): : 4412 - 4428