Learning multi-temporal-scale deep information for action recognition

Cited by: 25
Authors
Yao, Guangle [1, 2, 3]
Lei, Tao [1]
Zhong, Jiandan [1, 2, 3]
Jiang, Ping [1]
Affiliations
[1] Chinese Acad Sci, Inst Opt & Elect, Chengdu, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu, Sichuan, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
Keywords
Action recognition; Convolutional neural networks; Deep learning; Spatiotemporal information; Histograms; Networks
DOI
10.1007/s10489-018-1347-3
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Action recognition in video is widely applied in video indexing, intelligent surveillance, multimedia understanding, and other fields. A typical human action contains spatiotemporal information at various scales. Learning and fusing multi-temporal-scale information makes action recognition more reliable in terms of recognition accuracy. To demonstrate this, we use Res3D, a 3D Convolutional Neural Network (CNN) architecture, to extract information at multiple temporal scales. At each temporal scale, we transfer the knowledge learned from RGB to 3-channel optical flow (OF) and learn information from both the RGB and OF fields. We also propose Parallel Pair Discriminant Correlation Analysis (PPDCA) to fuse the multi-temporal-scale information into a lower-dimensional action representation. Experimental results show that, compared with the single-temporal-scale method, the proposed multi-temporal-scale method achieves higher recognition accuracy; it spends more time on feature extraction but less time on classification, owing to the lower-dimensional representation. Moreover, the proposed method achieves recognition performance comparable to that of state-of-the-art methods. The source code and 3D filter animations are available online: https://github.com/JerryYaoGl/multi-temporal-scale.
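As a rough illustration of what "multiple temporal scales" can mean in practice, the sketch below samples fixed-length clips from a video at several frame strides, so that each clip covers a different time span. This is not the authors' sampling scheme: the function names, the 8-frame clip length, and the strides (1, 2, 4) are all assumptions chosen for illustration, and the feature extraction (Res3D) and fusion (PPDCA) steps are not shown.

```python
def sample_clip_indices(num_frames, clip_len=8, stride=1):
    """Return frame-index lists for non-overlapping clips at one temporal stride.

    clip_len and stride are illustrative assumptions, not values from the paper.
    """
    span = (clip_len - 1) * stride + 1  # number of frames one clip covers
    clips = []
    start = 0
    while start + span <= num_frames:
        clips.append([start + i * stride for i in range(clip_len)])
        start += span
    return clips


def multi_scale_clips(num_frames, clip_len=8, strides=(1, 2, 4)):
    """Map each (hypothetical) temporal stride to its list of clips."""
    return {s: sample_clip_indices(num_frames, clip_len, s) for s in strides}


if __name__ == "__main__":
    scales = multi_scale_clips(num_frames=32)
    for stride, clips in scales.items():
        print(f"stride {stride}: {len(clips)} clip(s), first = {clips[0]}")
```

A larger stride yields fewer clips, each spanning a longer stretch of the video; in a multi-scale pipeline, features extracted per stride would then be fused into a single representation.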
Pages: 2017-2029
Page count: 13