Continuous action segmentation and recognition using hybrid convolutional neural network-hidden Markov model model

被引:33
|
作者
Lei, Jun [1 ]
Li, Guohui [1 ]
Zhang, Jun [1 ]
Guo, Qiang [1 ]
Tu, Dan [1 ]
机构
[1] Natl Univ Def Technol, Coll Informat Syst & Management, Changsha, Hunan, Peoples R China
关键词
video signal processing; image segmentation; image recognition; neural nets; hidden Markov models; Gaussian processes; continuous action segmentation; continuous action recognition; hybrid convolutional neural network-hidden Markov model model; isolated action recognition; convolutional neural network; HMM; statistical dependences; CNN-HMM; Gaussian mixture model; Viterbi algorithm;
D O I
10.1049/iet-cvi.2015.0408
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Continuous action recognition in video is more complicated compared with traditional isolated action recognition. Besides the high variability of postures and appearances of each action, the complex temporal dynamics of continuous action makes this problem challenging. In this study, the authors propose a hierarchical framework combining convolutional neural network (CNN) and hidden Markov model (HMM), which recognises and segments continuous actions simultaneously. The authors utilise the CNN's powerful capacity of learning high level features directly from raw data, and use it to extract effective and robust action features. The HMM is used to model the statistical dependences over adjacent sub-actions and infer the action sequences. In order to combine the advantages of these two models, the hybrid architecture of CNN-HMM is built. The Gaussian mixture model is replaced by CNN to model the emission distribution of HMM. The CNN-HMM model is trained using embedded Viterbi algorithm, and the data used to train CNN are labelled by forced alignment. The authors test their method on two public action dataset Weizmann and KTH. Experimental results show that the authors' method achieves improved recognition and segmentation accuracy compared with several other methods. The superior property of features learnt by CNN is also illustrated.
引用
收藏
页码:537 / 544
页数:8
相关论文
共 50 条
  • [31] Joint Training of Hidden Markov Model and Neural Network for Heart Sound Segmentation
    Renna, Francesco
    Martins, Miguel L.
    Coimbra, Miguel
    2021 COMPUTING IN CARDIOLOGY (CINC), 2021,
  • [32] A comprehensive study of hybrid neural network hidden Markov model for offline handwritten Chinese text recognition
    Zi-Rui Wang
    Jun Du
    Wen-Chao Wang
    Jian-Fang Zhai
    Jin-Shui Hu
    International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 241 - 251
  • [33] Hidden Markov Model Representation Using Probabilistic Neural Network
    Hewahi, Nabil M.
    BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2018, 9 (03): : 50 - 62
  • [34] A comprehensive study of hybrid neural network hidden Markov model for offline handwritten Chinese text recognition
    Wang, Zi-Rui
    Du, Jun
    Wang, Wen-Chao
    Zhai, Jian-Fang
    Hu, Jin-Shui
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (04) : 241 - 251
  • [35] Convolutional recurrent neural networks with hidden Markov model bootstrap for scene text recognition
    Wang, Fenglei
    Guo, Qiang
    Lei, Jun
    Zhang, Jun
    IET COMPUTER VISION, 2017, 11 (06) : 497 - 504
  • [36] Driver Intention Recognition Method Using Continuous Hidden Markov Model
    Hou, Haijing
    Jin, Lisheng
    Niu, Qingning
    Sun, Yuqin
    Lu, Meng
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2011, 4 (03) : 386 - 393
  • [37] Tone recognition of Vietnamese continuous speech using hidden Markov model
    Quang, Nguyen Hong
    Pascal, Nocera
    Eric, Castelli
    Van Loan, Trinh
    2008 SECOND INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS, 2008, : 233 - +
  • [38] Driver Intention Recognition Method Using Continuous Hidden Markov Model
    Hou H.
    Jin L.
    Niu Q.
    Sun Y.
    Lu M.
    International Journal of Computational Intelligence Systems, 2011, 4 (3) : 386 - 393
  • [40] On-line recognition of Korean characters using ART neural network and hidden Markov model
    Kim, SK
    Park, SM
    Lee, JK
    Kim, HJ
    JOURNAL OF SYSTEMS ARCHITECTURE, 1998, 44 (12) : 971 - 984