Continuous action segmentation and recognition using hybrid convolutional neural network-hidden Markov model model

被引:33
|
作者
Lei, Jun [1 ]
Li, Guohui [1 ]
Zhang, Jun [1 ]
Guo, Qiang [1 ]
Tu, Dan [1 ]
机构
[1] Natl Univ Def Technol, Coll Informat Syst & Management, Changsha, Hunan, Peoples R China
关键词
video signal processing; image segmentation; image recognition; neural nets; hidden Markov models; Gaussian processes; continuous action segmentation; continuous action recognition; hybrid convolutional neural network-hidden Markov model model; isolated action recognition; convolutional neural network; HMM; statistical dependences; CNN-HMM; Gaussian mixture model; Viterbi algorithm;
D O I
10.1049/iet-cvi.2015.0408
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Continuous action recognition in video is more complicated compared with traditional isolated action recognition. Besides the high variability of postures and appearances of each action, the complex temporal dynamics of continuous action makes this problem challenging. In this study, the authors propose a hierarchical framework combining convolutional neural network (CNN) and hidden Markov model (HMM), which recognises and segments continuous actions simultaneously. The authors utilise the CNN's powerful capacity of learning high level features directly from raw data, and use it to extract effective and robust action features. The HMM is used to model the statistical dependences over adjacent sub-actions and infer the action sequences. In order to combine the advantages of these two models, the hybrid architecture of CNN-HMM is built. The Gaussian mixture model is replaced by CNN to model the emission distribution of HMM. The CNN-HMM model is trained using embedded Viterbi algorithm, and the data used to train CNN are labelled by forced alignment. The authors test their method on two public action dataset Weizmann and KTH. Experimental results show that the authors' method achieves improved recognition and segmentation accuracy compared with several other methods. The superior property of features learnt by CNN is also illustrated.
引用
收藏
页码:537 / 544
页数:8
相关论文
共 50 条
  • [21] Automatic Speech Recognition: Comparisons Between Convolutional Neural Networks, Hidden Markov Model and Hybrid Architecture
    Santos, Lyndaines
    Moreira, Nicolas de Araujo
    Sampaio, Robson
    Lima, Raizielle
    Oliveira, Francisco Carlos Mattos Brito
    EXPERT SYSTEMS, 2025, 42 (05)
  • [22] Combining hidden Markov model and fuzzy neural network for continuous recognition of complex dynamic gestures
    Huiyue Wu
    Jianmin Wang
    Xiaolong Zhang
    The Visual Computer, 2017, 33 : 1265 - 1278
  • [23] Combining hidden Markov model and fuzzy neural network for continuous recognition of complex dynamic gestures
    Wu, Huiyue
    Wang, Jianmin
    Zhang, Xiaolong
    VISUAL COMPUTER, 2017, 33 (10): : 1265 - 1278
  • [24] Deep Convolutional Neural Network Based Hidden Markov Model for Offline Handwritten Chinese Text Recognition
    Wang, Zi-Rui
    Du, Jun
    Hu, Jin-Shui
    Hu, Yu-Long
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 816 - 821
  • [25] Hybrid GrabCut Hidden Markov Model for Segmentation
    Saeed, Soobia
    Abdullah, Afnizanfaizal
    Jhanjhi, N. Z.
    Naqvi, Mehmood
    Masud, Mehedi
    AlZain, Mohammed A.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 851 - 869
  • [26] Speech recognition algorithm based on neural network and hidden Markov model
    Zhao Jianhui
    Gao Hongbo
    Liu Yuchao
    Cheng Bo
    The Journal of China Universities of Posts and Telecommunications, 2018, 25 (04) : 28 - 37
  • [27] Speech recognition algorithm based on neural network and hidden Markov model
    Jianhui Z.
    Hongbo G.
    Yuchao L.
    Bo C.
    Journal of China Universities of Posts and Telecommunications, 2018, 25 (04): : 28 - 37
  • [28] Human Action Recognition Using Hybrid Method of Hidden Markov Model and Dirichlet Process Gaussian Mixture Model
    Cho, W. H.
    Kim, S. K.
    Park, S. Y.
    ADVANCED SCIENCE LETTERS, 2017, 23 (03) : 1652 - 1655
  • [29] Hybrid model of neural network and hidden Markov model for protein secondary structure prediction
    Shi, Ou-Yan
    Yang, Hui-Yun
    Yang, Jing
    Tian, Xin
    PROGRESS ON POST-GENOME TECHNOLOGIES, 2007, : 170 - 172
  • [30] Hybrid Model of Continuous Hidden Markov Model and Multi-Layer Perceptron in Speech Recognition
    Zhang, Peiling
    Li, Hui
    ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL II, PROCEEDINGS, 2009, : 62 - 65