Continuous action segmentation and recognition using hybrid convolutional neural network-hidden Markov model model

Cited by: 32
Authors
Lei, Jun [1]
Li, Guohui [1]
Zhang, Jun [1]
Guo, Qiang [1]
Tu, Dan [1]
Affiliation
[1] Natl Univ Def Technol, Coll Informat Syst & Management, Changsha, Hunan, Peoples R China
Keywords
video signal processing; image segmentation; image recognition; neural nets; hidden Markov models; Gaussian processes; continuous action segmentation; continuous action recognition; hybrid convolutional neural network-hidden Markov model model; isolated action recognition; convolutional neural network; HMM; statistical dependences; CNN-HMM; Gaussian mixture model; Viterbi algorithm;
DOI
10.1049/iet-cvi.2015.0408
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Continuous action recognition in video is more complicated than traditional isolated action recognition. Besides the high variability of postures and appearances within each action, the complex temporal dynamics of continuous actions make the problem challenging. In this study, the authors propose a hierarchical framework combining a convolutional neural network (CNN) and a hidden Markov model (HMM), which recognises and segments continuous actions simultaneously. The authors exploit the CNN's capacity to learn high-level features directly from raw data, and use it to extract effective and robust action features. The HMM is used to model the statistical dependences between adjacent sub-actions and to infer the action sequences. To combine the advantages of these two models, a hybrid CNN-HMM architecture is built, in which the CNN replaces the Gaussian mixture model as the model of the HMM's emission distribution. The CNN-HMM model is trained using the embedded Viterbi algorithm, and the data used to train the CNN are labelled by forced alignment. The authors test their method on two public action datasets, Weizmann and KTH. Experimental results show that the authors' method achieves improved recognition and segmentation accuracy compared with several other methods. The superior properties of the features learnt by the CNN are also illustrated.
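The decoding step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the CNN outputs a per-frame posterior over HMM states and that these posteriors are divided by state priors to obtain scaled likelihoods, which is the usual convention in hybrid neural network-HMM systems but is not confirmed by the abstract. The names viterbi_segment and segments, and all toy numbers, are hypothetical.

```python
import math

EPS = 1e-12  # guards against log(0); an arbitrary small constant


def viterbi_segment(posteriors, priors, trans, init):
    """Return the most likely HMM state per frame.

    posteriors: T x K per-frame state posteriors (stand-in for CNN softmax outputs)
    priors:     K state priors (assumed to come from state frequencies in an alignment)
    trans:      K x K transition probabilities, trans[j][k] = p(k | j)
    init:       K initial state probabilities
    """
    num_states = len(priors)

    def log_emit(t, k):
        # Scaled log-likelihood: log p(state | frame) - log p(state)
        return math.log(max(posteriors[t][k], EPS)) - math.log(max(priors[k], EPS))

    def log_tr(j, k):
        return math.log(max(trans[j][k], EPS))

    delta = [math.log(max(init[k], EPS)) + log_emit(0, k) for k in range(num_states)]
    backpointers = []
    for t in range(1, len(posteriors)):
        new_delta, pointers = [], []
        for k in range(num_states):
            best_j = max(range(num_states), key=lambda j: delta[j] + log_tr(j, k))
            new_delta.append(delta[best_j] + log_tr(best_j, k) + log_emit(t, k))
            pointers.append(best_j)
        delta = new_delta
        backpointers.append(pointers)

    # Trace back from the best final state.
    state = max(range(num_states), key=lambda k: delta[k])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


def segments(path):
    """Collapse a per-frame state path into (state, first_frame, last_frame) runs."""
    runs, start = [], 0
    for t in range(1, len(path) + 1):
        if t == len(path) or path[t] != path[start]:
            runs.append((path[start], start, t - 1))
            start = t
    return runs


if __name__ == "__main__":
    # Toy posteriors for two states over six frames (made-up values).
    toy_posteriors = [[0.9, 0.1], [0.8, 0.2], [0.9, 0.1],
                      [0.2, 0.8], [0.1, 0.9], [0.1, 0.9]]
    toy_path = viterbi_segment(toy_posteriors, [0.5, 0.5],
                               [[0.9, 0.1], [0.1, 0.9]], [0.5, 0.5])
    print(toy_path, segments(toy_path))
```

Because the transition matrix favours staying in the same state, the decoded path tends to smooth over isolated noisy frames, so segment boundaries and labels come out of a single pass. In embedded Viterbi training, a path of this kind, constrained to the known action transcript, would serve as the forced alignment that labels frames for the next round of CNN training.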
Pages: 537-544
Number of pages: 8