A View-Invariant Action Recognition Based on Multi-View Space Hidden Markov Models

被引:3
|
作者
Ji, Xiaofei [1 ]
Wang, Ce [1 ]
Li, Yibo [1 ]
机构
[1] Shenyang Aerosp Univ, Sch Automat, Shenyang, Peoples R China
基金
中国国家自然科学基金;
关键词
Action recognition; view-invariant; view space partition; hidden Markov models;
D O I
10.1142/S021984361450011X
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Visual-based action recognition has already been widely used in human-machine interfaces. However, it is a challenging research to recognize the human actions from different viewpoints. In order to solve this issue, a novel multi-view space hidden Markov models (HMMs) algorithm for view-invariant action recognition is proposed. First, a view-insensitive feature representation by combining the bag-of-words of interest point with the amplitude histogram of optical flow is utilized for describing the human action sequences. The combined features could not only solve the problem that there was no possibility in establishing an association between traditional bag-of-words of interest point method and HMMs, but also greatly reduce the redundancy in the video. Second, the view space is partitioned into multiple sub-view space according to the camera rotation viewpoint. Human action models are trained by HMMs algorithm in each sub-view space. By computing the probabilities of the test sequence (i.e., observation sequence) for the given multi-view space HMMs, the similarity between the sub-view space and the test sequence viewpoint are analyzed during the recognition process. Finally, the action with unknown viewpoint is recognized via the probability weighted combination. The experimental results on multi-view action dataset IXMAS demonstrate that the proposed approach is highly efficient and effective in view-invariant action recognition.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition
    Liu, Yang
    Lu, Zhaoyang
    Li, Jing
    Yang, Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2416 - 2430
  • [22] Head nod and shake recognition based on multi-view model and Hidden Markov Model
    Lu, P
    Zhang, MD
    Zhu, XS
    Wang, YS
    COMPUTER GRAPHICS, IMAGING AND VISION: NEW TRENDS, 2005, : 61 - 64
  • [23] View-Invariant Human Action Recognition Via View Transformation Network (VTN)
    Gao, Lingling
    Ji, Yanli
    Gedamu, Kumie
    Zhu, Xiaofeng
    Xu, Xing
    Shen, Heng Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4493 - 4503
  • [24] Deep Cross-view Convolutional Features for View-invariant Action Recognition
    Ulhaq, Anwaar
    2018 IEEE THIRD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, APPLICATIONS AND SYSTEMS (IPAS), 2018, : 137 - 142
  • [25] Dual-attention Network for View-invariant Action Recognition
    Gedamu Alemu Kumie
    Maregu Assefa Habtie
    Tewodros Alemu Ayall
    Changjun Zhou
    Huawen Liu
    Abegaz Mohammed Seid
    Aiman Erbad
    Complex & Intelligent Systems, 2024, 10 : 305 - 321
  • [26] Dual-attention Network for View-invariant Action Recognition
    Kumie, Gedamu Alemu
    Habtie, Maregu Assefa
    Ayall, Tewodros Alemu
    Zhou, Changjun
    Liu, Huawen
    Seid, Abegaz Mohammed
    Erbad, Aiman
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 305 - 321
  • [27] Attention Transfer (ANT) Network for View-invariant Action Recognition
    Ji, Yanli
    Xu, Feixiang
    Yang, Yang
    Xie, Ning
    Shen, Heng Tao
    Harada, Tatsuya
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 574 - 582
  • [28] View-Invariant Representation and Recognition of Actions
    Cen Rao
    Alper Yilmaz
    Mubarak Shah
    International Journal of Computer Vision, 2002, 50 : 203 - 226
  • [29] View-invariant representation and recognition of actions
    Rao, C
    Yilmaz, A
    Shah, M
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 50 (02) : 203 - 226
  • [30] Multi-View Latent Variable Discriminative Models For Action Recognition
    Song, Yale
    Morency, Louis-Philippe
    Davis, Randall
    2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 2120 - 2127