Histogram of Oriented Gradient-Based Fusion of Features for Human Action Recognition in Action Video Sequences

被引:44
|
作者
Patel, Chirag I. [1 ]
Labana, Dileep [1 ]
Pandya, Sharnil [2 ,3 ]
Modi, Kirit [4 ]
Ghayvat, Hemant [5 ]
Awais, Muhammad [6 ]
机构
[1] Parul Univ, Parul Inst Technol, Comp Sci & Engn, Vadodara 391760, India
[2] Symbiosis Int Deemed Univ, Symbiosis Ctr Appl Artificial Intelligence, Pune 412115, Maharashtra, India
[3] Symbiosis Int Deemed Univ, Symbiosis Inst Technol, Pune 412115, Maharashtra, India
[4] Sankalchand Patel Univ, Sankalchand Patel Coll Engn, Visnagar 384315, India
[5] Linnaeus Univ, Fac Technol, Comp Sci Dept, S-35195 Vaxjo, Sweden
[6] Fudan Univ, Sch Informat Sci & Technol, Dept Elect Engn, Ctr Intelligent Med Elect, Shanghai 200433, Peoples R China
关键词
moving object detection; surveillance system; support vector machines; histogram of oriented gradient; meta-cognitive neural network (MCNN); multiple kernel learning; DESCRIPTORS;
D O I
10.3390/s20247299
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Human Action Recognition (HAR) is the classification of an action performed by a human. The goal of this study was to recognize human actions in action video sequences. We present a novel feature descriptor for HAR that involves multiple features and combining them using fusion technique. The major focus of the feature descriptor is to exploits the action dissimilarities. The key contribution of the proposed approach is to built robust features descriptor that can work for underlying video sequences and various classification models. To achieve the objective of the proposed work, HAR has been performed in the following manner. First, moving object detection and segmentation are performed from the background. The features are calculated using the histogram of oriented gradient (HOG) from a segmented moving object. To reduce the feature descriptor size, we take an averaging of the HOG features across non-overlapping video frames. For the frequency domain information we have calculated regional features from the Fourier hog. Moreover, we have also included the velocity and displacement of moving object. Finally, we use fusion technique to combine these features in the proposed work. After a feature descriptor is prepared, it is provided to the classifier. Here, we have used well-known classifiers such as artificial neural networks (ANNs), support vector machine (SVM), multiple kernel learning (MKL), Meta-cognitive Neural Network (McNN), and the late fusion methods. The main objective of the proposed approach is to prepare a robust feature descriptor and to show the diversity of our feature descriptor. Though we are using five different classifiers, our feature descriptor performs relatively well across the various classifiers. The proposed approach is performed and compared with the state-of-the-art methods for action recognition on two publicly available benchmark datasets (KTH and Weizmann) and for cross-validation on the UCF11 dataset, HMDB51 dataset, and UCF101 dataset. Results of the control experiments, such as a change in the SVM classifier and the effects of the second hidden layer in ANN, are also reported. The results demonstrate that the proposed method performs reasonably compared with the majority of existing state-of-the-art methods, including the convolutional neural network-based feature extractors.
引用
收藏
页码:1 / 33
页数:32
相关论文
共 50 条
  • [1] Fusion of histogram based features for Human Action Recognition
    Sahoo, Suraj Prakash
    Silambarasi, R.
    Ari, Samit
    [J]. 2019 5TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS (ICACCS), 2019, : 1012 - 1016
  • [2] Human action recognition using fusion of features for unconstrained video sequences
    Patel, Chirag I.
    Garg, Sanjay
    Zaveri, Tanish
    Banerjee, Asim
    Patel, Ripal
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2018, 70 : 284 - 301
  • [3] Action recognition based on binary patterns of action-history and histogram of oriented gradient
    Ahad, Md. Atiqur Rahman
    Islam, Md. Nazmul
    Jahan, Israt
    [J]. JOURNAL ON MULTIMODAL USER INTERFACES, 2016, 10 (04) : 335 - 344
  • [4] Action recognition based on binary patterns of action-history and histogram of oriented gradient
    Md. Atiqur Rahman Ahad
    Md. Nazmul Islam
    Israt Jahan
    [J]. Journal on Multimodal User Interfaces, 2016, 10 : 335 - 344
  • [5] Human Action Recognition Based on Pyramid Histogram of Oriented Gradients
    Wang, Jin
    Liu, Ping
    She, Mary F. H.
    Kouzani, Abbas
    Nahavandi, Saeid
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 2449 - 2454
  • [6] Human Action Recognition Based on Fusion Features
    Yang, Shiqiang
    Yang, Jiangtao
    Li, Fei
    Fan, Guohao
    Li, Dexin
    [J]. CYBER SECURITY INTELLIGENCE AND ANALYTICS, 2020, 928 : 569 - 579
  • [7] Diverse Features Fusion Network for video-based action recognition
    Deng, Haoyang
    Kong, Jun
    Jiang, Min
    Liu, Tianshan
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 77
  • [8] HMM Based Action Recognition with Projection Histogram Features
    Vezzani, Roberto
    Baltieri, Davide
    Cucchiara, Rita
    [J]. RECOGNIZING PATTERNS IN SIGNALS, SPEECH, IMAGES, AND VIDEOS, 2010, 6388 : 286 - 293
  • [9] Human Action Recognition in Video by Fusion of Structural and Spatio-temporal Features
    Borzeshi, Ehsan Zare
    Concha, Oscar Perez
    Piccardi, Massimo
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2012, 7626 : 474 - 482
  • [10] Human Action Recognition in Video Sequence using Logistic Regression by Features Fusion Approach based on CNN Features
    Ahmad, Tariq
    Wu, Jinsong
    Khan, Imran
    Rahim, Asif
    Khan, Amjad
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (11) : 18 - 25