Micro-network-based deep convolutional neural network for human activity recognition from realistic and multi-view visual data

被引:8
|
作者
Kushwaha, Arati [1 ]
Khare, Ashish [1 ]
Prakash, Om [2 ]
机构
[1] Univ Allahabad, Dept Elect & Commun, Prayagraj, Uttar Pradesh, India
[2] HNB Garhwal Univ, Dept Comp Sci & Engn, Srinagar, India
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 18期
关键词
Convolutional neural network; Human activity recognition; Micro-network; Softmax classifier; FEATURES; FRAMEWORK; IMAGE; TERM; BAG;
D O I
10.1007/s00521-023-08440-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the recent past, deep convolutional neural network (DCNN) has been used in majority of state-of-the-art methods due to its remarkable performance in number of computer vision applications. However, DCNN are computationally expensive and requires more resources as well as computational time. Also, deeper architectures are prone to overfitting problem, while small-size dataset is used. To address these limitations, we propose a simple and computationally efficient deep convolutional neural network (DCNN) architecture based on the concept multiscale processing for human activity recognition. We increased the width and depth of the network by carefully crafting the design of network, which results in improved utilization of computational resources. First, we designed a small micro-network with varying receptive field size convolutional kernels (1 x 1, 3 x 3, and 5 x 5) for extraction of unique discriminative information of human objects having variations in object size, pose, orientation, and view. Then, the proposed DCNN architecture is designed by stacking repeated building blocks of small micro-networks with same topology. Here, we factorize the larger convolutional operation in stack of smaller convolutional operations to make the network computationally efficient. The softmax classifier is used for activity classification. Advantage of the proposed architecture over standard deep architectures is its computational efficiency and flexibility to use with both small as well as large size datasets. To evaluate the effectiveness of the proposed architecture, several extensive experiments are conducted by using publically available datasets, namely UCF sports, IXMAS, YouTube, TV-HI, HMDB51, and UCF101 datasets. The activity recognition results have shown outperformance of the proposed method over other existing state-of-the-art methods.
引用
收藏
页码:13321 / 13341
页数:21
相关论文
共 50 条
  • [41] Multi-View Video Quality Enhancement Method Based on Multi-Scale Fusion Convolutional Neural Network and Visual Saliency
    Wang, Weizhe
    Dai, Erzhuang
    IEEE ACCESS, 2024, 12 : 33100 - 33108
  • [42] Multiscale Bidirectional Input Convolutional and Deep Neural Network for Human Activity Recognition
    Qiu Y.
    Lin L.
    Yang L.
    Li D.
    Song R.
    Xu G.
    Shen S.
    Wireless Communications and Mobile Computing, 2021, 2021
  • [43] Micro-network Based Convolutional Neural Network with Integration of Multilayer Feature Fusion Strategy for Human Activity Recognition
    Kushwaha, Arati
    Khare, Manish
    Khare, Ashish
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2022, 31 (08)
  • [44] Optimal Deep Convolutional Neural Network with Pose Estimation for Human Activity Recognition
    Nandagopal, S.
    Karthy, G.
    Oliver, A. Sheryl
    Subha, M.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (02): : 1719 - 1733
  • [45] Classification of Phonocardiogram Based on Multi-View Deep Network
    Tian, Guangyang
    Lian, Cheng
    Xu, Bingrong
    Zang, Junbin
    Zhang, Zhidong
    Xue, Chenyang
    NEURAL PROCESSING LETTERS, 2023, 55 (04) : 3655 - 3670
  • [46] Classification of Phonocardiogram Based on Multi-View Deep Network
    Guangyang Tian
    Cheng Lian
    Bingrong Xu
    Junbin Zang
    Zhidong Zhang
    Chenyang Xue
    Neural Processing Letters, 2023, 55 : 3655 - 3670
  • [47] Gesture Recognition based on Deep Convolutional Neural Network
    Jayanthi, P.
    Bhama, Ponsy R. K. Sathia
    2018 10TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC), 2018, : 367 - 372
  • [48] CAPTCHA recognition based on deep convolutional neural network
    Wang, Jing
    Qin, Jiaohua
    Xiang, Xuyu
    Tan, Yun
    Pan, Nan
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2019, 16 (05) : 5851 - 5861
  • [49] Chinese sign language recognition based on multi-view deep neural network for millimeter-wave radar
    Wang, Xing
    Cui, Chang
    Li, Cong
    Dong, Xichao
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS IV, 2022, 12276
  • [50] An improved human activity recognition technique based on convolutional neural network
    Raj, Ravi
    Kos, Andrzej
    SCIENTIFIC REPORTS, 2023, 13 (01)