Micro-network-based deep convolutional neural network for human activity recognition from realistic and multi-view visual data

被引:8
|
作者
Kushwaha, Arati [1 ]
Khare, Ashish [1 ]
Prakash, Om [2 ]
机构
[1] Univ Allahabad, Dept Elect & Commun, Prayagraj, Uttar Pradesh, India
[2] HNB Garhwal Univ, Dept Comp Sci & Engn, Srinagar, India
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 18期
关键词
Convolutional neural network; Human activity recognition; Micro-network; Softmax classifier; FEATURES; FRAMEWORK; IMAGE; TERM; BAG;
D O I
10.1007/s00521-023-08440-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the recent past, deep convolutional neural network (DCNN) has been used in majority of state-of-the-art methods due to its remarkable performance in number of computer vision applications. However, DCNN are computationally expensive and requires more resources as well as computational time. Also, deeper architectures are prone to overfitting problem, while small-size dataset is used. To address these limitations, we propose a simple and computationally efficient deep convolutional neural network (DCNN) architecture based on the concept multiscale processing for human activity recognition. We increased the width and depth of the network by carefully crafting the design of network, which results in improved utilization of computational resources. First, we designed a small micro-network with varying receptive field size convolutional kernels (1 x 1, 3 x 3, and 5 x 5) for extraction of unique discriminative information of human objects having variations in object size, pose, orientation, and view. Then, the proposed DCNN architecture is designed by stacking repeated building blocks of small micro-networks with same topology. Here, we factorize the larger convolutional operation in stack of smaller convolutional operations to make the network computationally efficient. The softmax classifier is used for activity classification. Advantage of the proposed architecture over standard deep architectures is its computational efficiency and flexibility to use with both small as well as large size datasets. To evaluate the effectiveness of the proposed architecture, several extensive experiments are conducted by using publically available datasets, namely UCF sports, IXMAS, YouTube, TV-HI, HMDB51, and UCF101 datasets. The activity recognition results have shown outperformance of the proposed method over other existing state-of-the-art methods.
引用
下载
收藏
页码:13321 / 13341
页数:21
相关论文
共 50 条
  • [31] Sign language recognition and translation network based on multi-view data
    Li, Ronghui
    Meng, Lu
    APPLIED INTELLIGENCE, 2022, 52 (13) : 14624 - 14638
  • [32] Fusion by synthesizing: A multi-view deep neural network for zero-shot recognition
    Xu, Xing
    Zhou, Xiang
    Shen, Fumin
    Gao, Lianli
    Shen, Heng Tao
    Li, Xuelong
    SIGNAL PROCESSING, 2019, 164 : 354 - 367
  • [33] Hierarchical Graph Attention Based Multi-View Convolutional Neural Network for 3D Object Recognition
    Zeng, Hui
    Zhao, Tianmeng
    Cheng, Ruting
    Wang, Fuzhou
    Liu, Jiwei
    IEEE ACCESS, 2021, 9 (09): : 33323 - 33335
  • [34] Gesture accuracy recognition based on grayscale image of surface electromyogram signal and multi-view convolutional neural network
    Chen, Qingzheng
    Tao, Qing
    Zhang, Xiaodong
    Hu, Xuezheng
    Zhang, Tianle
    Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2024, 41 (06): : 1153 - 1160
  • [35] Predicting vehicle fuel consumption based on multi-view deep neural network
    Li, Yawen
    Zeng, Isabella Yunfei
    Niu, Ziheng
    Shi, Jiahao
    Wang, Ziyang
    Guan, Zeli
    NEUROCOMPUTING, 2022, 502 : 140 - 147
  • [36] Using a Multi-view Convolutional Neural Network to monitor solar irradiance
    Huertas-Tato, Javier
    Galvan, Ines M.
    Aler, Ricardo
    Javier Rodriguez-Benitez, Francisco
    Pozo-Vazquez, David
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (13): : 10295 - 10307
  • [37] Deep Neural Network for Handcrafted Cost-based Multi-view Stereo
    Jeon, Yoonbae
    Park, In Kyu
    INTERNATIONAL WORKSHOP ON ADVANCED IMAGING TECHNOLOGY (IWAIT) 2021, 2021, 11766
  • [38] Multi-view fusion for recommendation with attentive deep neural network
    Jing, Wang
    Sangaiah, Arun Kumar
    Wei, Liu
    Shaopeng, Liu
    Lei, Liu
    Ruishi, Liang
    EVOLUTIONARY INTELLIGENCE, 2022, 15 (04) : 2619 - 2629
  • [39] Multi-view fusion for recommendation with attentive deep neural network
    Wang Jing
    Arun Kumar Sangaiah
    Liu Wei
    Liu Shaopeng
    Liu Lei
    Liang Ruishi
    Evolutionary Intelligence, 2022, 15 : 2619 - 2629
  • [40] Using a Multi-view Convolutional Neural Network to monitor solar irradiance
    Javier Huertas-Tato
    Inés M. Galván
    Ricardo Aler
    Francisco Javier Rodríguez-Benítez
    David Pozo-Vázquez
    Neural Computing and Applications, 2022, 34 : 10295 - 10307