Micro-network-based deep convolutional neural network for human activity recognition from realistic and multi-view visual data

被引:8
|
作者
Kushwaha, Arati [1 ]
Khare, Ashish [1 ]
Prakash, Om [2 ]
机构
[1] Univ Allahabad, Dept Elect & Commun, Prayagraj, Uttar Pradesh, India
[2] HNB Garhwal Univ, Dept Comp Sci & Engn, Srinagar, India
来源
NEURAL COMPUTING & APPLICATIONS | 2023年 / 35卷 / 18期
关键词
Convolutional neural network; Human activity recognition; Micro-network; Softmax classifier; FEATURES; FRAMEWORK; IMAGE; TERM; BAG;
D O I
10.1007/s00521-023-08440-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the recent past, deep convolutional neural network (DCNN) has been used in majority of state-of-the-art methods due to its remarkable performance in number of computer vision applications. However, DCNN are computationally expensive and requires more resources as well as computational time. Also, deeper architectures are prone to overfitting problem, while small-size dataset is used. To address these limitations, we propose a simple and computationally efficient deep convolutional neural network (DCNN) architecture based on the concept multiscale processing for human activity recognition. We increased the width and depth of the network by carefully crafting the design of network, which results in improved utilization of computational resources. First, we designed a small micro-network with varying receptive field size convolutional kernels (1 x 1, 3 x 3, and 5 x 5) for extraction of unique discriminative information of human objects having variations in object size, pose, orientation, and view. Then, the proposed DCNN architecture is designed by stacking repeated building blocks of small micro-networks with same topology. Here, we factorize the larger convolutional operation in stack of smaller convolutional operations to make the network computationally efficient. The softmax classifier is used for activity classification. Advantage of the proposed architecture over standard deep architectures is its computational efficiency and flexibility to use with both small as well as large size datasets. To evaluate the effectiveness of the proposed architecture, several extensive experiments are conducted by using publically available datasets, namely UCF sports, IXMAS, YouTube, TV-HI, HMDB51, and UCF101 datasets. The activity recognition results have shown outperformance of the proposed method over other existing state-of-the-art methods.
引用
下载
收藏
页码:13321 / 13341
页数:21
相关论文
共 50 条
  • [1] Micro-network-based deep convolutional neural network for human activity recognition from realistic and multi-view visual data
    Arati Kushwaha
    Ashish Khare
    Om Prakash
    Neural Computing and Applications, 2023, 35 : 13321 - 13341
  • [2] A deep neural network model for multi-view human activity recognition
    Putra, Prasetia Utama
    Shima, Keisuke
    Shimatani, Koji
    PLOS ONE, 2022, 17 (01):
  • [3] Multi-view Face Recognition and Verification Based on Convolutional Neural Network
    Zeng, Xiongjun
    Wu, Qingxiang
    Han, Ming
    Huang, Xi
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [4] Vehicle Driving Behavior Recognition Based on Multi-View Convolutional Neural Network With Joint Data Augmentation
    Zhang, Yong
    Li, Junjie
    Guo, Yaohua
    Xu, Chaonan
    Bao, Jie
    Song, Yunpeng
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (05) : 4223 - 4234
  • [5] Multi-view neural network based gait recognition
    Fazli, Saeid
    Askarifar, Hadis
    Shoaie, Maryam Sheikh
    World Academy of Science, Engineering and Technology, 2010, 43 : 705 - 709
  • [6] MULTI-VIEW BISTATIC SYNTHETIC APERTURE RADAR TARGET RECOGNITION BASED ON MULTI-INPUT DEEP CONVOLUTIONAL NEURAL NETWORK
    Pei, Jifang
    Huo, Weibo
    Zhang, Qianghui
    Huang, Yulin
    Miao, Yuxuan
    Zhang, Yin
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2314 - 2317
  • [7] A Multi-View Gait Recognition Method Using Deep Convolutional Neural Network and Channel Attention Mechanism
    Wang, Jiabin
    Peng, Kai
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2020, 125 (01): : 345 - 363
  • [8] Multi-View Gait Recognition Based on a Spatial-Temporal Deep Neural Network
    Tong, Suibing
    Fu, Yuzhuo
    Yue, Xinwei
    Ling, Hefei
    IEEE ACCESS, 2018, 6 : 57583 - 57596
  • [9] 3D Point Cloud Recognition Based on a Multi-View Convolutional Neural Network
    Zhang, Le
    Sun, Jian
    Zheng, Qiang
    SENSORS, 2018, 18 (11)
  • [10] Configurable Convolutional Neural Network Accelerator Based on Multi-view Parallelism
    Ying S.
    Peng L.
    Gongcheng Kexue Yu Jishu/Advanced Engineering Science, 2022, 54 (02): : 188 - 195