Moving object recognition using multi-view three-dimensional convolutional neural networks

被引:25
|
作者
He, Tao [1 ]
Mao, Hua [1 ]
Yi, Zhang [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Machine Intelligence Lab, Chengdu 610065, Sichuan, Peoples R China
来源
NEURAL COMPUTING & APPLICATIONS | 2017年 / 28卷 / 12期
基金
美国国家科学基金会;
关键词
Moving object recognition; Multi-view learning; 3D convolutional neural networks; Feature extraction; Deep learning;
D O I
10.1007/s00521-016-2277-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Moving object recognition (MOR) is an important but challenging problem in the field of computer vision. The aim of MOR is to recognize moving objects in a given video dataset. Convolutional neural networks (CNNs) have been extensively used for image recognition and video analysis problems. Recently, a 3D-CNN, which contains 3D convolution layers, was proposed to address MOR problems by successfully extracting spatiotemporal features. In this paper, a multi-view (MV) 3D-CNN is proposed for MOR. This model combines 3D-CNNs with a well-known MV learning technique. Because multi-view learning techniques have the ability to obtain more view-related features from videos captured by different cameras, the proposed model can extract more representative features. Moreover, the model contains a special view-pooling layer that can fuse the feature information from previous layers. The proposed MV3D-CNN is applied to both real-world moving vehicle recognition and sign language recognition tasks. The experimental results show that the proposed model possesses good performance.
引用
收藏
页码:3827 / 3835
页数:9
相关论文
共 50 条
  • [1] Moving object recognition using multi-view three-dimensional convolutional neural networks
    Tao He
    Hua Mao
    Zhang Yi
    [J]. Neural Computing and Applications, 2017, 28 : 3827 - 3835
  • [2] 3D object recognition based on pairwise Multi-view Convolutional Neural Networks
    Gao, Z.
    Wang, D. Y.
    Xue, Y. B.
    Xu, G. P.
    Zhang, H.
    Wang, Y. L.
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 56 : 305 - 315
  • [3] MULTI-VIEW GAIT RECOGNITION USING 3D CONVOLUTIONAL NEURAL NETWORKS
    Wolf, Thomas
    Babaee, Mohammadreza
    Rigoll, Gerhard
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 4165 - 4169
  • [4] On Multi-view Interpretation of Convolutional Neural Networks
    Khastavaneh, Hassan
    Ebrahimpour-Komleh, Hossein
    [J]. 2019 IEEE 5TH CONFERENCE ON KNOWLEDGE BASED ENGINEERING AND INNOVATION (KBEI 2019), 2019, : 587 - 591
  • [5] Multi-view classification with convolutional neural networks
    Seeland, Marco
    Maeder, Patrick
    [J]. PLOS ONE, 2021, 16 (01):
  • [6] Multi-view Convolutional Neural Networks for 3D Shape Recognition
    Su, Hang
    Maji, Subhransu
    Kalogerakis, Evangelos
    Learned-Miller, Erik
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 945 - 953
  • [7] MVGCN: Multi-View Graph Convolutional Neural Network for Surface Defect Identification Using Three-Dimensional Point Cloud
    Wang, Yinan
    Sun, Wenbo
    Jin, Jionghua
    Kong, Zhenyu
    Yue, Xiaowei
    [J]. JOURNAL OF MANUFACTURING SCIENCE AND ENGINEERING-TRANSACTIONS OF THE ASME, 2023, 145 (03):
  • [8] A Novel Sketch-Based Three-Dimensional Shape Retrieval Method Using Multi-View Convolutional Neural Network
    Mao, Dianhui
    Hao, Zhihao
    [J]. SYMMETRY-BASEL, 2019, 11 (05):
  • [9] Multi-view face recognition using deep neural networks
    Zhao, Feng
    Li, Jing
    Zhang, Lu
    Li, Zhe
    Na, Sang-Gyun
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 111 (375-380): : 375 - 380
  • [10] Multi-view Face Detection Using Deep Convolutional Neural Networks
    Farfade, Sachin Sudhakar
    Saberian, Mohammad
    Li, Li-Jia
    [J]. ICMR'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2015, : 643 - 650