Complementary spatial transformer network for real-time 3D object recognition

被引:0
|
作者
Krishna Kumar, K. P. [1 ]
Paul, Varghese [2 ]
机构
[1] APJ Abdul Kalam Technol Univ, CET Campus, Thiruvananthapuram 695016, Kerala, India
[2] Rajagiri Sch Engn & Technol, Dept Comp Sci & Engn, Kochi 682039, Kerala, India
关键词
3D object recognition; Spatial transformer network; Spatial entropy; Target space; Real-time tiny deep learning models;
D O I
10.1007/s11554-023-01340-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tiny Deep Learning Models offer many advantages in various applications. From the perspective of statistical machine learning theory the contributions of this paper is to complement the research advances and results obtained so far in real-time 3D object recognition. We propose a Tiny Deep Learning Model named Complementary Spatial Transformer Network (CSTN) for Real-Time 3D object recognition. It turns out that CSTN's working, and analysis are much simplified in a target space setting. We make algorithmic enhancements to perform CSTN computations faster and keep the learning part of CSTN in minimal size. Finally, we provide the experimental verifications of the results obtained in publicly available point cloud data sets ModelNet40 and ShapeNetCore with our model performing 1.65-2 times better in DPS (Detections/s) rate on GPU hardware for 3D object recognition, when compared to state-of-the-art networks. Complementary Spatial Transformer Network architecture requires only 10-35% of trainable parameters, when compared to state-of-the-art networks, making the network easier to deploy in edge AI devices.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Complementary spatial transformer network for real-time 3D object recognitionA tiny deep learning model in target space
    K. P. Krishna Kumar
    Varghese Paul
    Journal of Real-Time Image Processing, 2023, 20
  • [2] VoxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition
    Maturana, Daniel
    Scherer, Sebastian
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 922 - 928
  • [3] Real-Time 3D Single Object Tracking With Transformer
    Shan, Jiayao
    Zhou, Sifan
    Cui, Yubo
    Fang, Zheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2339 - 2353
  • [4] PointNet: A 3D Convolutional Neural Network for Real-Time Object Class Recognition
    Garcia-Garcia, A.
    Gomez-Donoso, F.
    Garcia-Rodriguez, J.
    Orts-Escolano, S.
    Cazorla, M.
    Azorin-Lopez, J.
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1578 - 1584
  • [5] 3D gesture based real-time object selection and recognition
    Raheja, Jagdish Lal
    Chandra, Mona
    Chaudhary, Ankit
    PATTERN RECOGNITION LETTERS, 2018, 115 : 14 - 19
  • [6] Real-Time 3D Object Detection and Recognition using a Smartphone
    Chen, Jin
    Zhu, Zhigang
    IMPROVE: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND VISION ENGINEERING, 2022, : 158 - 165
  • [7] Real-time 3D object recognition for automatic tracker initialization
    Blaskó, G
    Fua, P
    IEEE AND ACM INTERNATIONAL SYMPOSIUM ON AUGMENTED REALITY, PROCEEDINGS, 2001, : 175 - 176
  • [8] Real-Time 3D Visual Sensor for Robust Object Recognition
    Attamimi, Muhammad
    Mizutani, Akira
    Nakamura, Tomoaki
    Nagai, Takayuki
    Funakoshi, Kotaro
    Nakano, Mikio
    IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 4560 - 4565
  • [9] REAL-TIME 3D OBJECT TRACKING
    STEPHENS, RS
    IMAGE AND VISION COMPUTING, 1990, 8 (01) : 91 - 96
  • [10] Real-time 3D Recognition of Manipulated Object by Robot Hand Using 3D Sensor
    Minowa, Ryo
    Namiki, Akio
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 1798 - 1803