Complementary spatial transformer network for real-time 3D object recognition

被引:0
|
作者
Krishna Kumar, K. P. [1 ]
Paul, Varghese [2 ]
机构
[1] APJ Abdul Kalam Technol Univ, CET Campus, Thiruvananthapuram 695016, Kerala, India
[2] Rajagiri Sch Engn & Technol, Dept Comp Sci & Engn, Kochi 682039, Kerala, India
关键词
3D object recognition; Spatial transformer network; Spatial entropy; Target space; Real-time tiny deep learning models;
D O I
10.1007/s11554-023-01340-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tiny Deep Learning Models offer many advantages in various applications. From the perspective of statistical machine learning theory the contributions of this paper is to complement the research advances and results obtained so far in real-time 3D object recognition. We propose a Tiny Deep Learning Model named Complementary Spatial Transformer Network (CSTN) for Real-Time 3D object recognition. It turns out that CSTN's working, and analysis are much simplified in a target space setting. We make algorithmic enhancements to perform CSTN computations faster and keep the learning part of CSTN in minimal size. Finally, we provide the experimental verifications of the results obtained in publicly available point cloud data sets ModelNet40 and ShapeNetCore with our model performing 1.65-2 times better in DPS (Detections/s) rate on GPU hardware for 3D object recognition, when compared to state-of-the-art networks. Complementary Spatial Transformer Network architecture requires only 10-35% of trainable parameters, when compared to state-of-the-art networks, making the network easier to deploy in edge AI devices.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Real-time 3D semi-local surface patch extraction using GPGPUApplication to 3D object recognition
    Sergio Orts-Escolano
    Vicente Morell
    Jose Garcia-Rodriguez
    Miguel Cazorla
    Robert B. Fisher
    Journal of Real-Time Image Processing, 2015, 10 : 647 - 666
  • [22] T-C3D: Temporal Convolutional 3D Network for Real-Time Action Recognition
    Liu, Kun
    Liu, Wu
    Gan, Chuang
    Tan, Mingkui
    Ma, Huadong
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7138 - 7145
  • [23] Real-time recognition of human gestures for 3D interaction
    Jaume-i-Capo, Antoni
    Varona, Javier
    Perales, Francisco J.
    ARTICULATED MOTION AND DEFORMABLE OBJECTS, PROCEEDINGS, 2008, 5098 : 419 - 430
  • [24] Learning and Recognition of 3D Visual Objects in Real-Time
    Hamid, Shihab
    Hengst, Bernhard
    AI 2009: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5866 : 150 - 159
  • [25] Real-Time 3D Object Detection, Recognition and Presentation Using a Mobile Device for Assistive Navigation
    Chen J.
    Zhu Z.
    SN Computer Science, 4 (5)
  • [26] Real-Time Environmental Contour Construction Using 3D LiDAR and Image Recognition with Object Removal
    Wu, Tzu-Jung
    He, Rong
    Peng, Chao-Chung
    REMOTE SENSING, 2024, 16 (23)
  • [27] Real-time 3D
    Coco, D
    COMPUTER GRAPHICS WORLD, 1995, 18 (12) : 22 - +
  • [28] Real-time Spatial-temporal Context Approach for 3D Object Detection using LiDAR
    Kumar, K. S. Chidanand
    Al-Stouhi, Samir
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON VEHICLE TECHNOLOGY AND INTELLIGENT TRANSPORT SYSTEMS (VEHITS), 2020, : 432 - 439
  • [29] Network algorithm real-time depth image 3D human recognition for augmented reality
    Renyong Huang
    Mingyi Sun
    Journal of Real-Time Image Processing, 2021, 18 : 307 - 319
  • [30] Network algorithm real-time depth image 3D human recognition for augmented reality
    Huang, Renyong
    Sun, Mingyi
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2021, 18 (02) : 307 - 319