Complementary spatial transformer network for real-time 3D object recognition

Cited by: 0

Authors:

Krishna Kumar, K. P. [1]

Paul, Varghese [2]

Affiliations:

[1] APJ Abdul Kalam Technol Univ, CET Campus, Thiruvananthapuram 695016, Kerala, India

[2] Rajagiri Sch Engn & Technol, Dept Comp Sci & Engn, Kochi 682039, Kerala, India

Source:

JOURNAL OF REAL-TIME IMAGE PROCESSING | 2023, Vol. 20, No. 05

Keywords:

3D object recognition; Spatial transformer network; Spatial entropy; Target space; Real-time tiny deep learning models;

DOI:

10.1007/s11554-023-01340-5

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Tiny deep learning models offer many advantages in a variety of applications. From the perspective of statistical machine learning theory, the contribution of this paper is to complement the research advances and results obtained so far in real-time 3D object recognition. We propose a tiny deep learning model named Complementary Spatial Transformer Network (CSTN) for real-time 3D object recognition. It turns out that the operation and analysis of CSTN are much simplified in a target space setting. We make algorithmic enhancements that speed up CSTN computations and keep the learning part of CSTN minimal in size. Finally, we provide experimental verification of the results on the publicly available point cloud data sets ModelNet40 and ShapeNetCore, where our model achieves a 1.65-2 times higher DPS (detections/s) rate on GPU hardware for 3D object recognition than state-of-the-art networks. The CSTN architecture requires only 10-35% of the trainable parameters of state-of-the-art networks, making the network easier to deploy on edge AI devices.

Pages: 12
