Complementary spatial transformer network for real-time 3D object recognition

被引：0

作者：

Krishna Kumar, K. P. ^{[1
]}

Paul, Varghese ^{[2
]}

机构：

[1] APJ Abdul Kalam Technol Univ, CET Campus, Thiruvananthapuram 695016, Kerala, India

[2] Rajagiri Sch Engn & Technol, Dept Comp Sci & Engn, Kochi 682039, Kerala, India

来源：

JOURNAL OF REAL-TIME IMAGE PROCESSING | 2023年 / 20卷 / 05期

关键词：

3D object recognition; Spatial transformer network; Spatial entropy; Target space; Real-time tiny deep learning models;

D O I：

10.1007/s11554-023-01340-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Tiny Deep Learning Models offer many advantages in various applications. From the perspective of statistical machine learning theory the contributions of this paper is to complement the research advances and results obtained so far in real-time 3D object recognition. We propose a Tiny Deep Learning Model named Complementary Spatial Transformer Network (CSTN) for Real-Time 3D object recognition. It turns out that CSTN's working, and analysis are much simplified in a target space setting. We make algorithmic enhancements to perform CSTN computations faster and keep the learning part of CSTN in minimal size. Finally, we provide the experimental verifications of the results obtained in publicly available point cloud data sets ModelNet40 and ShapeNetCore with our model performing 1.65-2 times better in DPS (Detections/s) rate on GPU hardware for 3D object recognition, when compared to state-of-the-art networks. Complementary Spatial Transformer Network architecture requires only 10-35% of trainable parameters, when compared to state-of-the-art networks, making the network easier to deploy in edge AI devices.

引用

页数：12

共 50 条

[1] Complementary spatial transformer network for real-time 3D object recognitionA tiny deep learning model in target space
K. P. Krishna Kumar
Varghese Paul
Journal of Real-Time Image Processing, 2023, 20
[2] VoxNet: A 3D Convolutional Neural Network for Real-Time Object Recognition
Maturana, Daniel
Scherer, Sebastian
2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 922 - 928
[3] Real-Time 3D Single Object Tracking With Transformer
Shan, Jiayao
Zhou, Sifan
Cui, Yubo
Fang, Zheng
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2339 - 2353
[4] PointNet: A 3D Convolutional Neural Network for Real-Time Object Class Recognition
Garcia-Garcia, A.
Gomez-Donoso, F.
Garcia-Rodriguez, J.
Orts-Escolano, S.
Cazorla, M.
Azorin-Lopez, J.
2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1578 - 1584
[5] 3D gesture based real-time object selection and recognition
Raheja, Jagdish Lal
Chandra, Mona
Chaudhary, Ankit
PATTERN RECOGNITION LETTERS, 2018, 115 : 14 - 19
[6] Real-Time 3D Object Detection and Recognition using a Smartphone
Chen, Jin
Zhu, Zhigang
IMPROVE: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND VISION ENGINEERING, 2022, : 158 - 165
[7] Real-time 3D object recognition for automatic tracker initialization
Blaskó, G
Fua, P
IEEE AND ACM INTERNATIONAL SYMPOSIUM ON AUGMENTED REALITY, PROCEEDINGS, 2001, : 175 - 176
[8] Real-Time 3D Visual Sensor for Robust Object Recognition
Attamimi, Muhammad
Mizutani, Akira
Nakamura, Tomoaki
Nagai, Takayuki
Funakoshi, Kotaro
Nakano, Mikio
IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 4560 - 4565
[9] REAL-TIME 3D OBJECT TRACKING
STEPHENS, RS
IMAGE AND VISION COMPUTING, 1990, 8 (01) : 91 - 96
[10] Real-time 3D Recognition of Manipulated Object by Robot Hand Using 3D Sensor
Minowa, Ryo
Namiki, Akio
2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 1798 - 1803

← 1 2 3 4 5 →