A branched Convolutional Neural Network for RGB-D image classification of ceramic pieces

被引：0

作者：

Carreira, Daniel ^{[1
]}

Rodrigues, Nuno ^{[1
]}

Miragaia, Rolando ^{[1
]}

Costa, Paulo ^{[1
]}

Ribeiro, Jose ^{[1
]}

Gaspar, Fabio ^{[1
]}

Pereira, Antonio ^{[1
,2
]}

机构：

[1] Polytech Inst Leiria, Comp Sci & Commun Res Ctr, Sch Technol & Management, P-2411901 Leiria, Portugal

[2] Leiria Off, Inst New Technol, INOV INESC Inovacao, P-2411901 Leiria, Portugal

来源：

APPLIED SOFT COMPUTING | 2024年 / 165卷

关键词：

Ceramic manufacturing; Convolutional neural network; Data fusion; Image classification; RGB-D;

D O I：

10.1016/j.asoc.2024.112088

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

From smart sensors on assembly lines to robots performing complex tasks, the fourth industrial revolution is rapidly transforming manufacturing. The growing prominence of 3D cameras in the industry has led the computer vision community to explore innovative ways of integrating depth and color data to achieve higher precision, essential for ensuring product quality in manufacturing. In this study, we introduce an innovative branched convolutional neural network designed to produce high-speed classification of multimodal images, such as RGB-Depth (RGB-D) images. The fundamental concept underlying the branched approach is the specialization of each branch as a dedicated feature extractor for a single modality, followed by their merge (intermediate fusion) to enable effective classification. Feeding our model is our novel multimodal dataset, named CeramicNet, composed of 8 classes that include RGB, depth, and RGB-D variations to enable extensive experimentation and evaluation of the models which, to the best of our knowledge, has not been previously introduced in the computer vision community. We conducted a series of experiments on the CeramicNet dataset. These experiments aimed at fine-tuning the model, assessing the influence of various depth technologies, exploring individual modalities, examining their collective impact, and performing comprehensive data analysis. Comparing our solution against seven widely used models, we achieved remarkable results, securing the top position with a precision of 99.89, with a lead of over 1% against the nearest competitor. What is more, the proposed solution yields an inference time of 127.6 ms - being nearly three times faster than the second-best performer.

引用

页数：13

共 50 条

[1] RGB-D static gesture recognition based on convolutional neural network
Xie, Bin
He, Xiaoyu
Li, Yi
JOURNAL OF ENGINEERING-JOE, 2018, (16): : 1515 - 1520
[2] CNN-CA: Convolutional Neural Network Combined with Active Contour for Image RGB-D Segmentation
Boussit, Yoann
Fresse, Virginie
Konik, Hubert
Morand, Karynn
PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 4, 2023, 465 : 251 - 265
[3] Reduced Biquaternion Stacked Denoising Convolutional AutoEncoder for RGB-D Image Classification
Huang, Xiang
Gai, Shan
IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1205 - 1209
[4] Multimodal Convolutional Neural Network for Object Detection Using RGB-D Images
Mocanu, Irina
Clapon, Cosmin
2018 41ST INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2018, : 307 - 310
[5] Gait Recognition Using Convolutional Neural Network with RGB-D Sensor Data
Ozaki, Fumio
2020 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2020, : 213 - 218
[6] Grading Fruits and Vegetables Using RGB-D Images and Convolutional Neural Network
Nishi, Toshiki
Kurogi, Shuichi
Matsuo, Kazuya
2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 3222 - 3227
[7] Vehicle Detection Algorithm Based on Convolutional Neural Network and RGB-D Images
Wang Decheng
Chen Xiangning
Feng, Zhao
Sun Haoran
LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (18)
[8] Convolutional Neural Network for 3D Object Recognition Based on RGB-D Dataset
Wang, Jianhua
Lu, Jinjin
Chen, Weihai
Wu, Xingming
PROCEEDINGS OF THE 2015 10TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, 2015, : 34 - 39
[9] RGB-D Indoor Object Recognition Algorithm Based on Fusion Convolutional Neural Network
Wang, Decheng
Yi, Hui
Zhao, Feng
2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, AUTOMATION AND CONTROL TECHNOLOGIES (AIACT 2019), 2019, 1267
[10] Hybrid RGB-D Object Recognition using Convolutional Neural Network and Fisher Vector
Li, Wei
Cao, Zhiguo
Xiao, Yang
Fang, Zhiwen
2015 CHINESE AUTOMATION CONGRESS (CAC), 2015, : 506 - 511

← 1 2 3 4 5 →