A branched Convolutional Neural Network for RGB-D image classification of ceramic pieces

Cited by: 0
Authors
Carreira, Daniel [1]
Rodrigues, Nuno [1]
Miragaia, Rolando [1]
Costa, Paulo [1]
Ribeiro, Jose [1]
Gaspar, Fabio [1]
Pereira, Antonio [1, 2]
Affiliations
[1] Polytech Inst Leiria, Comp Sci & Commun Res Ctr, Sch Technol & Management, P-2411901 Leiria, Portugal
[2] Leiria Off, Inst New Technol, INOV INESC Inovacao, P-2411901 Leiria, Portugal
Keywords
Ceramic manufacturing; Convolutional neural network; Data fusion; Image classification; RGB-D
DOI
10.1016/j.asoc.2024.112088
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
From smart sensors on assembly lines to robots performing complex tasks, the fourth industrial revolution is rapidly transforming manufacturing. The growing prominence of 3D cameras in industry has led the computer vision community to explore new ways of integrating depth and color data to achieve the higher precision that ensuring product quality in manufacturing requires. In this study, we introduce a branched convolutional neural network designed for high-speed classification of multimodal images, such as RGB-Depth (RGB-D) images. The core idea of the branched approach is that each branch specializes as a dedicated feature extractor for a single modality; the branch outputs are then merged (intermediate fusion) to enable effective classification. Our model is fed with our novel multimodal dataset, CeramicNet, which is composed of 8 classes and includes RGB, depth, and RGB-D variations to enable extensive experimentation and evaluation of the models. To the best of our knowledge, no such dataset has previously been introduced in the computer vision community. We conducted a series of experiments on the CeramicNet dataset aimed at fine-tuning the model, assessing the influence of various depth technologies, exploring individual modalities, examining their collective impact, and performing comprehensive data analysis. Compared against seven widely used models, our solution ranked first with a precision of 99.89%, a lead of over 1% over the nearest competitor. Moreover, the proposed solution yields an inference time of 127.6 ms, nearly three times faster than the second-best performer.
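The branched, intermediate-fusion idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' architecture: the single convolution per branch, the four filters per branch, the 8x8 input size, the random weights, and all function names (`correlate`, `branch`, `classify`) are assumptions chosen only to keep the example small and dependency-free. The only details taken from the abstract are the two modality-specific branches, the merge of their features before classification, and the 8 output classes.

```python
import math
import random

NUM_CLASSES = 8  # the abstract states that CeramicNet has 8 classes


def correlate(image, kernel):
    """Valid 2D cross-correlation of one channel with one kernel."""
    kh, kw = len(kernel), len(kernel[0])
    return [[sum(image[i + a][j + b] * kernel[a][b]
                 for a in range(kh) for b in range(kw))
             for j in range(len(image[0]) - kw + 1)]
            for i in range(len(image) - kh + 1)]


def branch(channels, filters):
    """Hypothetical single-modality extractor: conv -> ReLU -> global average pool."""
    features = []
    for filt in filters:  # each filter holds one kernel per input channel
        maps = [correlate(ch, k) for ch, k in zip(channels, filt)]
        h, w = len(maps[0]), len(maps[0][0])
        activated = [max(0.0, sum(m[i][j] for m in maps))  # sum channels, ReLU
                     for i in range(h) for j in range(w)]
        features.append(sum(activated) / (h * w))
    return features


def classify(rgb, depth, rgb_filters, depth_filters, weights, biases):
    """Intermediate fusion: concatenate branch features, then classify."""
    fused = branch(rgb, rgb_filters) + branch([depth], depth_filters)
    logits = [sum(w * f for w, f in zip(row, fused)) + b
              for row, b in zip(weights, biases)]
    peak = max(logits)  # subtract the max for a numerically stable softmax
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


rng = random.Random(0)


def rand_grid(h, w):
    """Random values standing in for pixels or untrained weights."""
    return [[rng.uniform(-1, 1) for _ in range(w)] for _ in range(h)]


rgb = [rand_grid(8, 8) for _ in range(3)]   # three color channels
depth = rand_grid(8, 8)                     # one depth channel
rgb_filters = [[rand_grid(3, 3) for _ in range(3)] for _ in range(4)]
depth_filters = [[rand_grid(3, 3)] for _ in range(4)]
weights = [[rng.uniform(-1, 1) for _ in range(8)] for _ in range(NUM_CLASSES)]
biases = [0.0] * NUM_CLASSES

probs = classify(rgb, depth, rgb_filters, depth_filters, weights, biases)
print("predicted class index:", max(range(NUM_CLASSES), key=probs.__getitem__))
```

Because the weights are random, the predicted class is meaningless; the point is only the data flow, in which RGB and depth are processed separately and meet at the concatenation step rather than at the input (early fusion) or at the decision level (late fusion).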
Pages: 13
Related papers
50 in total
  • [1] RGB-D static gesture recognition based on convolutional neural network
    Xie, Bin
    He, Xiaoyu
    Li, Yi
    JOURNAL OF ENGINEERING-JOE, 2018, (16): : 1515 - 1520
  • [2] CNN-CA: Convolutional Neural Network Combined with Active Contour for Image RGB-D Segmentation
    Boussit, Yoann
    Fresse, Virginie
    Konik, Hubert
    Morand, Karynn
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 4, 2023, 465 : 251 - 265
  • [3] Reduced Biquaternion Stacked Denoising Convolutional AutoEncoder for RGB-D Image Classification
    Huang, Xiang
    Gai, Shan
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1205 - 1209
  • [4] Multimodal Convolutional Neural Network for Object Detection Using RGB-D Images
    Mocanu, Irina
    Clapon, Cosmin
    2018 41ST INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2018, : 307 - 310
  • [5] Gait Recognition Using Convolutional Neural Network with RGB-D Sensor Data
    Ozaki, Fumio
    2020 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2020, : 213 - 218
  • [6] Grading Fruits and Vegetables Using RGB-D Images and Convolutional Neural Network
    Nishi, Toshiki
    Kurogi, Shuichi
    Matsuo, Kazuya
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 3222 - 3227
  • [7] Vehicle Detection Algorithm Based on Convolutional Neural Network and RGB-D Images
    Wang Decheng
    Chen Xiangning
    Feng, Zhao
    Sun Haoran
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (18)
  • [8] Convolutional Neural Network for 3D Object Recognition Based on RGB-D Dataset
    Wang, Jianhua
    Lu, Jinjin
    Chen, Weihai
    Wu, Xingming
    PROCEEDINGS OF THE 2015 10TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, 2015, : 34 - 39
  • [9] RGB-D Indoor Object Recognition Algorithm Based on Fusion Convolutional Neural Network
    Wang, Decheng
    Yi, Hui
    Zhao, Feng
    2019 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, AUTOMATION AND CONTROL TECHNOLOGIES (AIACT 2019), 2019, 1267
  • [10] Hybrid RGB-D Object Recognition using Convolutional Neural Network and Fisher Vector
    Li, Wei
    Cao, Zhiguo
    Xiao, Yang
    Fang, Zhiwen
    2015 CHINESE AUTOMATION CONGRESS (CAC), 2015, : 506 - 511