A branched Convolutional Neural Network for RGB-D image classification of ceramic pieces

被引:0
|
作者
Carreira, Daniel [1 ]
Rodrigues, Nuno [1 ]
Miragaia, Rolando [1 ]
Costa, Paulo [1 ]
Ribeiro, Jose [1 ]
Gaspar, Fabio [1 ]
Pereira, Antonio [1 ,2 ]
机构
[1] Polytech Inst Leiria, Comp Sci & Commun Res Ctr, Sch Technol & Management, P-2411901 Leiria, Portugal
[2] Leiria Off, Inst New Technol, INOV INESC Inovacao, P-2411901 Leiria, Portugal
关键词
Ceramic manufacturing; Convolutional neural network; Data fusion; Image classification; RGB-D;
D O I
10.1016/j.asoc.2024.112088
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
From smart sensors on assembly lines to robots performing complex tasks, the fourth industrial revolution is rapidly transforming manufacturing. The growing prominence of 3D cameras in the industry has led the computer vision community to explore innovative ways of integrating depth and color data to achieve higher precision, essential for ensuring product quality in manufacturing. In this study, we introduce an innovative branched convolutional neural network designed to produce high-speed classification of multimodal images, such as RGB-Depth (RGB-D) images. The fundamental concept underlying the branched approach is the specialization of each branch as a dedicated feature extractor for a single modality, followed by their merge (intermediate fusion) to enable effective classification. Feeding our model is our novel multimodal dataset, named CeramicNet, composed of 8 classes that include RGB, depth, and RGB-D variations to enable extensive experimentation and evaluation of the models which, to the best of our knowledge, has not been previously introduced in the computer vision community. We conducted a series of experiments on the CeramicNet dataset. These experiments aimed at fine-tuning the model, assessing the influence of various depth technologies, exploring individual modalities, examining their collective impact, and performing comprehensive data analysis. Comparing our solution against seven widely used models, we achieved remarkable results, securing the top position with a precision of 99.89, with a lead of over 1% against the nearest competitor. What is more, the proposed solution yields an inference time of 127.6 ms - being nearly three times faster than the second-best performer.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Quantum convolutional neural network for image classification
    Guoming Chen
    Qiang Chen
    Shun Long
    Weiheng Zhu
    Zeduo Yuan
    Yilin Wu
    Pattern Analysis and Applications, 2023, 26 : 655 - 667
  • [42] RGB-D Object Recognition and Pose Estimation based on Pre-trained Convolutional Neural Network Features
    Schwarz, Max
    Schulz, Hannes
    Behnke, Sven
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 1329 - 1335
  • [43] An RGB-D Descriptor for Object Classification
    Arican, Erkut
    Aydin, Tarkan
    ROMANIAN JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, 2022, 25 (3-4): : 338 - 349
  • [44] Semantic RGB-D Image Synthesis
    Li, Shijie
    Li, Rong
    Gall, Juergen
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 944 - 952
  • [45] Neural RGB-D Surface Reconstruction
    Azinovic, Dejan
    Martin-Brualla, Ricardo
    Goldman, Dan B.
    Niessner, Matthias
    Thies, Justus
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6280 - 6291
  • [46] GENDER RECOGNITION ON RGB-D IMAGE
    Zhang, Xiaoxiong
    Javed, Sajid
    Obeid, Ahmad
    Dias, Jorge
    Werghi, Naoufel
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1836 - 1840
  • [47] 3-D Convolutional Neural Networks for RGB-D Salient Object Detection and Beyond
    Chen, Qian
    Zhang, Zhenxi
    Lu, Yanye
    Fu, Keren
    Zhao, Qijun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 4309 - 4323
  • [48] RGB-D Salient Object Detection via 3D Convolutional Neural Networks
    Chen, Qian
    Liu, Ze
    Zhang, Yi
    Fu, Keren
    Zhao, Qijun
    Du, Hongwei
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1063 - 1071
  • [49] RGB-D CAMERA POSE ESTIMATION USING DEEP NEURAL NETWORK
    Guo, Fei
    He, Yifeng
    Guan, Ling
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 408 - 412
  • [50] 3-D Gabor Convolutional Neural Network for Hyperspectral Image Classification
    Jia, Sen
    Liao, Jianhui
    Xu, Meng
    Li, Yan
    Zhu, Jiasong
    Sun, Weiwei
    Jia, Xiuping
    Li, Qingquan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60