Convolutional neural networks for recognition and segmentation of aluminium profiles

被引:8
|
作者
Mazzeo, Pier Luigi [1 ]
Argentieri, Arturo [1 ]
De Luca, Federico [2 ]
Spagnolo, Paolo [1 ]
Distante, Cosimo [1 ]
Leo, Marco [1 ]
Carcagni, Pierluigi [1 ]
机构
[1] CNR, Inst Appl Sci & Intelligent Syst E Caianiello, Via Monteroni Snc,Univ Campus, Lecce, Italy
[2] Univ Salento, Via Monteroni Snc,Univ Campus, Lecce, Italy
关键词
Deep Learning; Neural Networks; Convolutional Neural Networks (CNN); Object Instance Segmentation; Mask R-CNN; Detectron; ResNet; FEATURES;
D O I
10.1117/12.2525687
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Learning to automatically recognize objects in the real world is a very important and stimulating challenge. This work deals with the problem of detecting aluminium profiles within images, using hierarchical representations such as those based on deep learning methods. The use of regional CNN, a conceptually simple, flexible, and general framework for object instance segmentation, allows to exceed the previous state-of-the-art results. This approach efficiently detects objects in an image while simultaneously generating a high-quality segmentation mask for each instance. Neural network training uses ResNet networks of depth 50 or 101 layers. In particular, the training dataset consists of synthetic data generated by CAD files. The Dataset creation process is fundamental: experimental results show that trivial datasets lead to poor detection performance. A rich dataset, instead, including more complex images, allows the network to learn more and better guaranteeing excellent results. How to get more data, if you do not have more data? To get more data, we just need to make minor alterations such as flips, scale or rotations to existing dataset. This process is known as Data augmentation. The performance of the proposed system strongly depends on the dataset used for training and on the backbone architecture used. This why, adopting a strategy that generates a priori a very large dataset the time required to create the annotation file grows almost exponentially. The validation and test dataset, on the other hand, consists of real images captured by cameras. The distinctiveness of this work consists to train a deep neural network with synthetic data (aluminium CAD files) and verify if it on real data. Experiments show that the implementation of architecture, as described above, leads to good performance in automatic detection and classification. Future work will be addressed to improve the network training process together with the architecture, the algorithms and the dataset creation process. The latter is proved to be fundamental for the balance and optimization of the whole process. The way is to develop not much augmented datasets focusing on online data augmentation during network training.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Aircraft Type Recognition Based on Segmentation With Deep Convolutional Neural Networks
    Zuo, Jiawei
    Xu, Guangluan
    Fu, Kun
    Sun, Xian
    Sun, Hao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (02) : 282 - 286
  • [2] Convolutional Neural Networks for Phoneme Recognition
    Glackin, Cornelius
    Wall, Julie
    Chollet, Gerard
    Dugan, Nazim
    Cannings, Nigel
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM 2018), 2018, : 190 - 195
  • [3] Convolutional Neural Networks for Speech Recognition
    Abdel-Hamid, Ossama
    Mohamed, Abdel-Rahman
    Jiang, Hui
    Deng, Li
    Penn, Gerald
    Yu, Dong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (10) : 1533 - 1545
  • [4] Convolutional neural networks for face recognition
    Lawrence, S
    Giles, CL
    Tsoi, AC
    1996 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1996, : 217 - 222
  • [5] Language recognition by convolutional neural networks
    Pour, L. Khosravani
    Farrokhi, A.
    SCIENTIA IRANICA, 2023, 30 (01) : 116 - 123
  • [6] Large-scale Multimodal Gesture Segmentation and Recognition based on Convolutional Neural Networks
    Wang, Huogen
    Wang, Pichao
    Song, Zhanjie
    Li, Wanqing
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 3138 - 3146
  • [7] Convolutional Neural Networks for SAR Image Segmentation
    Malmgren-Hansen, David
    Nobel-Jorgensen, Morten
    2015 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2015, : 231 - 236
  • [8] Egyptian Hieroglyphs Segmentation with Convolutional Neural Networks
    Guidi, Tommaso
    Python, Lorenzo
    Forasassi, Matteo
    Cucci, Costanza
    Franci, Massimiliano
    Argenti, Fabrizio
    Barucci, Andrea
    ALGORITHMS, 2023, 16 (02)
  • [9] Convolutional neural networks for brain tumour segmentation
    Bhandari, Abhishta
    Koppen, Jarrad
    Agzarian, Marc
    INSIGHTS INTO IMAGING, 2020, 11 (01)
  • [10] Group Convolutional Neural Networks for DWI Segmentation
    Liu, Renfei
    Lauze, Francois
    Bekkers, Erik
    Erleben, Kenny
    Darkner, Sune
    GEOMETRIC DEEP LEARNING IN MEDICAL IMAGE ANALYSIS, VOL 194, 2022, 194 : 96 - 106