Convolutional encoder-decoder networks for pixel-wise ear detection and segmentation

被引:27
|
作者
Emersic, Ziga [1 ]
Gabriel, Luka L. [2 ]
Struc, Vitomir [3 ]
Peer, Peter [1 ]
机构
[1] Univ Ljubljana, Fac Comp & Informat Sci, Vecna Pot 113, SL-1000 Ljubljana, Slovenia
[2] KTH Royal Inst Technol, SE-10044 Stockholm, Sweden
[3] Univ Ljubljana, Fac Elect Engn, Trzaska 25, SL-1000 Ljubljana, Slovenia
关键词
object detection; computer vision; biometrics (access control); feature extraction; image segmentation; ear; convolutional encoder-decoder; pixel-wise ear detection; machine vision; biometric recognition systems; entire recognition system; ear accessories; ear images; ear detection technique; two-class segmentation problem; design; image-pixels; nonear class; detected ear; pixel-wise information; good detection results; RECOGNITION;
D O I
10.1049/iet-bmt.2017.0240
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection and segmentation represents the basis for many tasks in computer and machine vision. In biometric recognition systems the detection of the region-of-interest (ROI) is one of the most crucial steps in the processing pipeline, significantly impacting the performance of the entire recognition system. Existing approaches to ear detection, are commonly susceptible to the presence of severe occlusions, ear accessories or variable illumination conditions and often deteriorate in their performance if applied on ear images captured in unconstrained settings. To address these shortcomings, we present a novel ear detection technique based on convolutional encoder-decoder networks (CEDs). We formulate the problem of ear detection as a two-class segmentation problem and design and train a CED-network architecture to distinguish between image-pixels belonging to the ear and the non-ear class. Unlike competing techniques, our approach does not simply return a bounding box around the detected ear, but provides detailed, pixel-wise information about the location of the ears in the image. Experiments on a dataset gathered from the web (a.k.a. in the wild) show that the proposed technique ensures good detection results in the presence of various covariate factors and significantly outperforms competing methods from the literature.
引用
收藏
页码:175 / 184
页数:10
相关论文
共 50 条
  • [1] Pixel-Wise Segmentation of SAR Imagery Using Encoder-Decoder Network and Fully-Connected CRF
    Gao, Fei
    He, Yishan
    Wang, Jun
    Ma, Fei
    Yang, Erfu
    Hussain, Amir
    [J]. ADVANCES IN BRAIN INSPIRED COGNITIVE SYSTEMS, 2020, 11691 : 155 - 165
  • [2] Encoder-Decoder Architecture for Crop-Weed Classification Using Pixel-Wise Labelling
    Umamaheswari, S.
    Jain, Ashvini, V
    [J]. 2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2020,
  • [3] Pixel-wise confidence estimation for segmentation in Bayesian Convolutional Neural Networks
    Rémi Martin
    Luc Duong
    [J]. Machine Vision and Applications, 2023, 34
  • [4] Pixel-wise confidence estimation for segmentation in Bayesian Convolutional Neural Networks
    Martin, Remi
    Duong, Luc
    [J]. MACHINE VISION AND APPLICATIONS, 2023, 34 (01)
  • [5] Deep Convolutional Encoder-Decoder for Myelin and Axon Segmentation
    Mesbah, Rassoul
    McCane, Brendan
    Mills, Steven
    [J]. PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2016, : 226 - 231
  • [6] Encoder-decoder based convolutional neural networks for image forgery detection
    El Biach, Fatima Zahra
    Iala, Imad
    Laanaya, Hicham
    Minaoui, Khalid
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (16) : 22611 - 22628
  • [7] Optimizing the Hyperparameters of Fully Convolutional Encoder-Decoder Networks for SAR Image Segmentation
    Liu, Yuanyue
    Zhao, Jin
    Fan, Jianchao
    Wang, Jun
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [8] Encoder-decoder based convolutional neural networks for image forgery detection
    Fatima Zahra El Biach
    Imad Iala
    Hicham Laanaya
    Khalid Minaoui
    [J]. Multimedia Tools and Applications, 2022, 81 : 22611 - 22628
  • [9] Cascaded deep convolutional encoder-decoder neural networks for efficient liver tumor segmentation
    Budak, Umit
    Guo, Yanhui
    Tanyildizi, Erkan
    Sengur, Abdulkadir
    [J]. MEDICAL HYPOTHESES, 2020, 134
  • [10] MRI Brain Tumor Segmentation Using Deep Encoder-Decoder Convolutional Neural Networks
    Yan, Benjamin B.
    Wei, Yujia
    Jagtap, Jaidip Manikrao M.
    Moassefi, Mana
    Garcia, Diana V. Vera
    Singh, Yashbir
    Vahdati, Sanaz
    Faghani, Shahriar
    Erickson, Bradley J.
    Conte, Gian Marco
    [J]. BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2021, PT II, 2022, 12963 : 80 - 89