Scene Description for Visually Impaired People with Multi-Label Convolutional SVM Networks

Cited by: 10
Authors
Bazi, Yakoub [1]
Alhichri, Haikel [1]
Alajlan, Naif [1]
Melgani, Farid [2]
Affiliations
[1] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh 11543, Saudi Arabia
[2] Univ Trento, Dept Informat Engn & Comp Sci, Via Sommarive 9, I-38123 Trento, Italy
Source
APPLIED SCIENCES-BASEL | 2019 / Vol. 9 / Issue 23
Keywords
visually impaired (VI); computer vision; deep learning; multi-label convolutional support vector machine (M-CSVM); OBJECT DETECTION; RECOGNITION; AID;
DOI
10.3390/app9235062
Chinese Library Classification (CLC) number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
In this paper, we present a portable camera-based method for helping visually impaired (VI) people to recognize multiple objects in images. This method relies on a novel multi-label convolutional support vector machine (CSVM) network for coarse description of images. The core idea of CSVM is to use a set of linear SVMs as filter banks for feature map generation. During the training phase, the weights of the SVM filters are obtained using a forward-supervised learning strategy, unlike the backpropagation algorithm used in standard convolutional neural networks (CNNs). To handle multi-label detection, we introduce a multi-branch CSVM architecture, where each branch is used for detecting one object in the image. This architecture exploits the correlation between the objects present in the image by means of a suitable fusion mechanism applied to the intermediate outputs provided by the convolution layers of each branch. The high-level reasoning of the network is done through binary classification SVMs that predict the presence or absence of objects in the image. Experimental results are reported and discussed for two indoor datasets and one outdoor dataset, acquired with a portable camera mounted on a lightweight shield worn by the user and connected via a USB wire to a laptop processing unit.
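The idea of using a linear SVM as a convolution filter can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how training patches are labelled or which solver is used, so the sketch assumes that every patch inherits the label of its image and trains the SVM with plain subgradient descent on the hinge loss. All function names (`extract_patches`, `train_linear_svm`, `svm_feature_map`) and the toy data are hypothetical. There is no backpropagation: the filter is fitted directly from labelled patches and then slid over the image.

```python
import numpy as np

def extract_patches(image, k):
    """Return every k x k patch of a 2-D image as one row of a matrix."""
    height, width = image.shape
    rows = [image[i:i + k, j:j + k].ravel()
            for i in range(height - k + 1)
            for j in range(width - k + 1)]
    return np.asarray(rows)

def train_linear_svm(X, y, lam=0.01, epochs=200, lr=0.1):
    """Fit (w, b) by subgradient descent on the L2-regularised hinge loss.

    Assumption: this simple solver stands in for any linear SVM solver.
    Labels y must be in {-1, +1}.
    """
    n, d = X.shape
    w = np.zeros(d)
    b = 0.0
    for _ in range(epochs):
        margins = y * (X @ w + b)
        active = margins < 1  # samples that violate the margin
        grad_w = lam * w - (y[active][:, None] * X[active]).sum(axis=0) / n
        grad_b = -y[active].sum() / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

def svm_feature_map(image, w, b, k):
    """Apply the SVM to every patch (a 'valid' convolution), then ReLU."""
    height, width = image.shape
    scores = extract_patches(image, k) @ w + b
    return np.maximum(scores, 0.0).reshape(height - k + 1, width - k + 1)

# Toy data (hypothetical): a bright image contains the "object", a dark one does not.
k = 3
img_pos = np.ones((6, 6))
img_neg = np.zeros((6, 6))

# Assumption: each patch inherits the label of the image it came from.
X = np.vstack([extract_patches(img_pos, k), extract_patches(img_neg, k)])
y = np.concatenate([np.ones(16), -np.ones(16)])

w, b = train_linear_svm(X, y)
fmap_pos = svm_feature_map(img_pos, w, b, k)
fmap_neg = svm_feature_map(img_neg, w, b, k)
print(fmap_pos.shape, fmap_pos.mean() > fmap_neg.mean())
```

Training several such SVMs would give a bank of filters, and stacking their feature maps would give the input to the next layer. The multi-branch fusion described in the abstract is not covered by this sketch.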
Pages: 13