Scene Description for Visually Impaired People with Multi-Label Convolutional SVM Networks

Cited by: 10

Authors:

Bazi, Yakoub [1]

Alhichri, Haikel [1]

Alajlan, Naif [1]

Melgani, Farid [2]

Affiliations:

[1] King Saud Univ, Coll Comp & Informat Sci, Dept Comp Engn, Riyadh 11543, Saudi Arabia

[2] Univ Trento, Dept Informat Engn & Comp Sci, Via Sommarive 9, I-38123 Trento, Italy

Source:

APPLIED SCIENCES-BASEL | 2019, Vol. 9, Issue 23

Keywords:

visually impaired (VI); computer vision; deep learning; multi-label convolutional support vector machine (M-CSVM); OBJECT DETECTION; RECOGNITION; AID;

DOI:

10.3390/app9235062

Chinese Library Classification number:

O6 [Chemistry];

Discipline classification code:

0703 ;

Abstract:

In this paper, we present a portable camera-based method that helps visually impaired (VI) people recognize multiple objects in images. The method relies on a novel multi-label convolutional support vector machine (CSVM) network for coarse description of images. The core idea of CSVM is to use a set of linear SVMs as filter banks for feature map generation. During the training phase, the weights of the SVM filters are obtained with a forward-supervised learning strategy, unlike the backpropagation algorithm used in standard convolutional neural networks (CNNs). To handle multi-label detection, we introduce a multi-branch CSVM architecture in which each branch detects one object in the image. This architecture exploits the correlation between the objects present in the image through a suitable fusion mechanism applied to the intermediate outputs of the convolution layers of each branch. The high-level reasoning of the network is performed by binary classification SVMs that predict the presence or absence of objects in the image. We report and discuss experiments on two indoor datasets and one outdoor dataset, acquired with a portable camera mounted on a lightweight shield worn by the user and connected via a USB wire to a laptop processing unit.
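The idea of using a linear SVM as a convolution filter can be illustrated with a toy sketch. This is not the authors' implementation: the patch-labelling rule (every patch inherits its image's label), the subgradient-descent trainer, the ReLU on the decision values, and all function names and synthetic data below are assumptions made for illustration. It only shows that a filter can be learned in a single forward, supervised pass without backpropagation.

```python
import numpy as np

def extract_patches(img, k):
    """Return every k x k patch of a 2-D image as a row of a matrix."""
    h, w = img.shape
    rows = [img[i:i + k, j:j + k].ravel()
            for i in range(h - k + 1) for j in range(w - k + 1)]
    return np.asarray(rows)

def train_linear_svm(X, y, lam=0.01, epochs=100, lr=0.1):
    """Fit a linear SVM (hinge loss + L2 penalty) by subgradient descent.

    y must contain labels in {-1, +1}. Hypothetical helper, not a library API.
    """
    w = np.zeros(X.shape[1])
    b = 0.0
    n = len(y)
    for _ in range(epochs):
        active = y * (X @ w + b) < 1  # samples that violate the margin
        grad_w = lam * w - (y[active][:, None] * X[active]).sum(axis=0) / n
        grad_b = -y[active].sum() / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

def svm_feature_map(img, w, b, k):
    """Slide the SVM over the image; rectified decision values form the map."""
    h, wd = img.shape
    scores = extract_patches(img, k) @ w + b
    return np.maximum(scores, 0.0).reshape(h - k + 1, wd - k + 1)

# Synthetic data (assumption): "object present" images are bright,
# "object absent" images are dark, both with small Gaussian noise.
rng = np.random.default_rng(0)
k = 3
pos_imgs = 1.0 + 0.05 * rng.standard_normal((4, 8, 8))
neg_imgs = 0.05 * rng.standard_normal((4, 8, 8))

# Train on the first three images of each class; patches inherit image labels.
X = np.vstack([extract_patches(im, k) for im in pos_imgs[:3]] +
              [extract_patches(im, k) for im in neg_imgs[:3]])
n_per_class = 3 * (8 - k + 1) ** 2
y = np.concatenate([np.ones(n_per_class), -np.ones(n_per_class)])

w, b = train_linear_svm(X, y)

# Apply the learned SVM filter to the held-out image of each class.
fmap_pos = svm_feature_map(pos_imgs[3], w, b, k)
fmap_neg = svm_feature_map(neg_imgs[3], w, b, k)
print(fmap_pos.shape, fmap_pos.mean(), fmap_neg.mean())
```

In a multi-branch, multi-label setting, one such filter bank would be trained per object label, with the resulting maps feeding a per-label binary SVM; how the branches are fused is specific to the paper and is not reproduced here.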


Pages: 13
