Semantic scene understanding on mobile device with illumination invariance for the visually impaired

Cited by: 0

Authors:

Xu, Chengyou [1]

Wang, Kaiwei [1]

Yang, Kailun [1]

Cheng, Ruiqi [1]

Bai, Jian [1]

Affiliation:

[1] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou 310027, Peoples R China

Source:

ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING IN DEFENSE APPLICATIONS | 2019, Vol. 11169

Keywords:

Semantic segmentation; robustness; illumination invariance; scene understanding; mobile device

DOI:

10.1117/12.2532550

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

Visually Impaired People (VIP) find it very difficult to perceive their surroundings. To address this problem, we propose a scene understanding system to aid VIP in indoor and outdoor environments. Semantic segmentation performance is generally sensitive to changes in environment and illumination, including the transition between indoor and outdoor scenes and variation across weather conditions. Meanwhile, most existing methods focus on either accuracy or efficiency rather than on a balance between the two. In the proposed system, the training dataset is preprocessed with an illumination-invariant transformation to weaken the impact of illumination changes and improve the robustness of the semantic segmentation network. As for the network structure, lightweight networks such as MobileNetV2 and ShuffleNet V2 are employed as the backbone of DeepLabv3+ to improve accuracy with little additional computation, which makes the system suitable for mobile assistive devices. We evaluate the robustness of the segmentation model across different environments on the Gardens Point Walking dataset and demonstrate the strong positive effect of the illumination-invariant pre-transformation in a challenging real-world domain. Trained on a computer, the network achieves relatively high accuracy on ADE20K relabeled into 20 classes. The frame rate of the proposed system reaches up to 83 FPS on a 1080Ti GPU.
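
The abstract does not state which illumination-invariant transformation is applied to the training data. As a rough illustration only, the sketch below uses one common single-channel log-chromaticity formulation, I = 0.5 + log(G) - alpha * log(B) - (1 - alpha) * log(R). The function name `illumination_invariant`, the default `alpha = 0.48`, and the `eps` clamp are assumptions made for this example and are not taken from the paper; in practice `alpha` depends on the camera's spectral response.

```python
import numpy as np


def illumination_invariant(rgb, alpha=0.48, eps=1e-6):
    """Map an H x W x 3 RGB image to a single-channel log-chromaticity image.

    Hypothetical helper for illustration; alpha is camera dependent and the
    value used here is an assumption, not a parameter reported in the abstract.
    """
    rgb = np.asarray(rgb, dtype=np.float64)
    if rgb.ndim != 3 or rgb.shape[-1] != 3:
        raise ValueError("expected an array of shape (H, W, 3)")
    # Clamp to a small positive value so that log() is defined for black pixels.
    rgb = np.maximum(rgb, eps)
    r, g, b = rgb[..., 0], rgb[..., 1], rgb[..., 2]
    # The log weights sum to zero (1 - alpha - (1 - alpha)), so a uniform
    # brightness scaling of all three channels cancels out.
    return 0.5 + np.log(g) - alpha * np.log(b) - (1.0 - alpha) * np.log(r)


# Usage: a synthetic image and a darker copy map to the same output.
rng = np.random.default_rng(0)
image = rng.uniform(0.1, 1.0, size=(4, 5, 3))
darker = 0.5 * image
same = np.allclose(illumination_invariant(image), illumination_invariant(darker))
print("invariant to global brightness scaling:", same)
```

Because the three log weights cancel, the output is unchanged when every channel is scaled by the same factor, which is the kind of property that would make a segmentation network trained on such images less sensitive to overall brightness. Real illumination changes also shift colour, so this is only a partial model of the effect described in the abstract.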


Pages: 9
