Semantic scene understanding on mobile device with illumination invariance for the visually impaired

被引:0
|
作者
Xu, Chengyou [1 ]
Wang, Kaiwei [1 ]
Yang, Kailun [1 ]
Cheng, Ruiqi [1 ]
Bai, Jian [1 ]
机构
[1] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou 310027, Peoples R China
关键词
Semantic segmentation; robustness; illumination invariance; scene understanding; mobile device;
D O I
10.1117/12.2532550
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For Visually Impaired People (VIP), it's very difficult to perceive their surroundings. To address this problem, we propose a scene understanding system to aid VIP in indoor and outdoor environments. Semantic segmentation performance is generally sensitive to the environment and illumination changes, including the change between indoor and outdoor environments and the change across different weather conditions. Meanwhile, most existing methods have paid more attention on either the accuracy or the efficiency, instead of the balance between both of them. In the proposed system, the training dataset is preprocessed by using an illumination-invariant transformation to weaken the impact of illumination changes and improve the robustness of the semantic segmentation network. Regarding the structure of semantic segmentation network, the lightweight networks such as MobileNetV2 and ShuffleNet V2 are employed as the backbone of DeepLabv3+ to improve the accuracy with little increasing of computation, which is suitable for mobile assistance device. We evaluate the robustness of the segmentation model across different environments on the Gardens Point Walking dataset, and demonstrate the extremely positive effect of the illumination-invariant pre-transformation in challenging real-world domain. The network trained on computer achieves a relatively high accuracy on ADE20K relabeled into 20 classes. The frame rate of the proposed system is up to 83 FPS on a 1080Ti GPU.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Dynamic Crosswalk Scene Understanding for the Visually Impaired
    Tian, Shishun
    Zheng, Minghuo
    Zou, Wenbin
    Li, Xia
    Zhang, Lu
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2021, 29 : 1478 - 1486
  • [2] Semantic Traffic Light Understanding for Visually Impaired Pedestrian
    Pongseesai, Chanagan
    Chamnongthai, Kosin
    2019 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2019,
  • [3] Mobile Reader: Turkish Scene Text Reader for the Visually Impaired
    Kandemir, Hilal
    Canturk, Busra
    Bastan, Muhammet
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1857 - 1860
  • [4] Stabilization of Magnified Videos on a Mobile Device for Visually Impaired
    Li, Zewen
    Pundlik, Shrinivas
    Luo, Gang
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 54 - +
  • [5] Mobile device accessibility for the visually impaired: problems mapping and recommendations
    Pezzuto Damaceno, Rafael Jeferson
    Braga, Juliana Cristina
    Mena-Chalco, Jesus Pascual
    UNIVERSAL ACCESS IN THE INFORMATION SOCIETY, 2018, 17 (02) : 421 - 435
  • [6] Mobile device accessibility for the visually impaired: problems mapping and recommendations
    Rafael Jeferson Pezzuto Damaceno
    Juliana Cristina Braga
    Jesús Pascual Mena-Chalco
    Universal Access in the Information Society, 2018, 17 : 421 - 435
  • [7] Deep Learning Based Mobile Assistive Device for Visually Impaired People
    Lee, Chan-Su
    Lee, Jae-Ik
    Seo, Han Eol
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-ASIA (ICCE-ASIA), 2021,
  • [8] Ego-Semantic Labeling of Scene from Depth Image for Visually Impaired and Blind People
    Zatout, Chayma
    Larabi, Slimane
    Mendili, Ilyes
    Barnabe, Soedji Ablam Edoh
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4376 - 4384
  • [9] Building a Color Recognizer System on the Smart Mobile Device for the Visually Impaired People
    Lee, Hsiao Ping
    Huang, Jun-Te
    Chen, Chien-Hsing
    Sheu, Tzu-Fang
    SIXTH INTERNATIONAL MULTI-CONFERENCE ON COMPUTING IN THE GLOBAL INFORMATION TECHNOLOGY (ICCGI 2011), 2011, : 95 - 98
  • [10] Scene description for visually impaired in outdoor environment
    Quoc-Hung Nguyen
    Thanh-Hai Tran
    2013 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC), 2013, : 398 - 403