Semantic scene understanding on mobile device with illumination invariance for the visually impaired

被引:0
|
作者
Xu, Chengyou [1 ]
Wang, Kaiwei [1 ]
Yang, Kailun [1 ]
Cheng, Ruiqi [1 ]
Bai, Jian [1 ]
机构
[1] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou 310027, Peoples R China
关键词
Semantic segmentation; robustness; illumination invariance; scene understanding; mobile device;
D O I
10.1117/12.2532550
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For Visually Impaired People (VIP), it's very difficult to perceive their surroundings. To address this problem, we propose a scene understanding system to aid VIP in indoor and outdoor environments. Semantic segmentation performance is generally sensitive to the environment and illumination changes, including the change between indoor and outdoor environments and the change across different weather conditions. Meanwhile, most existing methods have paid more attention on either the accuracy or the efficiency, instead of the balance between both of them. In the proposed system, the training dataset is preprocessed by using an illumination-invariant transformation to weaken the impact of illumination changes and improve the robustness of the semantic segmentation network. Regarding the structure of semantic segmentation network, the lightweight networks such as MobileNetV2 and ShuffleNet V2 are employed as the backbone of DeepLabv3+ to improve the accuracy with little increasing of computation, which is suitable for mobile assistance device. We evaluate the robustness of the segmentation model across different environments on the Gardens Point Walking dataset, and demonstrate the extremely positive effect of the illumination-invariant pre-transformation in challenging real-world domain. The network trained on computer achieves a relatively high accuracy on ADE20K relabeled into 20 classes. The frame rate of the proposed system is up to 83 FPS on a 1080Ti GPU.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Tactile Display Device for Visually Impaired Persons
    Seo, Kap-Ho
    Park, Yongsik
    Yun, Sungjo
    Park, Sungho
    Jun, Jungsoo
    Jeon, Kwang-Woo
    2013 10TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2013, : 749 - 754
  • [32] Sonar Assistive Device for Visually Impaired People
    Kamaludin, Muhammad Haziq
    Mahmood, Nasrul Humaimi
    Ahmad, Abd Hamid
    Omar, Camallil
    Yusof, Masdinah Alauyah Md
    JURNAL TEKNOLOGI, 2015, 73 (06):
  • [33] Detection of Exercise and Cooking Scene for Assitance of Visually Impaired People
    Bhatlawande, Shripad
    Shilaskar, Swati
    Abhyankar, Anant
    Ahire, Mahesh
    Chadgal, Ankush
    Madake, Jyoti
    PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2022, 2023, 475 : 493 - 508
  • [34] Rainy Night Scene Understanding With Near Scene Semantic Adaptation
    Di, Shuai
    Feng, Qi
    Li, Chun-Guang
    Zhang, Mei
    Zhang, Honggang
    Elezovikj, Semir
    Tan, Chiu C.
    Ling, Haibin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (03) : 1594 - 1602
  • [35] Color learning and illumination invariance on mobile robots: A survey
    Sridharan, Mohan
    Stone, Peter
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2009, 57 (6-7) : 629 - 644
  • [36] Illumination Invariance for Driving Scene Optical Flow using Comparagram Preselection
    Dederscheck, David
    Mueller, Thomas
    Mester, Rudolf
    2012 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2012, : 742 - 747
  • [37] A Wearable Navigation Device for Visually Impaired People Based on the Real-Time Semantic Visual SLAM System
    Chen, Zhuo
    Liu, Xiaoming
    Kojima, Masaru
    Huang, Qiang
    Arai, Tatsuo
    SENSORS, 2021, 21 (04) : 1 - 14
  • [38] DESIGN OF THE DUAL CONSTELLATION GPS/GALILEO MOBILE DEVICE FOR IMPROVING NAVIGATION OF THE VISUALLY IMPAIRED IN AN URBAN AREA
    Drypczewski, K.
    Moszynski, M.
    Demkowicz, J.
    Bikonis, K.
    Stepnowski, A.
    POLISH MARITIME RESEARCH, 2015, 22 (04) : 15 - 20
  • [39] Indoor Scene Understanding with Geometric and Semantic Contexts
    Choi, Wongun
    Chao, Yu-Wei
    Pantofaru, Caroline
    Savarese, Silvio
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 112 (02) : 204 - 220
  • [40] The Cityscapes Dataset for Semantic Urban Scene Understanding
    Cordts, Marius
    Omran, Mohamed
    Ramos, Sebastian
    Rehfeld, Timo
    Enzweiler, Markus
    Benenson, Rodrigo
    Franke, Uwe
    Roth, Stefan
    Schiele, Bernt
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3213 - 3223