Semantic scene understanding on mobile device with illumination invariance for the visually impaired

被引:0
|
作者
Xu, Chengyou [1 ]
Wang, Kaiwei [1 ]
Yang, Kailun [1 ]
Cheng, Ruiqi [1 ]
Bai, Jian [1 ]
机构
[1] Zhejiang Univ, State Key Lab Modern Opt Instrumentat, Hangzhou 310027, Peoples R China
关键词
Semantic segmentation; robustness; illumination invariance; scene understanding; mobile device;
D O I
10.1117/12.2532550
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For Visually Impaired People (VIP), it's very difficult to perceive their surroundings. To address this problem, we propose a scene understanding system to aid VIP in indoor and outdoor environments. Semantic segmentation performance is generally sensitive to the environment and illumination changes, including the change between indoor and outdoor environments and the change across different weather conditions. Meanwhile, most existing methods have paid more attention on either the accuracy or the efficiency, instead of the balance between both of them. In the proposed system, the training dataset is preprocessed by using an illumination-invariant transformation to weaken the impact of illumination changes and improve the robustness of the semantic segmentation network. Regarding the structure of semantic segmentation network, the lightweight networks such as MobileNetV2 and ShuffleNet V2 are employed as the backbone of DeepLabv3+ to improve the accuracy with little increasing of computation, which is suitable for mobile assistance device. We evaluate the robustness of the segmentation model across different environments on the Gardens Point Walking dataset, and demonstrate the extremely positive effect of the illumination-invariant pre-transformation in challenging real-world domain. The network trained on computer achieves a relatively high accuracy on ADE20K relabeled into 20 classes. The frame rate of the proposed system is up to 83 FPS on a 1080Ti GPU.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Wearable Messaging Device for Visually Impaired Person
    Kumar, Suresh C.
    Julian, Anitha
    2014 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2014, : 994 - 998
  • [22] Interactive Reader Device for Visually Impaired People
    Motto Ros, Paolo
    Paseroa, Eros
    Del Giudice, Paolo
    Dante, Vittorio
    Petetti, Erminio
    NEURAL NETS WIRN09, 2009, 204 : 306 - 313
  • [23] The Talking Color Identifying Device for the Visually Impaired
    Mungkaruna, Preeyada
    Ropkhop, Kittiphong
    Piyawongwisal, Pratch
    Hatthasin, Upady
    2016 13TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2016,
  • [24] Adaptation of a piano tuning device for the visually impaired
    Davis, RL
    Sun, Y
    Sanderson, PL
    2005 IEEE 31ST ANNUAL NORTHEAST BIOENGINEERING CONFERENCE, 2005, : 85 - 86
  • [25] A Navigation Device with Voice for Visually Impaired People
    Nehal, S. K.
    Obheroi, Rajat Kumar
    Anand, Ajay
    Rana, Haneet
    INTELLIGENT COMMUNICATION, CONTROL AND DEVICES, ICICCD 2017, 2018, 624 : 1369 - 1380
  • [26] Ultrasonic Echolocation Device for Assisting the Visually Impaired
    Mick, Ben
    Reddmann, Nathan
    Manwar, Rayyan
    Avanaki, Mohammad R. N.
    CURRENT MEDICAL IMAGING, 2020, 16 (05) : 601 - 610
  • [27] Information presentation device realizing assistance of active understanding for visually-impaired people.
    Une, Y
    Haraikawa, T
    Sakane, Y
    Takebayashi, Y
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON ACTIVE MEDIA TECHNOLOGY (AMT 2005), 2005, : 22 - 27
  • [28] Assistive mobile application for visually impaired people
    Nayak S.
    Chandrakala C.B.
    Chandrakala, C.B. (chandrakala.cb@manipal.edu), 1600, International Association of Online Engineering (14): : 52 - 69
  • [29] Mobile Crowd Assisted Navigation for the Visually Impaired
    Olmschenk, Greg
    Yang, Christopher
    Zhu, Zhigang
    Tong, Hanghang
    Seiple, William H.
    IEEE 12TH INT CONF UBIQUITOUS INTELLIGENCE & COMP/IEEE 12TH INT CONF ADV & TRUSTED COMP/IEEE 15TH INT CONF SCALABLE COMP & COMMUN/IEEE INT CONF CLOUD & BIG DATA COMP/IEEE INT CONF INTERNET PEOPLE AND ASSOCIATED SYMPOSIA/WORKSHOPS, 2015, : 324 - 327
  • [30] moBraille: Mobile Framework for Visually Impaired Users
    Kalmac, Hakan
    Diri, Banu
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 1477 - 1480