Deep learning based object detection and surrounding environment description for visually impaired people

被引:4
|
作者
Bin Islam, Raihan [1 ]
Akhter, Samiha [1 ]
Iqbal, Faria [1 ]
Rahman, Saif Ur
Khan, Riasat [1 ]
机构
[1] North South Univ, Elect & Comp Engn, Dhaka, Bangladesh
关键词
Assistive technologies; Machine learning; Mean average precision; Object detection; Random forest classifier; SSD MobileNet; Text-to-speech; SYSTEM;
D O I
10.1016/j.heliyon.2023.e16924
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Object detection, one of the most significant contributions of computer vision and machine learning, plays an immense role in identifying and locating objects in an image or a video. We recognize distinct objects and precisely get their information through object detection, such as their size, shape, and location. This paper developed a low-cost assistive system of obstacle detection and the surrounding environment depiction to help blind people using deep learning techniques. TensorFlow object detection API and SSDLite MobileNetV2 have been used to create the proposed object detection model. The pre-trained SSDLite MobileNetV2 model is trained on the COCO dataset, with almost 328,000 images of 90 different objects. The gradient particle swarm optimization (PSO) technique has been used in this work to optimize the final layers and their corresponding hyperparameters of the MobileNetV2 model. Next, we used the Google textto-speech module, PyAudio, playsound, and speech recognition to generate the audio feedback of the detected objects. A Raspberry Pi camera captures real-time video where real-time object detection is done frame by frame with Raspberry Pi 4B microcontroller. The proposed device is integrated into a head cap, which will help visually impaired people to detect obstacles in their path, as it is more efficient than a traditional white cane. Apart from this detection model, we trained a secondary computer vision model and named it the "ambiance mode." In this mode, the last three convolutional layers of SSDLite MobileNetV2 are trained through transfer learning on a weather dataset. The dataset comprises around 500 images from four classes: cloudy, rainy, foggy, and sunrise. In this mode, the proposed system will narrate the surrounding scene elaborately, almost like a human describing a landscape or a beautiful sunset to a visually impaired person. The performance of the object detection and ambiance description modes are tested and evaluated in a desktop computer and Raspberry Pi embedded system. Detection accuracy and mean average precision, frame rate, confusion matrix, and ROC curve measure the model's accuracy on both setups. This low-cost proposed system is believed to help visually impaired people in their dayto-day life.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Multisensor - based Object Detection in Indoor Environment for Visually Impaired People
    Patel, Charmi T.
    Mistry, Vaidehi J.
    Desai, Laxmi S.
    Meghrajani, Yogesh K.
    [J]. PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1023 - 1026
  • [2] A Frame-work assisting the Visually Impaired People: Common Object Detection and Pose Estimation in Surrounding Environment
    Van-Hung Le
    Hai Vu
    Thuy Thi Nguyen
    [J]. PROCEEDINGS OF 2018 5TH NAFOSTED CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS 2018), 2018, : 216 - 221
  • [3] Object detection and recognition: using deep learning to assist the visually impaired
    Bhandari, Abinash
    Prasad, P. W. C.
    Alsadoon, Abeer
    Maag, Angelika
    [J]. DISABILITY AND REHABILITATION-ASSISTIVE TECHNOLOGY, 2021, 16 (03) : 280 - 288
  • [4] Object Detection to Assist Visually Impaired People: A Deep Neural Network Adventure
    Bashiri, Fereshteh S.
    LaRose, Eric
    Badger, Jonathan C.
    D'Souza, Roshan M.
    Yu, Zeyun
    Peissig, Peggy
    [J]. ADVANCES IN VISUAL COMPUTING, ISVC 2018, 2018, 11241 : 500 - 510
  • [5] The development of assisted- visually impaired people robot in the indoor environment based on deep learning
    Hsieh, Yi-Zeng
    Ku, Xiang-Long
    Lin, Shih-Syun
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 6555 - 6578
  • [6] The development of assisted- visually impaired people robot in the indoor environment based on deep learning
    Yi-Zeng Hsieh
    Xiang-Long Ku
    Shih-Syun Lin
    [J]. Multimedia Tools and Applications, 2024, 83 : 6555 - 6578
  • [7] Vision Connect: A Smartphone Based Object Detection for Visually Impaired People
    Ramalingam, Devakunchari
    Tiwari, Swapnil
    Seth, Harsh
    [J]. COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 863 - 870
  • [8] Obstacle Detection System for Navigation Assistance of Visually Impaired People Based on Deep Learning Techniques
    Said, Yahia
    Atri, Mohamed
    Albahar, Marwan Ali
    Ben Atitallah, Ahmed
    Alsariera, Yazan Ahmad
    [J]. SENSORS, 2023, 23 (11)
  • [9] Deep Learning Based Audio Assistive System for Visually Impaired People
    Devi, S. Kiruthika
    Subalalitha, C. N.
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (01): : 1205 - 1219
  • [10] Deep Learning Based Mobile Assistive Device for Visually Impaired People
    Lee, Chan-Su
    Lee, Jae-Ik
    Seo, Han Eol
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-ASIA (ICCE-ASIA), 2021,