A hybrid object detection approach for visually impaired persons using pigeon-inspired optimization and deep learning models

被引:0
|
作者
Alashjaee, Abdullah M. [1 ]
Aleisa, Hussah Nasser [2 ]
Darem, Abdulbasit A. [3 ,4 ]
Marzouk, Radwa [5 ]
机构
[1] Northern Border Univ, Coll Sci, Dept Comp Sci, Ar Ar, Saudi Arabia
[2] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Comp Sci, Riyadh, Saudi Arabia
[3] Northern Border Univ, Ctr Sci Res & Entrepreneurship, Ar Ar 73213, Saudi Arabia
[4] King Salman Ctr Disabil Res, Riyadh 11614, Saudi Arabia
[5] Cairo Univ, Fac Sci, Dept Math, Giza 12613, Egypt
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
关键词
Object detection; Visually impaired persons; Deep learning; Pigeon-inspired optimization; Feature extraction;
D O I
10.1038/s41598-025-92239-7
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Visually challenged persons include a significant part of the population, and they exist all over the globe. Recently, technology has demonstrated its occurrence in each field, and state-of-the-art devices aid humans in their everyday lives. However, visually impaired people cannot view things around their atmospheres; they can only imagine the roaming surroundings. Furthermore, web-based applications are advanced to certify their protection. Using the application, the consumer can spin the requested task to share her/his position with the family members while threatening confidentiality. Through this application, visually challenged people's family members can follow their actions (acquire snapshots and position) while staying at their residences. A deep learning (DL) technique is trained with manifold images of entities highly related to the VIPs. Training images are amplified and physically interpreted to bring more strength to the trained method. This study proposes a Hybrid Approach to Object Detection for Visually Impaired Persons Using Attention-Driven Deep Learning (HAODVIP-ADL) technique. The major intention of the HAODVIP-ADL technique is to deliver a reliable and precise object detection system that supports the visually impaired person in navigating their surroundings safely and effectively. The presented HAODVIP-ADL method initially utilizes bilateral filtering (BF) for the image pre-processing stage to reduce noise while preserving edges for clarity. For object detection, the HAODVIP-ADL method employs the YOLOv10 framework. In addition, the backbone fusion of feature extraction models such as CapsNet and InceptionV3 is implemented to capture diverse spatial and contextual information. The bi-directional long short-term memory and multi-head attention (MHA-BiLSTM) approach is utilized to classify the object detection process. Finally, the hyperparameter tuning process is performed using the pigeon-inspired optimization (PIO) approach to advance the classification performance of the MHA-BiLSTM approach. The experimental results of the HAODVIP-ADL method are analyzed, and the outcomes are evaluated using the Indoor Objects Detection dataset. The experimental validation of the HAODVIP-ADL method portrayed a superior accuracy value of 99.74% over the existing methods.
引用
收藏
页数:23
相关论文
共 50 条