An Improved Indoor Navigation Scheme Based on Vision-Language Localization

被引:4
|
作者
Xu, Ziheng [1 ]
Jia, Zixi [1 ]
Zhou, Xuegang [1 ]
Wen, Huan [1 ]
Li, Yanan [1 ]
机构
[1] Northeastern Univ, Fac Robot Sci & Engn, Shenyang 110819, Peoples R China
基金
中国国家自然科学基金;
关键词
Computer Vision; Natural Language Processing; Indoor Localization; Bidirectional A* Algorithm;
D O I
10.1109/CCDC52312.2021.9601828
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes an improved indoor localization and navigation method based on image matching and text extraction. Compared with traditional indoor localization methods based on WIFI, Bluetooth or laser, the improved method is mainly realized by image matching in computer vision and text extraction in natural language processing. The improved bidirectional A* algorithm is adopted to realize the optimal path planning, and the specific location of indoor localization and path planning diagrams are displayed through an app. Experimental results show that this method which can realize indoor location determination in real scenes, is convenient to use, and has low cost. It can overcome the problems of high hardware requirements, high cost, and poor transferability in the traditional indoor navigation field in terms of localization.
引用
收藏
页码:1047 / 1051
页数:5
相关论文
共 50 条
  • [31] Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation
    Wang, Hanqing
    Liang, Wei
    Shen, Jianbing
    Van Gool, Luc
    Wang, Wenguan
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 15450 - 15460
  • [32] Improved Speaker and Navigator for Vision-and-Language Navigation
    Wu, Zongkai
    Liu, Zihan
    Wang, Ting
    Wang, Donglin
    [J]. IEEE MULTIMEDIA, 2021, 28 (04) : 55 - 63
  • [33] Image as a Foreign Language: BEIT Pretraining for Vision and Vision-Language Tasks
    Wang, Wenhui
    Bao, Hangbo
    Dong, Li
    Bjorck, Johan
    Peng, Zhiliang
    Liu, Qiang
    Aggarwal, Kriti
    Mohammed, Owais Khan
    Singhal, Saksham
    Som, Subhojit
    Wei, Furu
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19175 - 19186
  • [34] Causal Attention for Vision-Language Tasks
    Yang, Xu
    Zhang, Hanwang
    Qi, Guojun
    Cai, Jianfei
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 9842 - 9852
  • [35] Panoptic Vision-Language Feature Fields
    Chen, Haoran
    Blomqvist, Kenneth
    Milano, Francesco
    Siegwart, Roland
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (03) : 2144 - 2151
  • [36] Learning to Prompt for Vision-Language Models
    Kaiyang Zhou
    Jingkang Yang
    Chen Change Loy
    Ziwei Liu
    [J]. International Journal of Computer Vision, 2022, 130 : 2337 - 2348
  • [37] Learning to Prompt for Vision-Language Models
    Zhou, Kaiyang
    Yang, Jingkang
    Loy, Chen Change
    Liu, Ziwei
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (09) : 2337 - 2348
  • [38] Masked Vision-language Transformer in Fashion
    Ji, Ge-Peng
    Zhuge, Mingchen
    Gao, Dehong
    Fan, Deng-Ping
    Sakaridis, Christos
    Gool, Luc Van
    [J]. MACHINE INTELLIGENCE RESEARCH, 2023, 20 (03) : 421 - 434
  • [39] Masked Vision-language Transformer in Fashion
    Ge-Peng Ji
    Mingchen Zhuge
    Dehong Gao
    Deng-Ping Fan
    Christos Sakaridis
    Luc Van Gool
    [J]. Machine Intelligence Research, 2023, 20 : 421 - 434
  • [40] Vision based navigation for mobile robots in indoor environment by teaching and playing-back scheme
    Tang, LX
    Yuta, S
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2001, : 3072 - 3077