ChitChatGuide: Conversational Interaction Using Large Language Models for Assisting People with Visual Impairments to Explore a Shopping Mall

被引:0
|
作者
Kaniwa, Yuka [1 ]
Kuribayashi, Masaki [1 ]
Kayukawa, Seita [2 ]
Sato, Daisuke [3 ]
Takagi, Hironobu [2 ]
Asakawa, Chieko [4 ,5 ]
Morishima, Shigeo [6 ]
机构
[1] Waseda University, Tokyo, Japan
[2] IBM Research -Tokyo, Tokyo, Japan
[3] Robotics Institute, Carnegie Mellon University, Pittsburgh,PA, United States
[4] Miraikan -The National Museum of Emerging Science and Innovation, Tokyo, Japan
[5] IBM Research, Yorktown Heights,NY, United States
[6] Waseda Research Institute for Science and Engineering, Tokyo, Japan
关键词
Shopping centers;
D O I
10.1145/3676492
中图分类号
学科分类号
摘要
To enable people with visual impairments (PVI) to explore shopping malls, it is important to provide information for selecting destinations and obtaining information based on the individual's interests. We achieved this through conversational interaction by integrating a large language model (LLM) with a navigation system. ChitChatGuide allows users to plan a tour through contextual conversations, receive personalized descriptions of surroundings based on transit time, and make inquiries during navigation. We conducted a study in a shopping mall with 11 PVI, and the results reveal that the system allowed them to explore the facility with increased enjoyment. The LLM-based conversational interaction, by understanding vague and context-based questions, enabled the participants to explore unfamiliar environments effectively. The personalized and in-situ information generated by the LLM was both useful and enjoyable. Considering the limitations we identified, we discuss the criteria for integrating LLMs into navigation systems to enhance the exploration experiences of PVI. © 2024 ACM.
引用
收藏
相关论文
共 20 条
  • [1] Enabling Conversational Interaction with Mobile UI using Large Language Models
    Wang, Bryan
    Li, Gang
    Li, Yang
    PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2023, 2023,
  • [2] Assisting people with visual impairments in aiming at a target on a large wall-mounted display
    Kim, Kibum
    Ren, Xiangshi
    Choi, Seungmoon
    Tan, Hong Z.
    INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2016, 86 : 109 - 120
  • [3] Conversational Agents for Dementia using Large Language Models
    Favela, Jesus
    Cruz-Sandoval, Dagoberto
    Parra, Mario O.
    2023 MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE, ENC, 2024,
  • [4] VIAssist: Adapting Multi-modal Large Language Models for Users with Visual Impairments
    Yang, Bang
    He, Lixing
    Liu, Kaiwei
    Yan, Zhenyu
    PROCEEDINGS 2024 IEEE INTERNATIONAL WORKSHOP ON FOUNDATION MODELS FOR CYBER-PHYSICAL SYSTEMS & INTERNET OF THINGS, FMSYS 2024, 2024, : 32 - 37
  • [5] Towards Automatic Evaluation of NLG Tasks Using Conversational Large Language Models
    Riyadh, Md
    Shafiq, M. Omair
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2023, PT II, 2023, 676 : 425 - 437
  • [6] A Design of Interface for Visual-Impaired People to Access Visual Information from Images Featuring Large Language Models and Visual Language Models
    Zhang, Zhe-Xin
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [7] Extracting Implicit User Preferences in Conversational Recommender Systems Using Large Language Models
    Kim, Woo-Seok
    Lim, Seongho
    Kim, Gun-Woo
    Choi, Sang-Min
    MATHEMATICS, 2025, 13 (02)
  • [8] LEVA: Using Large Language Models to Enhance Visual Analytics
    Zhao, Yuheng
    Zhang, Yixing
    Zhang, Yu
    Zhao, Xinyi
    Wang, Junjie
    Shao, Zekai
    Turkay, Cagatay
    Chen, Siming
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, 31 (03) : 1830 - 1847
  • [9] Analysis of Localization and Space Interaction Models. Proposal of very good linked model applied to a study area for the localization of a large shopping mall
    Mazzei, Mauro
    Palma, Armando Luigi
    GEOMEDIA, 2014, 18 (02) : 265 - 275
  • [10] Building Trust in Conversational AI: A Review and Solution Architecture Using Large Language Models and Knowledge Graphs
    Zafar, Ahtsham
    Parthasarathy, Venkatesh Balavadhani
    Le Van, Chan
    Shahid, Saad
    Khan, Aafaq Iqbal
    Shahid, Arsalan
    BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (06)