Can Large Language Models Be Good Companions? An LLM-Based Eyewear System with Conversational Common Ground

被引：0

作者：

Xu, Zhenyu ^{[1
]}

Xu, Hailin ^{[1
]}

Lu, Zhouyang ^{[1
]}

Zhao, Yingying ^{[2
]}

Zhu, Rui ^{[3
]}

Wang, Yujiang ^{[4
]}

Dong, Mingzhi ^{[1
]}

Chang, Yuhu ^{[1
]}

Lv, Qin ^{[5
]}

Dick, Robert P. ^{[6
]}

Yang, Fan ^{[7
]}

Lu, Tun ^{[1
]}

Gu, Ning ^{[1
]}

Shang, Li ^{[1
]}

机构：

[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China

[2] Univ Strathclyde, Dept Comp & Informat Sci, Glasgow, Lanark, Scotland

[3] City Univ London, Bayes Business Sch, London, England

[4] Oxford Suzhou Ctr Adv Res, Suzhou, Peoples R China

[5] Univ Colorado Boulder, Dept Comp Sci, Boulder, CO USA

[6] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI USA

[7] Fudan Univ, Sch Microelect, Shanghai, Peoples R China

来源：

PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT | 2024年 / 8卷 / 02期

关键词：

Smart eyewear; large language model; common ground; context-aware;

D O I：

10.1145/3659600

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Developing chatbots as personal companions has long been a goal of artificial intelligence researchers. Recent advances in Large Language Models (LLMs) have delivered a practical solution for endowing chatbots with anthropomorphic language capabilities. However, it takes more than LLMs to enable chatbots that can act as companions. Humans use their understanding of individual personalities to drive conversations. Chatbots also require this capability to enable human-like companionship. They should act based on personalized, real-time, and time-evolving knowledge of their users. We define such essential knowledge as the common ground between chatbots and their users, and we propose to build a common-ground-aware dialogue system from an LLM-based module, named OS-1, to enable chatbot companionship. Hosted by eyewear, OS-1 can sense the visual and audio signals the user receives and extract real-time contextual semantics. Those semantics are categorized and recorded to formulate historical contexts from which the user's profile is distilled and evolves over time, i.e., OS-1 gradually learns about its user. OS-1 combines knowledge from real-time semantics, historical contexts, and user-specific profiles to produce a common-ground-aware prompt input into the LLM module. The LLM's output is converted to audio, spoken to the wearer when appropriate. We conduct laboratory and in-field studies to assess OS-1's ability to build common ground between the chatbot and its user. The technical feasibility and capabilities of the system are also evaluated. Our results show that by utilizing personal context, OS-1 progressively develops a better understanding of its users. This enhances user satisfaction and potentially leads to various personal service scenarios, such as emotional support and assistance.

引用

页数：41

共 45 条

[31] Letter to the Editor: Value-based Healthcare: Can Generative Artificial Intelligence and Large Language Models be a Catalyst for Value-based Healthcare?
Porter, Matt A.
CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2024, 482 (05) : 901 - 904
[32] An Embodied Intelligence System for Coal Mine Safety Assessment Based on Multi-Level Large Language Models
Sun, Yi
Ji, Faxiu
SENSORS, 2025, 25 (02)
[33] Intelligent design and optimization system for shear wall structures based on large language models and generative artificial intelligence
Qin, Sizhong
Guan, Hong
Liao, Wenjie
Gu, Yi
Zheng, Zhe
Xue, Hongjing
Lu, Xinzheng
JOURNAL OF BUILDING ENGINEERING, 2024, 95
[34] Building a hospitable and reliable dialogue system for android robots: a scenario-based approach with large language models
Yamazaki, Takato
Yoshikawa, Katsumasa
Kawamoto, Toshiki
Mizumoto, Tomoya
Ohagi, Masaya
Sato, Toshinori
ADVANCED ROBOTICS, 2023, 37 (21) : 1364 - 1381
[35] Reply to the Letter to the Editor: Value-based Healthcare: Can Generative Artificial Intelligence and Large Language Models be a Catalyst for Value-based Healthcare?
Jayakumar, Prakash
CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2024, 482 (05) : 905 - 906
[36] Research on Engineering Management Question-answering System in the Communication Industry Based on Large Language Models and Knowledge Graphs
Jiang, Yingdi
Yao, Jiarui
Li, Fangfei
Zhang, Yan
PROCEEDINGS OF THE 2024 THE 7TH INTERNATIONAL CONFERENCE ON MACHINE VISION AND APPLICATIONS, ICMVA 2024, 2024, : 100 - 105
[37] Keep Eyes on the Sentence: An Interactive Sentence Simplification System for English Learners Based on Eye Tracking and Large Language Models
Higasa, Taichi
Tanaka, Keitaro
Feng, Qi
Morishima, Shigeo
EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
[38] iKnowiSee: AR Glasses with Language Learning Translation System and Identity Recognition System Built Based on Large Pre-trained Models of Language and Vision and Internet of Things Technology
Liang, Qiwei
Chen, Yikeng
Li, Wenbiao
Lai, Minghao
Ni, Wenjian
Qiu, Hong
INTELLIGENT NETWORKED THINGS, CINT 2024, PT II, 2024, 2139 : 12 - 24
[39] Can natural language processing or large language models replace human operators for pre-processing word and sentence-based free comments sensory evaluation data?
Visalli, Michel
Symoneaux, Ronan
Mursic, Cecile
Touret, Margaux
Lourtioux, Flore
Coulibaly, Kipedene
Mahieu, Benjamin
FOOD QUALITY AND PREFERENCE, 2025, 127
[40] Research on a traditional Chinese medicine case-based question-answering system integrating large language models and knowledge graphs
Duan, Yuchen
Zhou, Qingqing
Li, Yu
Qin, Chi
Wang, Ziyang
Kan, Hongxing
Hu, Jili
FRONTIERS IN MEDICINE, 2025, 11

← 1 2 3 4 5 →