Can Large Language Models Be Good Companions? An LLM-Based Eyewear System with Conversational Common Ground

被引：0

作者：

Xu, Zhenyu ^{[1
]}

Xu, Hailin ^{[1
]}

Lu, Zhouyang ^{[1
]}

Zhao, Yingying ^{[2
]}

Zhu, Rui ^{[3
]}

Wang, Yujiang ^{[4
]}

Dong, Mingzhi ^{[1
]}

Chang, Yuhu ^{[1
]}

Lv, Qin ^{[5
]}

Dick, Robert P. ^{[6
]}

Yang, Fan ^{[7
]}

Lu, Tun ^{[1
]}

Gu, Ning ^{[1
]}

Shang, Li ^{[1
]}

机构：

[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China

[2] Univ Strathclyde, Dept Comp & Informat Sci, Glasgow, Lanark, Scotland

[3] City Univ London, Bayes Business Sch, London, England

[4] Oxford Suzhou Ctr Adv Res, Suzhou, Peoples R China

[5] Univ Colorado Boulder, Dept Comp Sci, Boulder, CO USA

[6] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI USA

[7] Fudan Univ, Sch Microelect, Shanghai, Peoples R China

来源：

PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT | 2024年 / 8卷 / 02期

关键词：

Smart eyewear; large language model; common ground; context-aware;

D O I：

10.1145/3659600

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Developing chatbots as personal companions has long been a goal of artificial intelligence researchers. Recent advances in Large Language Models (LLMs) have delivered a practical solution for endowing chatbots with anthropomorphic language capabilities. However, it takes more than LLMs to enable chatbots that can act as companions. Humans use their understanding of individual personalities to drive conversations. Chatbots also require this capability to enable human-like companionship. They should act based on personalized, real-time, and time-evolving knowledge of their users. We define such essential knowledge as the common ground between chatbots and their users, and we propose to build a common-ground-aware dialogue system from an LLM-based module, named OS-1, to enable chatbot companionship. Hosted by eyewear, OS-1 can sense the visual and audio signals the user receives and extract real-time contextual semantics. Those semantics are categorized and recorded to formulate historical contexts from which the user's profile is distilled and evolves over time, i.e., OS-1 gradually learns about its user. OS-1 combines knowledge from real-time semantics, historical contexts, and user-specific profiles to produce a common-ground-aware prompt input into the LLM module. The LLM's output is converted to audio, spoken to the wearer when appropriate. We conduct laboratory and in-field studies to assess OS-1's ability to build common ground between the chatbot and its user. The technical feasibility and capabilities of the system are also evaluated. Our results show that by utilizing personal context, OS-1 progressively develops a better understanding of its users. This enhances user satisfaction and potentially leads to various personal service scenarios, such as emotional support and assistance.

引用

页数：41

共 45 条

[21] Managing the Personality of NPCs with Your Interactions: A Game Design System Based on Large Language Models
Dai, Muyun
Yuan, Chun
Nie, Xiaomei
HCI IN GAMES, PT I, HCI-GAMES 2024, 2024, 14730 : 247 - 259
[22] Toward an efficient extractive Arabic text summarisation system based on Arabic large language models
Bourahouat, Ghizlane
Abourezq, Manar
Daoudi, Najima
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
[23] Assessment of large language models for use in generative design of model based spacecraft system architectures
Timperley, Louis Richard
Berthoud, Lucy
Snider, Chris
Tryfonas, Theo
JOURNAL OF ENGINEERING DESIGN, 2025,
[24] Value-based Healthcare: Can Generative Artificial Intelligence and Large Language Models be a Catalyst for Value-based Healthcare?
Jayakumar, Prakash
Nijhuis, Koen D. Oude
Oosterhoff, Jacobien H. F.
Bozic, Kevin J.
CLINICAL ORTHOPAEDICS AND RELATED RESEARCH, 2023, 481 (10) : 1890 - 1894
[25] Good yields of common purslane with a high fatty acid content can be obtained in a peat-based floating system
Cros, Victor
Martinez-Sanchez, Juan Jose
Franco, Jose Antonio
HORTTECHNOLOGY, 2007, 17 (01) : 14 - 20
[26] Quant-LLM: Accelerating the Serving of Large Language Models via FP6-Centric Algorithm-System Co-Design on Modern GPUs
Xia, Haojun
Zheng, Zhen
Wu, Xiaoxia
Chen, Shiyang
Yao, Zhewei
Youn, Stephen
Bakhtiari, Arash
Wyatt, Michael
Zhuang, Donglin
Zhou, Zhongzhu
Ruwase, Olatunji
He, Yuxiong
Song, Shuaiwen Leon
PROCEEDINGS OF THE 2024 USENIX ANNUAL TECHNICAL CONFERENCE, ATC 2024, 2024, : 699 - 713
[27] VAEnvGen: A Real-Time Virtual Agent Environment Generation System Based on Large Language Models
Wu, Jingyu
Chen, Pengchen
Chen, Shi
Wei, Xiang
Sun, Lingyun
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2024,
[28] LSTM-Based Language Models for Very Large Vocabulary Continuous Russian Speech Recognition System
Kipyatkova, Irina
SPEECH AND COMPUTER, SPECOM 2019, 2019, 11658 : 219 - 226
[29] Metadata and Review-Based Hybrid Apparel Recommendation System Using Cascaded Large Language Models
Roy, Sanjiban Sekhar
Kumar, Ayush
Suresh Kumar, Rishikesh
IEEE ACCESS, 2024, 12 : 140053 - 140071
[30] ACIGS: An automated large-scale crops image generation system based on large visual language multi-modal models
Liu, Bolong
Zhang, Hao
Liu, Jie
Wang, Qiang
2023 20TH ANNUAL IEEE INTERNATIONAL CONFERENCE ON SENSING, COMMUNICATION, AND NETWORKING, SECON, 2023,

← 1 2 3 4 5 →