Poster: Enabling Agent-centric Interaction on Smartphones with LLM-based UI Reassembling

被引:0
|
作者
Wen, Hao [1 ]
Du, Wenjie [1 ]
Li, Yuanchun [1 ]
Liu, Yunxin [1 ]
机构
[1] Tsinghua Univ, Inst AI Ind Res AIR, Beijing, Peoples R China
关键词
UI reassembling; LLM agent; mobile device;
D O I
10.1145/3643832.3661432
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this poster, we introduce a novel dynamic user interface (UI) specifically designed for mobile devices powered by large language models (LLMs) agents. The advent of LLMs has led to a surge in deploying LLM-based agents on personal and Internet of Things (IoT) devices, with the aim of facilitating various daily tasks through device manipulation. However, this integration poses a significant challenge: how to intelligently and flexibly select and present information both during and after the execution of tasks, ensuring users are well-informed about the operations and can access the desired results conveniently. To address this challenge, we propose a UI reassembling method. This method allows for analyzing and strategically combining different mobile applications and their UI components, enabling the dynamic construction and adjustment of UIs tailored to user needs. Our prototype exhibits promising performance, with the UI selection module achieving an F1 score of 0.74. This innovative approach opens up exciting possibilities of new user-device interaction paradigm, leveraging the capabilities of LLMs to enhance the user experience in handling mobile and IoT devices.
引用
收藏
页码:706 / 707
页数:2
相关论文
共 25 条
  • [21] ComfortLearn: Enabling agent-based occupant-centric building controls
    Quintana, Matias
    Nagy, Zoltan
    Tartarini, Federico
    Schiavon, Stefano
    Miller, Clayton
    PROCEEDINGS OF THE 2022 THE 9TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2022, 2022, : 475 - 478
  • [22] Can an Embodied Agent Find Your "Cat-shaped Mug"? LLM-Based Zero-Shot Object Navigation
    Dorbala, Vishnu Sashank
    Mullen, James F.
    Manocha, Dinesh
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4083 - 4090
  • [23] "My agent understands me better": Integrating Dynamic Human-like Memory Recall and Consolidation in LLM-Based Agents
    Hou, Yuki
    Tamoto, Haruki
    Miyashita, Homei
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [24] HandSee: Enabling Full Hand Interaction on Smartphones with Front Camera-based Stereo Vision
    Yu, Chun
    Wei, Xiaoying
    Vachher, Shubh
    Qin, Yue
    Liang, Chen
    Weng, Yueting
    Gu, Yizheng
    Shi, Yuanchun
    CHI 2019: PROCEEDINGS OF THE 2019 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2019,
  • [25] An LLM-Based Framework for Zero-Shot De-Identifying Flexible Text Data in Protected Health Information Enabling Potential Risk-Informed Patient Safety
    Chang, C. W.
    Hu, M.
    Ghavidel, B.
    Wynne, J. F.
    Qiu, R. L. J.
    Washington, M.
    Kayode, O.
    Chin, W. G.
    Yang, K.
    Scott, J. G.
    Patel, A. B., Jr.
    Yang, X.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2024, 120 (02): : E518 - E518