Sparks of Generative Pretrained Transformers in Edge Intelligence for the Metaverse: Caching and Inference for Mobile Artificial Intelligence-Generated Content Services

被引:7
|
作者
Xu, Minrui [1 ]
Niyato, Dusit [1 ]
Zhang, Hongliang [2 ]
Kang, Jiawen [3 ]
Xiong, Zehui [4 ]
Mao, Shiwen [5 ]
Han, Zhu [6 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[2] Peking Univ, Sch Elect, Beijing 100084, Peoples R China
[3] Guangdong Univ Technol, Guangzhou 510006, Guangdong, Peoples R China
[4] Singapore Univ Technol & Design, Singapore 487372, Singapore
[5] Auburn Univ, Wireless Engn Res & Educ Ctr, Auburn, AL 36849 USA
[6] Univ Houston, Elect & Comp Engn Dept, Houston, TX 77204 USA
来源
IEEE VEHICULAR TECHNOLOGY MAGAZINE | 2023年 / 18卷 / 04期
基金
美国国家科学基金会; 新加坡国家研究基金会;
关键词
Task analysis; Servers; Metaverse; Artificial neural networks; Artificial intelligence; Adaptation models; Biological system modeling;
D O I
10.1109/MVT.2023.3323757
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Aiming at achieving artificial general intelligence (AGI) for the metaverse, pretrained foundation models (PFMs), e.g., generative pretrained transformers (GPTs), can effectively provide various artificial intelligence (AI) services, such as autonomous driving, digital twins (DTs), and AI-generated content (AIGC) for extended reality (XR). With the advantages of low latency and privacy-preserving, serving PFMs of mobile AI services in edge intelligence is a viable solution for caching and executing PFMs on edge servers with limited computing resources and GPU memory. However, PFMs typically consist of billions of parameters that are computation- and memory-intensive for edge servers during loading and execution. In this article, we investigate edge PFM serving problems for mobile AIGC services of the metaverse. First, we introduce the fundamentals of PFMs and discuss their characteristic fine-tuning and inference methods in edge intelligence. Then, we propose a novel framework of joint model caching and inference for managing models and allocating resources to satisfy users' requests efficiently. Furthermore, considering the in-context learning ability of PFMs, we propose a new metric to evaluate the freshness and relevance between examples in demonstrations and executing tasks, namely the Age of Context (AoC). Finally, we propose a least-context (LC) algorithm for managing cached models at edge servers by balancing the tradeoff among latency, energy consumption, and accuracy.
引用
收藏
页码:35 / 44
页数:10
相关论文
共 19 条
  • [1] Joint Foundation Model Caching and Inference of Generative AI Services for Edge Intelligence
    Xu, Minrui
    Niyato, Dusit
    Zhang, Hongliang
    Kang, Jiawen
    Xiong, Zehui
    Mao, Shiwen
    Han, Zhu
    [J]. IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 3548 - 3553
  • [2] Performance of Artificial Intelligence Content Detectors Using Human and Artificial Intelligence-Generated Scientific Writing
    Flitcroft, Madelyn A.
    Sheriff, Salma A.
    Wolfrath, Nathan
    Maddula, Ragasnehith
    McConnell, Laura
    Xing, Yun
    Haines, Krista L.
    Wong, Sandra L.
    Kothari, Anai N.
    [J]. ANNALS OF SURGICAL ONCOLOGY, 2024,
  • [3] Who Wrote This Editorial? The Use of Artificial Intelligence-Generated Content in Nursing Literature
    Fraser, Debbie
    [J]. NEONATAL NETWORK, 2023, 42 (03): : 115 - 117
  • [4] AI hallucination: towards a comprehensive classification of distorted information in artificial intelligence-generated content
    Sun, Yujie
    Sheng, Dongfang
    Zhou, Zihan
    Wu, Yifei
    [J]. HUMANITIES & SOCIAL SCIENCES COMMUNICATIONS, 2024, 11 (01):
  • [5] Exploring Designer Trust in Artificial Intelligence-Generated Content: TAM/TPB Model Study
    Wang, Shao-Feng
    Chen, Chun-Ching
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [6] Artificial Intelligence-Generated Content in Intelligent Transportation Systems: Learning to Copy, Change, and Create
    Lv, Yisheng
    [J]. IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2023, 15 (05) : 2 - 3
  • [7] Artificial intelligence-assisted generative pretrained transformers for applications of ChatGPT in higher education among graduates
    Jigna B. Prajapati
    Ashwini Kumar
    Sudarshan Singh
    Bhupendra Prajapati
    Yash Thakar
    Prashant R. Tambe
    Amit Ved
    [J]. SN Social Sciences, 4 (2):
  • [8] Artificial Intelligence-Generated Social Media Content Creation and Management Strategies for Plastic Surgeons
    Abi-Rafeh, Jad
    Cattelan, Leila
    Xu, Hong Hao
    Bassiri-Tehrani, Brian
    Kazan, Roy
    Nahai, Foad
    [J]. AESTHETIC SURGERY JOURNAL, 2024, 44 (07) : 769 - 778
  • [9] Artificial intelligence-generated peripheral blood film images using generative adversarial networks and diffusion models
    Fan, Bingwen Eugene
    Kwok, Ngai Tung Eric
    Chen, David Tao Yi
    Kuperan, Ponnudurai
    Abdul Latiff, Siti Thuraiya Binte
    Wong, Moh Sim
    Ong, Yi Xiong
    Lim, Kian Guan Eric
    Wong, Wei Yong Kevin
    Lim, Shu Ping
    Yong, Bryan Song Jun
    Shanmugam, Hemalatha
    Neo, Yuan Shan
    Ong, Yi Pin Joshua
    Winkler, Stefan
    [J]. AMERICAN JOURNAL OF HEMATOLOGY, 2023, 98 (09) : 1499 - 1502
  • [10] Leveraging artificial intelligence and mutual authentication to optimize content caching in edge data centers
    Marwan, Mbarek
    AlShahwan, Feda
    Afoudi, Yassine
    Temghart, Abdelkarim Ait
    Lazaar, Mohamed
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (09)