ChatGPT for shaping the future of dentistry: the potential of multi-modal large language model

被引:84
|
作者
Huang, Hanyao [1 ,2 ,3 ]
Zheng, Ou [4 ]
Wang, Dongdong [4 ]
Yin, Jiayi [1 ,2 ,3 ]
Wang, Zijin [4 ]
Ding, Shengxuan [5 ]
Yin, Heng [1 ,2 ,3 ]
Xu, Chuan [6 ,7 ]
Yang, Renjie [8 ,9 ,10 ]
Zheng, Qian [1 ,2 ,3 ]
Shi, Bing [1 ,2 ,3 ]
机构
[1] Sichuan Univ, West China Hosp Stomatol, State Key Lab Oral Dis, Chengdu, Peoples R China
[2] Sichuan Univ, West China Hosp Stomatol, Natl Clin Res Ctr Oral Dis, Chengdu, Peoples R China
[3] Sichuan Univ, West China Hosp Stomatol, Dept Oral & Maxillofacial Surg, Chengdu, Peoples R China
[4] Univ Cent Florida, Dept Civil Environm & Construct Engn, Orlando, FL 32816 USA
[5] Univ Cent Florida, Coll Transportat Engn, Orlando, FL USA
[6] Southwest Jiaotong Univ, Sch Transportat & Logist, Chengdu, Peoples R China
[7] NYU, Ctr C2SMART, Tandon Sch Engn, Brooklyn, NY USA
[8] Sichuan Univ, West China Hosp Stomatol, State Key Lab Oral Dis, Chengdu, Peoples R China
[9] Sichuan Univ, West China Hosp Stomatol, Natl Clin Res Ctr Oral Dis, Chengdu, Peoples R China
[10] Sichuan Univ, West China Hosp Stomatol, Eastern Clin, Chengdu, Peoples R China
关键词
DIAGNOSIS; OUTCOMES; RECORDS; CANCER;
D O I
10.1038/s41368-023-00239-y
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
The ChatGPT, a lite and conversational variant of Generative Pretrained Transformer 4 (GPT-4) developed by OpenAI, is one of the milestone Large Language Models (LLMs) with billions of parameters. LLMs have stirred up much interest among researchers and practitioners in their impressive skills in natural language processing tasks, which profoundly impact various fields. This paper mainly discusses the future applications of LLMs in dentistry. We introduce two primary LLM deployment methods in dentistry, including automated dental diagnosis and cross-modal dental diagnosis, and examine their potential applications. Especially, equipped with a cross-modal encoder, a single LLM can manage multi-source data and conduct advanced natural language reasoning to perform complex clinical operations. We also present cases to demonstrate the potential of a fully automatic Multi-Modal LLM AI system for dentistry clinical application. While LLMs offer significant potential benefits, the challenges, such as data privacy, data quality, and model bias, need further study. Overall, LLMs have the potential to revolutionize dental diagnosis and treatment, which indicates a promising avenue for clinical application and research in dentistry.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Mobile language learning with multimedia and multi-modal interfaces
    Joseph, Sam
    Uther, Maria
    FOURTH IEEE INTERNATIONAL WORKSHOP ON WIRELESS, MOBILE AND UBIQUITOUS TECHNOLOGY IN EDUCATION, PROCEEDINGS, 2006, : 124 - +
  • [42] On the use of Multi-Modal Sensing in Sign Language Classification
    Sharma, Sneha
    Gupta, Rinki
    Kumar, Arun
    2019 6TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2019, : 495 - 500
  • [43] Skeleton Aware Multi-modal Sign Language Recognition
    Jiang, Songyao
    Sun, Bin
    Wang, Lichen
    Bai, Yue
    Li, Kunpeng
    Fu, Yun
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3408 - 3418
  • [44] Skeleton aware multi-modal sign language recognition
    Jiang, Songyao
    Sun, Bin
    Wang, Lichen
    Bai, Yue
    Li, Kunpeng
    Fu, Yun
    arXiv, 2021,
  • [45] Multi-modal language input: A learned superadditive effect
    Cheetham, Dominic
    APPLIED LINGUISTICS REVIEW, 2019, 10 (02) : 179 - 200
  • [46] Multi-modal interfaces for future applications of augmented cognition
    Vice, J
    Lockerd, A
    Lathan, C
    Foundations of Augmented Cognition, Vol 11, 2005, : 21 - 27
  • [47] Multi-modal and multi-model interrogation of large-scale functional brain networks
    Castaldo, Francesca
    dos Santos, Francisco Pascoa
    Timms, Ryan C.
    Cabral, Joana
    Vohryzek, Jakub
    Deco, Gustavo
    Woolrich, Mark
    Friston, Karl
    Verschure, Paul
    Litvak, Vladimir
    NEUROIMAGE, 2023, 277
  • [48] Comparison of Multi-Modal Large Language Models with Deep Learning Models for Medical Image Classification
    Than, Joel Chia Ming
    Vong, Wan Tze
    Yong, Kelvin Sheng Chek
    2024 IEEE 8TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS, ICSIPA, 2024,
  • [49] VirtuWander: Enhancing Multi-modal Interaction for Virtual Tour Guidance through Large Language Models
    Wang, Zhan
    Yuan, Lin-Ping
    Wang, Liangwei
    Jiang, Bingchuan
    Zeng, Wei
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS, CHI 2024, 2024,
  • [50] SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models
    Lin, Ziyi
    Liu, Dongyang
    Zhang, Renrui
    Gao, Peng
    Qiu, Longtian
    Xiao, Han
    Qiu, Han
    Shao, Wenqi
    Chen, Keqin
    Han, Jiaming
    Huang, Siyuan
    Zhang, Yichi
    He, Xuming
    Qiao, Yu
    Li, Hongsheng
    COMPUTER VISION - ECCV 2024, PT LXII, 2025, 15120 : 36 - 55