ChatGPT for shaping the future of dentistry: the potential of multi-modal large language model

Cited by: 84
Authors
Huang, Hanyao [1, 2, 3]
Zheng, Ou [4]
Wang, Dongdong [4]
Yin, Jiayi [1, 2, 3]
Wang, Zijin [4]
Ding, Shengxuan [5]
Yin, Heng [1, 2, 3]
Xu, Chuan [6, 7]
Yang, Renjie [8, 9, 10]
Zheng, Qian [1, 2, 3]
Shi, Bing [1, 2, 3]
Affiliations
[1] Sichuan Univ, West China Hosp Stomatol, State Key Lab Oral Dis, Chengdu, Peoples R China
[2] Sichuan Univ, West China Hosp Stomatol, Natl Clin Res Ctr Oral Dis, Chengdu, Peoples R China
[3] Sichuan Univ, West China Hosp Stomatol, Dept Oral & Maxillofacial Surg, Chengdu, Peoples R China
[4] Univ Cent Florida, Dept Civil Environm & Construct Engn, Orlando, FL 32816 USA
[5] Univ Cent Florida, Coll Transportat Engn, Orlando, FL USA
[6] Southwest Jiaotong Univ, Sch Transportat & Logist, Chengdu, Peoples R China
[7] NYU, Ctr C2SMART, Tandon Sch Engn, Brooklyn, NY USA
[8] Sichuan Univ, West China Hosp Stomatol, State Key Lab Oral Dis, Chengdu, Peoples R China
[9] Sichuan Univ, West China Hosp Stomatol, Natl Clin Res Ctr Oral Dis, Chengdu, Peoples R China
[10] Sichuan Univ, West China Hosp Stomatol, Eastern Clin, Chengdu, Peoples R China
Keywords
DIAGNOSIS; OUTCOMES; RECORDS; CANCER
DOI
10.1038/s41368-023-00239-y
Chinese Library Classification (CLC) number
R78 [Stomatology]
Discipline classification code
1003
Abstract
ChatGPT, a lightweight, conversational variant of Generative Pretrained Transformer 4 (GPT-4) developed by OpenAI, is one of the milestone Large Language Models (LLMs) with billions of parameters. LLMs have attracted considerable interest among researchers and practitioners for their impressive natural language processing skills, which profoundly impact various fields. This paper mainly discusses the future applications of LLMs in dentistry. We introduce two primary LLM deployment methods in dentistry, automated dental diagnosis and cross-modal dental diagnosis, and examine their potential applications. In particular, equipped with a cross-modal encoder, a single LLM can manage multi-source data and conduct advanced natural language reasoning to perform complex clinical operations. We also present cases to demonstrate the potential of a fully automatic multi-modal LLM AI system for clinical application in dentistry. While LLMs offer significant potential benefits, challenges such as data privacy, data quality, and model bias need further study. Overall, LLMs have the potential to revolutionize dental diagnosis and treatment, indicating a promising avenue for clinical application and research in dentistry.
Pages: 13