Assessing the Performance of Chat Generative Pretrained Transformer (ChatGPT) in Answering Andrology-Related Questions

Cited by: 6
Authors
Caglar, Ufuk [1]
Yildiz, Oguzhan [1]
Ozervarli, M. Firat [2]
Aydin, Resat [2]
Sarilar, Omer [1]
Ozgor, Faruk [1]
Ortac, Mazhar [2]
Affiliations
[1] Haseki Training & Res Hosp, Dept Urol, Istanbul, Turkiye
[2] Istanbul Univ, Istanbul Sch Med, Dept Urol, Istanbul, Turkiye
Keywords
Andrology; artificial intelligence; information sources
DOI
10.5152/tud.2023.23171
Chinese Library Classification
R5 [Internal medicine]; R69 [Urology (diseases of the genitourinary system)]
Discipline classification code
1002; 100201
Abstract
Objective: The internet and social media have become primary sources of health information, with men frequently turning to these platforms before seeking professional help. Chat Generative Pretrained Transformer (ChatGPT), an artificial intelligence model developed by OpenAI, has gained popularity as a natural language processing program. The present study evaluated the accuracy and reproducibility of ChatGPT's responses to andrology-related questions.
Methods: Frequently asked andrology questions were collected from health forums, hospital websites, and social media platforms such as YouTube and Instagram, and were grouped by topic (for example, male hypogonadism and erectile dysfunction). Questions based on the European Association of Urology (EAU) guideline recommendations were also included. All questions were entered into ChatGPT, and the responses were scored on a scale of 1 to 4 by 3 experienced urologists.
Results: Of the 136 questions evaluated, 108 met the criteria. Of these, 87.9% received correct and adequate answers, 9.3% received correct but insufficient answers, and 3 responses contained both correct and incorrect information. No question was answered completely incorrectly. The highest correct answer rates were for disorders of ejaculation, penile curvature, and male hypogonadism. The EAU guideline-based questions achieved a correctness rate of 86.3%. The reproducibility of the answers was over 90%.
Conclusion: ChatGPT provided accurate and reliable answers to over 80% of andrology-related questions. Although limitations exist, such as potentially outdated data and an inability to understand emotional aspects, ChatGPT's potential in the health-care sector is promising. Collaborating with health-care professionals during artificial intelligence model development could enhance its reliability.
Pages: 365-369
Number of pages: 5