ChatGPT-4 and Human Researchers Are Equal in Writing Scientific Introduction Sections: A Blinded, Randomized, Non-inferiority Controlled Study

被引:8
|
作者
Sikander, Binyamin [1 ]
Baker, Jason J. [1 ]
Deveci, Can D. [1 ]
Lund, Lars [2 ]
Rosenberg, Jacob [1 ]
机构
[1] Herlev Hosp, Surg, Herlev, Denmark
[2] Odense Univ Hosp, Urol, Odense, Denmark
关键词
natural language processing; chatbot; artificial intelligence and writing; artificial intelligence in medicine; gpt-4; chatgpt;
D O I
10.7759/cureus.49019
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Natural language processing models are increasingly used in scientific research, and their ability to perform various tasks in the research process is rapidly advancing. This study aims to investigate whether Generative Pre-trained Transformer 4 (GPT-4) is equal to humans in writing introduction sections for scientific articles.Methods This randomized non-inferiority study was reported according to the Consolidated Standards of Reporting Trials for non-inferiority trials and artificial intelligence (AI) guidelines. GPT-4 was instructed to synthesize 18 introduction sections based on the aim of previously published studies, and these sections were compared to the human-written introductions already published in a medical journal. Eight blinded assessors randomly evaluated the introduction sections using 1-10 Likert scales.Results There was no significant difference between GPT-4 and human introductions regarding publishability and content quality. GPT-4 had one point significantly better scores in readability, which was considered a non -relevant difference. The majority of assessors (59%) preferred GPT-4, while 33% preferred human-written introductions. Based on Lix and Flesch-Kincaid scores, GPT-4 introductions were 10 and two points higher, respectively, indicating that the sentences were longer and had longer words.Conclusion GPT-4 was found to be equal to humans in writing introductions regarding publishability, readability, and content quality. The majority of assessors preferred GPT-4 introductions and less than half could determine which were written by GPT-4 or humans. These findings suggest that GPT-4 can be a useful tool for writing introduction sections, and further studies should evaluate its ability to write other parts of scientific articles.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Challenges of defining a non-inferiority margin: a case study of non-inferiority randomized controlled trials of oral anti-thrombolytic agents for prophylaxis of venous thromboembolic events after orthopedic surgery
    Grace Wangge
    Olaf H Klungel
    Kit CB Roes
    Antonius de Boer
    Arno W Hoes
    Mirjam J Knol
    Trials, 12 (Suppl 1)
  • [32] Laser acupuncture versus oral glucose administration for pain prevention in term neonates: an observer-blinded non-inferiority randomized controlled clinical trial
    Stadler, Jasmin
    Avian, Alexander
    Pichler, Gerhard
    Posch, Katrin
    Urlesberger, Berndt
    Raith, Wolfgang
    ACUPUNCTURE IN MEDICINE, 2021, 39 (06) : 589 - 595
  • [33] Oral oxycodone offers equivalent analgesia to intravenous patient-controlled analgesia after total hip replacement: a randomized, single-centre, non-blinded, non-inferiority study
    Rothwell, M. P.
    Pearson, D.
    Hunter, J. D.
    Mitchell, P. A.
    Graham-Woollard, T.
    Goodwin, L.
    Dunn, G.
    BRITISH JOURNAL OF ANAESTHESIA, 2011, 106 (06) : 865 - 872
  • [34] SELF-DIRECTED VIDEO VS. INSTRUCTOR-BASED NEONATAL RESUSCITATION TRAINING OF NOVICE PROVIDERS: A RANDOMIZED. CONTROLLED, BLINDED, NON-INFERIORITY, INTERNATIONAL STUDY
    Dannaway, D.
    Szyld, E. G.
    JOURNAL OF INVESTIGATIVE MEDICINE, 2019, 67 (02) : 632 - 633
  • [35] Effect and safety of anaprazole in the treatment of duodenal ulcers: a randomized, rabeprazole-controlled, phase III non-inferiority study
    Zhu, Huiyun
    Pan, Xue
    Zhang, Li
    Sun, Hongxin
    Fan, Huizhen
    Pan, Zhongwei
    Huang, Caibin
    Shi, Zhenwang
    Ding, Jin
    Wang, Qi
    Du, Yiqi
    Lyu, Nonghua
    Li, Zhaoshen
    CHINESE MEDICAL JOURNAL, 2022, 135 (24) : 2941 - 2949
  • [36] Performance and patients' satisfaction with the A7+TouchCare insulin patch pump system: A randomized controlled non-inferiority study
    Amadou, Coralie
    Melki, Vincent
    Allain, Jennifer
    Clavel, Sylvaine
    Gouet, Didier
    Chaillous, Lucy
    Catargi, Bogdan
    Schaeplynck-Belicard, Pauline
    Petit, Catherine
    Thivolet, Charles
    Penfornis, Alfred
    PLOS ONE, 2023, 18 (08):
  • [37] Effect and safety of anaprazole in the treatment of duodenal ulcers: a randomized, rabeprazole-controlled, phase III non-inferiority study
    Zhu Huiyun
    Pan Xue
    Zhang Li
    Sun Hongxin
    Fan Huizhen
    Pan Zhongwei
    Huang Caibin
    Shi Zhenwang
    Ding Jin
    Wang Qi
    Du Yiqi
    Lyu Nonghua
    Li Zhaoshen
    中华医学杂志英文版, 2022, 135 (24)
  • [38] Imrecoxib versus celecoxib as postoperative analgesia for patients receiving arthroscopic knee surgery: a randomized, controlled, non-inferiority study
    Wei Guo
    Ying Liu
    Jingjing Li
    Inflammopharmacology, 2022, 30 : 875 - 881
  • [39] Imrecoxib versus celecoxib as postoperative analgesia for patients receiving arthroscopic knee surgery: a randomized, controlled, non-inferiority study
    Guo, Wei
    Liu, Ying
    Li, Jingjing
    INFLAMMOPHARMACOLOGY, 2022, 30 (03) : 875 - 881
  • [40] Is Remote Stretching Based On Myofascial Chains Equally Effective As Local Exercise? A Randomized Controlled Non-inferiority Study.
    Wilke, Jan
    Niederer, Daniel
    Welpe, Nadine
    Vogt, Lutz
    Banzer, Winfried
    MEDICINE AND SCIENCE IN SPORTS AND EXERCISE, 2016, 48 (05): : 498 - 498