ChatGPT-4 and Human Researchers Are Equal in Writing Scientific Introduction Sections: A Blinded, Randomized, Non-inferiority Controlled Study

Cited by: 8

Authors:

Sikander, Binyamin [1]

Baker, Jason J. [1]

Deveci, Can D. [1]

Lund, Lars [2]

Rosenberg, Jacob [1]

Affiliations:

[1] Herlev Hosp, Surg, Herlev, Denmark

[2] Odense Univ Hosp, Urol, Odense, Denmark

Source:

CUREUS JOURNAL OF MEDICAL SCIENCE | 2023 / Volume 15 / Issue 11

Keywords:

natural language processing; chatbot; artificial intelligence and writing; artificial intelligence in medicine; gpt-4; chatgpt;

DOI:

10.7759/cureus.49019

Chinese Library Classification (CLC) number:

R5 [Internal Medicine];

Discipline classification code:

1002 ; 100201 ;

Abstract:

Background: Natural language processing models are increasingly used in scientific research, and their ability to perform various tasks in the research process is rapidly advancing. This study aims to investigate whether Generative Pre-trained Transformer 4 (GPT-4) is equal to humans in writing introduction sections for scientific articles.

Methods: This randomized non-inferiority study was reported according to the Consolidated Standards of Reporting Trials for non-inferiority trials and artificial intelligence (AI) guidelines. GPT-4 was instructed to synthesize 18 introduction sections based on the aim of previously published studies, and these sections were compared to the human-written introductions already published in a medical journal. Eight blinded assessors randomly evaluated the introduction sections using 1-10 Likert scales.

Results: There was no significant difference between GPT-4 and human introductions regarding publishability and content quality. GPT-4 scored significantly better in readability by one point, which was considered a non-relevant difference. The majority of assessors (59%) preferred GPT-4, while 33% preferred human-written introductions. Based on Lix and Flesch-Kincaid scores, GPT-4 introductions were 10 and two points higher, respectively, indicating that the sentences were longer and had longer words.

Conclusion: GPT-4 was found to be equal to humans in writing introductions regarding publishability, readability, and content quality. The majority of assessors preferred GPT-4 introductions, and less than half could determine which were written by GPT-4 or humans. These findings suggest that GPT-4 can be a useful tool for writing introduction sections, and further studies should evaluate its ability to write other parts of scientific articles.


Pages: 10

