ChatGPT-4 and Human Researchers Are Equal in Writing Scientific Introduction Sections: A Blinded, Randomized, Non-inferiority Controlled Study

被引：8

作者：

Sikander, Binyamin ^{[1
]}

Baker, Jason J. ^{[1
]}

Deveci, Can D. ^{[1
]}

Lund, Lars ^{[2
]}

Rosenberg, Jacob ^{[1
]}

机构：

[1] Herlev Hosp, Surg, Herlev, Denmark

[2] Odense Univ Hosp, Urol, Odense, Denmark

来源：

CUREUS JOURNAL OF MEDICAL SCIENCE | 2023年 / 15卷 / 11期

关键词：

natural language processing; chatbot; artificial intelligence and writing; artificial intelligence in medicine; gpt-4; chatgpt;

D O I：

10.7759/cureus.49019

中图分类号：

R5 [内科学];

学科分类号：

1002 ; 100201 ;

摘要：

Background Natural language processing models are increasingly used in scientific research, and their ability to perform various tasks in the research process is rapidly advancing. This study aims to investigate whether Generative Pre-trained Transformer 4 (GPT-4) is equal to humans in writing introduction sections for scientific articles.Methods This randomized non-inferiority study was reported according to the Consolidated Standards of Reporting Trials for non-inferiority trials and artificial intelligence (AI) guidelines. GPT-4 was instructed to synthesize 18 introduction sections based on the aim of previously published studies, and these sections were compared to the human-written introductions already published in a medical journal. Eight blinded assessors randomly evaluated the introduction sections using 1-10 Likert scales.Results There was no significant difference between GPT-4 and human introductions regarding publishability and content quality. GPT-4 had one point significantly better scores in readability, which was considered a non -relevant difference. The majority of assessors (59%) preferred GPT-4, while 33% preferred human-written introductions. Based on Lix and Flesch-Kincaid scores, GPT-4 introductions were 10 and two points higher, respectively, indicating that the sentences were longer and had longer words.Conclusion GPT-4 was found to be equal to humans in writing introductions regarding publishability, readability, and content quality. The majority of assessors preferred GPT-4 introductions and less than half could determine which were written by GPT-4 or humans. These findings suggest that GPT-4 can be a useful tool for writing introduction sections, and further studies should evaluate its ability to write other parts of scientific articles.

引用

页数：10

共 50 条

[21] Non-operative management of uncomplicated appendicitis in children: a randomized, controlled, non-inferiority study evaluating safety and efficacy
Adams, Susan Elizabeth
Perera, Meegodage Roshell Swindri
Fung, Saskia
Maxton, Jordon
Karpelowsky, Jonathan
ANZ JOURNAL OF SURGERY, 2024, 94 (09) : 1569 - 1577
[22] The efficacy of topical sesame oil in patients with knee osteoarthritis: A randomized double-blinded active-controlled non-inferiority clinical trial
Askari, Alireza
Ravansalar, Seyed Ali
Naghizadeh, Mohammad Mehdi
Mosavat, Seyed Hamdollah
Khodadoost, Mahmood
Jazani, Arezoo Moini
Hashempur, Mohammad Hashem
COMPLEMENTARY THERAPIES IN MEDICINE, 2019, 47
[23] Correction to: A brief intervention for PTSD versus treatment as usual: Study protocol for a non-inferiority randomized controlled trial
Halvor Stavland
Camilla Refvik
Jarle Eid
Rafiq Lockhat
Åsa Hammar
Trials, 22
[24] Home Biofeedback Therapy Improves Fecal Incontinence Severity and Quality of Life in a Non-Inferiority Randomized Controlled Study
Sharma, Amol
Xiang, Xuelian
Yan, Yun
Patcharatrakul, Tanisa
Parr, Rachel
Rao, Satish S. C.
AMERICAN JOURNAL OF GASTROENTEROLOGY, 2018, 113 : S245 - S245
[25] Weekly azathioprine pulse versus daily azathioprine in the treatment of Parthenium dermatitis: A non-inferiority randomized controlled study
Verma, Kaushal K.
Sethuraman, G.
Kalavani, M.
INDIAN JOURNAL OF DERMATOLOGY VENEREOLOGY & LEPROLOGY, 2015, 81 (03): : 251 - 256
[26] Biological Response of Irisin Induced by Different Types of Exercise in Obese Subjects: A Non-Inferiority Controlled Randomized Study
D'Amuri, Andrea
Raparelli, Valeria
Sanz, Juana Maria
Capatti, Eleonora
Di Vece, Francesca
Vaccari, Filippo
Lazzer, Stefano
Zuliani, Giovanni
Dalla Nora, Edoardo
Neri, Luca Maria
Passaro, Angelina
BIOLOGY-BASEL, 2022, 11 (03):
[27] Randomized, controlled, open-label, non-inferiority study of the CONSORT algorithm for individualized dosing of follitropin alfa
Olivennes, F.
Trew, G.
Borini, A.
Broekmans, F.
Arriagada, P.
Warne, D. W.
Howles, C. M.
REPRODUCTIVE BIOMEDICINE ONLINE, 2015, 30 (03) : 248 - 257
[28] Treatment of neonatal jaundice with filtered sunlight in Nigerian neonates: study protocol of a non-inferiority, randomized controlled trial
Tina M Slusher
Bolajoko O Olusanya
Hendrik J Vreman
Ronald J Wong
Ann M Brearley
Yvonne E Vaucher
David K Stevenson
Trials, 14
[29] Treatment of neonatal jaundice with filtered sunlight in Nigerian neonates: study protocol of a non-inferiority, randomized controlled trial
Slusher, Tina M.
Olusanya, Bolajoko O.
Vreman, Hendrik J.
Wong, Ronald J.
Brearley, Ann M.
Vaucher, Yvonne E.
Stevenson, David K.
TRIALS, 2013, 14
[30] The efficacy and safety of epidural morphine/hydromorphone in the treatment of intractable postherpetic neuralgia: A single-center, double-blinded, randomized controlled, prospective, and non-inferiority study
Sun, Yiping
Shen, Jiayi
Hei, Guang
Yun, Ji
Ma, Bingjie
Huang, Xuehua
Yu, Zhiyuan
Ma, Pingchuan
Ke, Ma
FRONTIERS IN PHARMACOLOGY, 2022, 13

← 1 2 3 4 5 →