ChatGPT-4 and Human Researchers Are Equal in Writing Scientific Introduction Sections: A Blinded, Randomized, Non-inferiority Controlled Study

被引:8
|
作者
Sikander, Binyamin [1 ]
Baker, Jason J. [1 ]
Deveci, Can D. [1 ]
Lund, Lars [2 ]
Rosenberg, Jacob [1 ]
机构
[1] Herlev Hosp, Surg, Herlev, Denmark
[2] Odense Univ Hosp, Urol, Odense, Denmark
关键词
natural language processing; chatbot; artificial intelligence and writing; artificial intelligence in medicine; gpt-4; chatgpt;
D O I
10.7759/cureus.49019
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background Natural language processing models are increasingly used in scientific research, and their ability to perform various tasks in the research process is rapidly advancing. This study aims to investigate whether Generative Pre-trained Transformer 4 (GPT-4) is equal to humans in writing introduction sections for scientific articles.Methods This randomized non-inferiority study was reported according to the Consolidated Standards of Reporting Trials for non-inferiority trials and artificial intelligence (AI) guidelines. GPT-4 was instructed to synthesize 18 introduction sections based on the aim of previously published studies, and these sections were compared to the human-written introductions already published in a medical journal. Eight blinded assessors randomly evaluated the introduction sections using 1-10 Likert scales.Results There was no significant difference between GPT-4 and human introductions regarding publishability and content quality. GPT-4 had one point significantly better scores in readability, which was considered a non -relevant difference. The majority of assessors (59%) preferred GPT-4, while 33% preferred human-written introductions. Based on Lix and Flesch-Kincaid scores, GPT-4 introductions were 10 and two points higher, respectively, indicating that the sentences were longer and had longer words.Conclusion GPT-4 was found to be equal to humans in writing introductions regarding publishability, readability, and content quality. The majority of assessors preferred GPT-4 introductions and less than half could determine which were written by GPT-4 or humans. These findings suggest that GPT-4 can be a useful tool for writing introduction sections, and further studies should evaluate its ability to write other parts of scientific articles.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Non-operative management of uncomplicated appendicitis in children: a randomized, controlled, non-inferiority study evaluating safety and efficacy
    Adams, Susan Elizabeth
    Perera, Meegodage Roshell Swindri
    Fung, Saskia
    Maxton, Jordon
    Karpelowsky, Jonathan
    ANZ JOURNAL OF SURGERY, 2024, 94 (09) : 1569 - 1577
  • [22] The efficacy of topical sesame oil in patients with knee osteoarthritis: A randomized double-blinded active-controlled non-inferiority clinical trial
    Askari, Alireza
    Ravansalar, Seyed Ali
    Naghizadeh, Mohammad Mehdi
    Mosavat, Seyed Hamdollah
    Khodadoost, Mahmood
    Jazani, Arezoo Moini
    Hashempur, Mohammad Hashem
    COMPLEMENTARY THERAPIES IN MEDICINE, 2019, 47
  • [23] Correction to: A brief intervention for PTSD versus treatment as usual: Study protocol for a non-inferiority randomized controlled trial
    Halvor Stavland
    Camilla Refvik
    Jarle Eid
    Rafiq Lockhat
    Åsa Hammar
    Trials, 22
  • [24] Home Biofeedback Therapy Improves Fecal Incontinence Severity and Quality of Life in a Non-Inferiority Randomized Controlled Study
    Sharma, Amol
    Xiang, Xuelian
    Yan, Yun
    Patcharatrakul, Tanisa
    Parr, Rachel
    Rao, Satish S. C.
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2018, 113 : S245 - S245
  • [25] Weekly azathioprine pulse versus daily azathioprine in the treatment of Parthenium dermatitis: A non-inferiority randomized controlled study
    Verma, Kaushal K.
    Sethuraman, G.
    Kalavani, M.
    INDIAN JOURNAL OF DERMATOLOGY VENEREOLOGY & LEPROLOGY, 2015, 81 (03): : 251 - 256
  • [26] Biological Response of Irisin Induced by Different Types of Exercise in Obese Subjects: A Non-Inferiority Controlled Randomized Study
    D'Amuri, Andrea
    Raparelli, Valeria
    Sanz, Juana Maria
    Capatti, Eleonora
    Di Vece, Francesca
    Vaccari, Filippo
    Lazzer, Stefano
    Zuliani, Giovanni
    Dalla Nora, Edoardo
    Neri, Luca Maria
    Passaro, Angelina
    BIOLOGY-BASEL, 2022, 11 (03):
  • [27] Randomized, controlled, open-label, non-inferiority study of the CONSORT algorithm for individualized dosing of follitropin alfa
    Olivennes, F.
    Trew, G.
    Borini, A.
    Broekmans, F.
    Arriagada, P.
    Warne, D. W.
    Howles, C. M.
    REPRODUCTIVE BIOMEDICINE ONLINE, 2015, 30 (03) : 248 - 257
  • [28] Treatment of neonatal jaundice with filtered sunlight in Nigerian neonates: study protocol of a non-inferiority, randomized controlled trial
    Tina M Slusher
    Bolajoko O Olusanya
    Hendrik J Vreman
    Ronald J Wong
    Ann M Brearley
    Yvonne E Vaucher
    David K Stevenson
    Trials, 14
  • [29] Treatment of neonatal jaundice with filtered sunlight in Nigerian neonates: study protocol of a non-inferiority, randomized controlled trial
    Slusher, Tina M.
    Olusanya, Bolajoko O.
    Vreman, Hendrik J.
    Wong, Ronald J.
    Brearley, Ann M.
    Vaucher, Yvonne E.
    Stevenson, David K.
    TRIALS, 2013, 14
  • [30] The efficacy and safety of epidural morphine/hydromorphone in the treatment of intractable postherpetic neuralgia: A single-center, double-blinded, randomized controlled, prospective, and non-inferiority study
    Sun, Yiping
    Shen, Jiayi
    Hei, Guang
    Yun, Ji
    Ma, Bingjie
    Huang, Xuehua
    Yu, Zhiyuan
    Ma, Pingchuan
    Ke, Ma
    FRONTIERS IN PHARMACOLOGY, 2022, 13