Supporting Qualitative Analysis with Large Language Models: Combining Codebook with GPT-3 for Deductive Coding

Cited by: 53
Authors
Xiao, Ziang [1]
Yuan, Xingdi [1]
Liao, Q. Vera [1]
Abdelghani, Rania [2]
Oudeyer, Pierre-Yves [2]
Affiliations
[1] Microsoft Res, Montreal, PQ, Canada
[2] INRIA, Paris, France
Keywords
Qualitative Analysis; Deductive Coding; Large Language Model; GPT-3;
DOI
10.1145/3581754.3584136
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Qualitative analysis of textual content unpacks rich and valuable information by assigning labels to the data. However, this process is often labor-intensive, particularly when working with large datasets. While recent AI-based tools demonstrate utility, researchers may not have readily available AI resources and expertise, and the task-specific models behind those tools have limited generalizability. In this study, we explored the use of large language models (LLMs) in supporting deductive coding, a major category of qualitative analysis in which researchers use pre-determined codebooks to label the data with a fixed set of codes. Instead of training task-specific models, a pre-trained LLM can be used directly for various tasks without fine-tuning, through prompt learning. Using a curiosity-driven question coding task as a case study, we found that, by combining GPT-3 with expert-drafted codebooks, our proposed approach achieved fair to substantial agreement with expert-coded results. We lay out challenges and opportunities in using LLMs to support qualitative coding and beyond.
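The workflow described in the abstract (a codebook placed in a prompt, model labels compared against expert labels) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the codebook entries, the prompt wording, and the function names `build_prompt`, `cohen_kappa`, and `interpret` are all hypothetical, and no model is actually called. The agreement bands assume the Landis and Koch (1977) scale for Cohen's kappa, which is the usual source of terms such as "fair" and "substantial"; the abstract itself does not name the statistic.

```python
from collections import Counter

# Hypothetical codebook: code names and descriptions are illustrative only,
# not the codebook used in the paper.
CODEBOOK = {
    "factual": "The question asks for a specific fact or definition.",
    "causal": "The question asks why or how something happens.",
}


def build_prompt(codebook, examples, text):
    """Build a codebook-centered few-shot prompt (assumed layout)."""
    lines = ["Assign exactly one code from the codebook to the text.", "", "Codebook:"]
    for code, description in codebook.items():
        lines.append(f"- {code}: {description}")
    lines.append("")
    for example_text, example_code in examples:
        lines += [f"Text: {example_text}", f"Code: {example_code}", ""]
    lines += [f"Text: {text}", "Code:"]
    return "\n".join(lines)


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa between two equal-length label sequences."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(labels_a)
    # Observed agreement: share of items where both coders chose the same code.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of each coder's marginal frequency per code.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a.keys() | count_b.keys()) / (n * n)
    if expected == 1:
        return 1.0  # both coders used a single identical code throughout
    return (observed - expected) / (1 - expected)


def interpret(kappa):
    """Map kappa to the Landis and Koch (1977) agreement bands."""
    if kappa < 0:
        return "poor"
    for upper, name in [(0.20, "slight"), (0.40, "fair"), (0.60, "moderate"), (0.80, "substantial")]:
        if kappa <= upper:
            return name
    return "almost perfect"


if __name__ == "__main__":
    prompt = build_prompt(CODEBOOK, [("What is a comet?", "factual")], "Why do comets have tails?")
    print(prompt)  # this string would be sent to an LLM; the call is omitted here
    expert = ["factual", "factual", "causal", "causal"]
    model = ["factual", "causal", "causal", "causal"]  # made-up labels for illustration
    kappa = cohen_kappa(expert, model)
    print(kappa, interpret(kappa))
```

In practice the completion returned by the model would be parsed into one of the codebook's codes before the agreement statistic is computed; that step is left out because it depends on the model and API used.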
Pages: 75-78
Page count: 4