Supporting Qualitative Analysis with Large Language Models: Combining Codebook with GPT-3 for Deductive Coding

Cited by: 53
Authors
Xiao, Ziang [1 ]
Yuan, Xingdi [1 ]
Liao, Q. Vera [1 ]
Abdelghani, Rania [2 ]
Oudeyer, Pierre-Yves [2 ]
Affiliations
[1] Microsoft Res, Montreal, PQ, Canada
[2] INRIA, Paris, France
Keywords
Qualitative Analysis; Deductive Coding; Large Language Model; GPT-3;
DOI
10.1145/3581754.3584136
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Discipline Classification Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Qualitative analysis of textual content unpacks rich and valuable information by assigning labels to the data. However, this process is often labor-intensive, particularly when working with large datasets. While recent AI-based tools demonstrate utility, researchers may not have readily available AI resources and expertise, and are further challenged by the limited generalizability of task-specific models. In this study, we explored the use of large language models (LLMs) in supporting deductive coding, a major category of qualitative analysis where researchers use pre-determined codebooks to label the data into a fixed set of codes. Instead of training task-specific models, a pre-trained LLM can be used directly for various tasks without fine-tuning through prompt learning. Using a curiosity-driven question coding task as a case study, we found that, by combining GPT-3 with expert-drafted codebooks, our proposed approach achieved fair to substantial agreement with expert-coded results. We lay out challenges and opportunities in using LLMs to support qualitative coding and beyond.
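The abstract's core idea, using a pre-trained LLM for deductive coding via prompting rather than fine-tuning, can be illustrated with a minimal sketch. The codebook entries, question text, and helper names below are hypothetical examples, not taken from the paper; the sketch only shows how a codebook might be turned into a zero-shot classification prompt.

```python
# Illustrative sketch (not the authors' implementation): assembling a
# deductive-coding prompt from an expert-drafted codebook, so a pre-trained
# LLM can assign one of a fixed set of codes without fine-tuning.

# Hypothetical codebook: code name -> definition written by an expert.
CODEBOOK = {
    "causal": "Asks about a cause or mechanism (why/how something happens).",
    "factual": "Asks for a specific fact (what/when/who/where).",
    "hypothetical": "Poses an imagined scenario (what if ...).",
}

def build_prompt(codebook: dict, text: str) -> str:
    """Build a zero-shot classification prompt from codebook definitions."""
    lines = ["Assign exactly one code to the question below.", "", "Codebook:"]
    for code, definition in codebook.items():
        lines.append(f"- {code}: {definition}")
    lines += ["", f"Question: {text}", "Code:"]
    return "\n".join(lines)

prompt = build_prompt(CODEBOOK, "Why do leaves change color in autumn?")
print(prompt)
# The prompt would then be sent to an LLM completion endpoint (requires an
# API key; not executed here), and the returned completion would be matched
# against the codebook's code names to obtain the assigned label.
```

In the study, agreement between such model-assigned codes and expert-assigned codes was then measured, reaching fair to substantial levels.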
Pages: 75-78
Page count: 4
References: 48