Evaluating Large Language Models in Generating Synthetic HCI Research Data: a Case Study

被引:45
|
作者
Hamalainen, Perttu [1 ]
Tavast, Mikke [1 ]
Kunnari, Anton [2 ]
机构
[1] Aalto Univ, Espoo, Finland
[2] Univ Helsinki, Helsinki, Finland
基金
芬兰科学院;
关键词
User experience; User models; Language models; GPT-3;
D O I
10.1145/3544548.3580688
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Collecting data is one of the bottlenecks of Human-Computer Interaction (HCI) research. Motivated by this, we explore the potential of large language models (LLMs) in generating synthetic user research data. We use OpenAI's GPT-3 model to generate open-ended questionnaire responses about experiencing video games as art, a topic not tractable with traditional computational user models. We test whether synthetic responses can be distinguished from real responses, analyze errors of synthetic data, and investigate content similarities between synthetic and real data. We conclude that GPT-3 can, in this context, yield believable accounts of HCI experiences. Given the low cost and high speed of LLM data generation, synthetic data should be useful in ideating and piloting new experiments, although any findings must obviously always be validated with real data. The results also raise concerns: if employed by malicious users of crowdsourcing services, LLMs may make crowdsourcing of self-report data fundamentally unreliable.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Risk or Chance? Large Language Models and Reproducibility in HCI Research
    Kosch, Thomas
    Feger, Sebastian
    [J]. Interactions (N.Y.), 2024, 31 (06): : 44 - 49
  • [2] Generating Synthetic Resume Data with Large Language Models for Enhanced Job Description Classification
    Skondras, Panagiotis
    Zervas, Panagiotis
    Tzimas, Giannis
    [J]. FUTURE INTERNET, 2023, 15 (11)
  • [3] Evaluating the Application of Large Language Models in Clinical Research Contexts
    Perlis, Roy H.
    Fihn, Stephan D.
    [J]. JAMA NETWORK OPEN, 2023, 6 (10)
  • [4] A Study Case of Automatic Archival Research and Compilation using Large Language Models
    Guo, Dongsheng
    Yue, Aizhen
    Ning, Fanggang
    Huang, Dengrong
    Chang, Bingxin
    Duan, Qiang
    Zhang, Lianchao
    Chen, Zhaoliang
    Zhang, Zheng
    Zhan, Enhao
    Zhang, Qilai
    Jiang, Kai
    Li, Rui
    Zhao, Shaoxiang
    Wei, Zizhong
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE GRAPH, ICKG, 2023, : 52 - 59
  • [5] Synthetic Replacements for Human Survey Data? The Perils of Large Language Models
    Bisbee, James
    Clinton, Joshua D.
    Dorff, Cassy
    Kenkel, Brenton
    Larson, Jennifer M.
    [J]. POLITICAL ANALYSIS, 2024,
  • [6] Generating Simulated Data with a Large Language Model
    Kerley, Jeffrey
    Anderson, Derek T.
    Buck, Andrew R.
    Alvey, Brendan
    [J]. SYNTHETIC DATA FOR ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING: TOOLS, TECHNIQUES, AND APPLICATIONS II, 2024, 13035
  • [7] Evaluating Large Language Models for Automated Reporting and Data Systems Categorization: Cross-Sectional Study
    Wu, Qingxia
    Li, Huali
    Wang, Yan
    Bai, Yan
    Wu, Yaping
    Yu, Xuan
    Li, Xiaodong
    Dong, Pei
    Xue, Jon
    Shen, Dinggang
    Wang, Meiyun
    [J]. JMIR MEDICAL INFORMATICS, 2024, 12
  • [8] Generating and Reviewing Programming Codes with Large Language Models A Systematic Mapping Study
    Lins de Albuquerque, Beatriz Ventorini
    Souza da Cunha, Antonio Fernando
    Souza, Leonardo
    Matsui Siqueira, Sean Wolfgand
    dos Santos, Rodrigo Pereira
    [J]. PROCEEDINGS OF THE 20TH BRAZILIAN SYMPOSIUM ON INFORMATIONS SYSTEMS, SBSI 2024, 2024,
  • [9] Generating colloquial radiology reports with large language models
    Tang, Cynthia Crystal
    Nagesh, Supriya
    Fussell, David A.
    Glavis-Bloom, Justin
    Mishra, Nina
    Li, Charles
    Cortes, Gillean
    Hill, Robert
    Zhao, Jasmine
    Gordon, Angellica
    Wright, Joshua
    Troutt, Hayden
    Tarrago, Rod
    Chow, Daniel S.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024,
  • [10] Evaluating large language models for annotating proteins
    Vitale, Rosario
    Bugnon, Leandro A.
    Fenoy, Emilio Luis
    Milone, Diego H.
    Stegmayer, Georgina
    [J]. BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)