Semi-automated categorization of open-ended questions

被引:33
|
作者
Schonlau, Matthias [1 ]
Couper, Mick P. [2 ]
机构
[1] Univ Waterloo, Waterloo, ON N2L 3G1, Canada
[2] Univ Michigan, Ann Arbor, MI 48109 USA
来源
SURVEY RESEARCH METHODS | 2016年 / 10卷 / 02期
关键词
multinomial boosting; gradient boosting; qualitative data; coding; text mining;
D O I
10.18148/srm/2016.v1012.6213
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Text data from open-ended questions in surveys are difficult to analyze and are frequently ignored. Yet open-ended questions are important because they do not constrain respondents' answer choices. Where open-ended questions are necessary, sometimes multiple human coders hand-code answers into one of several categories. At the same time, computer scientists have made impressive advances in text mining that may allow automation of such coding. Automated algorithms do not achieve an overall accuracy high enough to entirely replace humans. We categorize easy-to-categorize text answers of open-ended questions automatically using text mining and multinomial boosting, and hard-to-categorize text answers manually. Expected accuracies guide the choice of the threshold delineating between "easy" and "hard" to code text answers. This approach is illustrated with two examples from open-ended questions related to respondents' advice to a patient in a hypothetical dilemma, and a follow-up probe related to respondents' perception of disclosure/privacy risk. Targeting 80% accuracy, we found that 47%-58% of the data could be categorized automatically in research surveys.
引用
收藏
页码:143 / 152
页数:10
相关论文
共 50 条
  • [1] Semi-automated classification for multi-label open-ended questions
    Gweon, Hyukjun
    Schonlau, Matthias
    Wenemark, Marika
    SURVEY METHODOLOGY, 2020, 46 (02) : 265 - 282
  • [2] Automated Classification for Open-Ended Questions with BERT
    Gweon, Hyukjun
    Schonlau, Matthias
    JOURNAL OF SURVEY STATISTICS AND METHODOLOGY, 2024, 12 (02) : 493 - 504
  • [3] CHECKLISTS AND OPEN-ENDED QUESTIONS - REPLY
    EHRLICH, HJ
    RINEHART, JW
    SOCIAL FORCES, 1966, 44 (03) : 420 - 421
  • [4] REALIZING THE PROMISE OF OPEN-ENDED QUESTIONS
    Wasik, Barbara A.
    Hindman, Annemarie H.
    READING TEACHER, 2013, 67 (04): : 302 - 311
  • [5] ARE OPEN-ENDED QUESTIONS WORTH THE EFFORT
    PAYNE, SL
    JOURNAL OF MARKETING RESEARCH, 1965, 2 (04) : 417 - 418
  • [6] Examples in Open-Ended Survey Questions
    Tourangeau, Roger
    Sun, Hanyu
    Conrad, Frederick G.
    Couper, Mick P.
    INTERNATIONAL JOURNAL OF PUBLIC OPINION RESEARCH, 2017, 29 (04) : 690 - 702
  • [7] Open-ended interview questions and saturation
    Weller, Susan C.
    Vickers, Ben
    Bernard, H. Russell
    Blackburn, Alyssa M.
    Borgatti, Stephen
    Gravlee, Clarence C.
    Johnson, Jeffrey C.
    PLOS ONE, 2018, 13 (06):
  • [8] Open-Ended Questions in Web Surveys
    Smyth, Jolene D.
    Dillman, Don A.
    Christian, Leah Melani
    Mcbride, Mallory
    PUBLIC OPINION QUARTERLY, 2009, 73 (02) : 325 - 337
  • [9] The Development of Open-Ended Questions Test to Measure Divergent Thinking Skills The Development of Open-Ended Questions Test
    Murwaningsih, Tri
    Fauziah, Muna
    ACM International Conference Proceeding Series, 2021,
  • [10] Machine Learning and Hebrew NLP for Automated Assessment of Open-Ended Questions in Biology
    Moriah Ariely
    Tanya Nazaretsky
    Giora Alexandron
    International Journal of Artificial Intelligence in Education, 2023, 33 : 1 - 34