Data Exclusion in Policy Survey and Questionnaire Data: Aberrant Responses and Missingness

被引:0
|
作者
Hong, Maxwell [1 ]
Carter, Matthew [1 ]
Kim, Casey [1 ]
Cheng, Ying [1 ,2 ]
机构
[1] Univ Notre Dame, Dept Psychol, Notre Dame, IN USA
[2] Univ Notre Dame, Dept Psychol, 390 Corbett Family Hall, Notre Dame, IN 46556 USA
基金
美国国家科学基金会;
关键词
data quality; data preprocessing; missing data; aberrant responses; open science; CARELESS RESPONSES; PERFORMANCE; REGRESSION;
D O I
10.1177/23727322221144650
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Data preprocessing is an integral step prior to analyzing data in psychological science, with implications for its potentially guiding policy. This article reports how psychological researchers address data preprocessing or quality concerns, with a focus on aberrant responses and missing data in self-report measures. 240 articles were sampled from four journals: Psychological Science, Journal of Personality and Social Psychology, Developmental Psychology, and Abnormal Psychology from 2012 to 2018. Nearly half of the studies did not report any missing data treatment (111/240; 46.25%), and if they did, the most common approach was listwise deletion (71/240; 29.6%). Studies that remove data due to missingness removed, on average, 12% of the sample. Likewise, most studies do not report any aberrant responses (194/240; 80%), but if they did, they classified 4% of the sample as suspect. Most studies are either not transparent enough about their data preprocessing steps or may be leveraging suboptimal procedures. Recommendations can improve transparency and data quality.
引用
收藏
页码:11 / 17
页数:7
相关论文
共 50 条
  • [1] SURVEY DATA AND SOCIAL POLICY
    PLATT, J
    [J]. BRITISH JOURNAL OF SOCIOLOGY, 1972, 23 (01): : 77 - 92
  • [2] THE ROLE OF MISSINGNESS IN DAILY DIARY DATA
    Yildiz, Mustafa
    Winstead, Vicki
    Pickering, Carolyn
    [J]. INNOVATION IN AGING, 2023, 7 : 604 - 604
  • [3] EFFECTS OF QUESTIONNAIRE DESIGN ON THE QUALITY OF SURVEY DATA
    SANCHEZ, ME
    [J]. PUBLIC OPINION QUARTERLY, 1992, 56 (02) : 206 - 217
  • [4] Learning from data with structured missingness
    Mitra, Robin
    McGough, Sarah F.
    Chakraborti, Tapabrata
    Holmes, Chris
    Copping, Ryan
    Hagenbuch, Niels
    Biedermann, Stefanie
    Noonan, Jack
    Lehmann, Brieuc
    Shenvi, Aditi
    Doan, Xuan Vinh
    Leslie, David
    Bianconi, Ginestra
    Sanchez-Garcia, Ruben
    Davies, Alisha
    Mackintosh, Maxine
    Andrinopoulou, Eleni-Rosalina
    Basiri, Anahid
    Harbron, Chris
    MacArthur, Ben D.
    [J]. NATURE MACHINE INTELLIGENCE, 2023, 5 (01) : 13 - 23
  • [5] Learning from data with structured missingness
    Robin Mitra
    Sarah F. McGough
    Tapabrata Chakraborti
    Chris Holmes
    Ryan Copping
    Niels Hagenbuch
    Stefanie Biedermann
    Jack Noonan
    Brieuc Lehmann
    Aditi Shenvi
    Xuan Vinh Doan
    David Leslie
    Ginestra Bianconi
    Ruben Sanchez-Garcia
    Alisha Davies
    Maxine Mackintosh
    Eleni-Rosalina Andrinopoulou
    Anahid Basiri
    Chris Harbron
    Ben D. MacArthur
    [J]. Nature Machine Intelligence, 2023, 5 : 13 - 23
  • [6] Time Series Analysis for Longitudinal Survey Data under Informative Sampling and Nonignorable Missingness
    Liu, Zhan
    Yau, Chun Yip
    [J]. REVSTAT-STATISTICAL JOURNAL, 2022, 20 (04) : 405 - 426
  • [7] SURVEY DATA FOR RURAL DEVELOPMENTAL POLICY
    TAYLOR, DG
    [J]. REVIEW OF PUBLIC DATA USE, 1980, 8 (03): : 225 - 235
  • [8] Use of survey data for energy policy
    Natarajan, I
    [J]. BIOMASS ENERGY: KEY ISSUES AND PRIORITY NEEDS, CONFERENCE PROCEEDINGS, 1997, : 89 - 93
  • [9] Identifying Careless Responses in Survey Data
    Meade, Adam W.
    Craig, S. Bartholomew
    [J]. PSYCHOLOGICAL METHODS, 2012, 17 (03) : 437 - 455
  • [10] Analyzing longitudinal clinical trial data with nonignorable missingness and unknown missingness reasons
    Xie, Hui
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (05) : 1287 - 1300