Data Exclusion in Policy Survey and Questionnaire Data: Aberrant Responses and Missingness

被引:0
|
作者
Hong, Maxwell [1 ]
Carter, Matthew [1 ]
Kim, Casey [1 ]
Cheng, Ying [1 ,2 ]
机构
[1] Univ Notre Dame, Dept Psychol, Notre Dame, IN USA
[2] Univ Notre Dame, Dept Psychol, 390 Corbett Family Hall, Notre Dame, IN 46556 USA
基金
美国国家科学基金会;
关键词
data quality; data preprocessing; missing data; aberrant responses; open science; CARELESS RESPONSES; PERFORMANCE; REGRESSION;
D O I
10.1177/23727322221144650
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Data preprocessing is an integral step prior to analyzing data in psychological science, with implications for its potentially guiding policy. This article reports how psychological researchers address data preprocessing or quality concerns, with a focus on aberrant responses and missing data in self-report measures. 240 articles were sampled from four journals: Psychological Science, Journal of Personality and Social Psychology, Developmental Psychology, and Abnormal Psychology from 2012 to 2018. Nearly half of the studies did not report any missing data treatment (111/240; 46.25%), and if they did, the most common approach was listwise deletion (71/240; 29.6%). Studies that remove data due to missingness removed, on average, 12% of the sample. Likewise, most studies do not report any aberrant responses (194/240; 80%), but if they did, they classified 4% of the sample as suspect. Most studies are either not transparent enough about their data preprocessing steps or may be leveraging suboptimal procedures. Recommendations can improve transparency and data quality.
引用
收藏
页码:11 / 17
页数:7
相关论文
共 50 条
  • [31] A Conference (Missingness in Action) to Address Missingness in Data and AI in Health Care: Qualitative Thematic Analysis
    Rose, Christian
    Barber, Rachel
    Preiksaitis, Carl
    Kim, Ireh
    Mishra, Nikesh
    Kayser, Kristen
    Brown, Italo
    Gisondi, Michael
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [32] Migration of Lebanese nurses: A questionnaire survey and secondary data analysis
    El-Jardali, Fadi
    Dumit, Nuhad
    Jamal, Diana
    Mouro, Gladys
    INTERNATIONAL JOURNAL OF NURSING STUDIES, 2008, 45 (10) : 1490 - 1500
  • [33] Disentangling Multidimensional Spatio-Temporal Data into Their Common and Aberrant Responses
    Chang, Young Hwan
    Korkola, James
    Amin, Dhara N.
    Moasser, Mark M.
    Carmena, Jose M.
    Gray, Joe W.
    Tomlin, Claire J.
    PLOS ONE, 2015, 10 (04):
  • [34] The effect of sample size and missingness on inference with missing data
    Morimoto, Julian
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2024, 53 (09) : 3292 - 3311
  • [35] Embedding for Informative Missingness: Deep Learning With Incomplete Data
    Ghorbani, Amirata
    Zou, James Y.
    2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 437 - 445
  • [36] Missing data methods for arbitrary missingness with small samples
    McNeish, Daniel
    JOURNAL OF APPLIED STATISTICS, 2017, 44 (01) : 24 - 39
  • [37] Imputation techniques for multivariate missingness in software measurement data
    Khoshgoftaar, Taghi M.
    Van Hulse, Jason
    SOFTWARE QUALITY JOURNAL, 2008, 16 (04) : 563 - 600
  • [38] A LATENT FACTOR MODEL FOR SPATIAL DATA WITH INFORMATIVE MISSINGNESS
    Reich, Brian J.
    Bandyopadhyay, Dipankar
    ANNALS OF APPLIED STATISTICS, 2010, 4 (01): : 439 - 459
  • [39] A COMPARISON OF ANALYTIC METHODS FOR NONRANDOM MISSINGNESS OF OUTCOME DATA
    CRAWFORD, SL
    TENNSTEDT, SL
    MCKINLAY, JB
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 1995, 48 (02) : 209 - 219
  • [40] A classification and discrimination integrated strategy conducted on symbolic data for missing data treatment in questionnaire survey
    Grassia, MG
    New Developments in Classification and Data Analysis, 2005, : 47 - 54