The comparison data forest: A new comparison data approach to determine the number of factors in exploratory factor analysis

被引:2
|
作者
Goretzko, David [1 ,2 ]
Ruscio, John [3 ]
机构
[1] Ludwig Maximilians Univ Munchen, Dept Psychol, Munich, Germany
[2] Univ Utrecht, Dept Methodol & Stat, Padualaan 14, NL-3584 CH Utrecht, Netherlands
[3] Coll New Jersey, Ewing, NJ USA
关键词
Exploratory factor analysis; Comparison data; Factor retention; Number of factors; Factor forest; Machine learning; COMPONENTS;
D O I
10.3758/s13428-023-02122-4
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
Developing psychological assessment instruments often involves exploratory factor analyses, during which one must determine the number of factors to retain. Several factor-retention criteria have emerged that can infer this number from empirical data. Most recently, simulation-based procedures like the comparison data approach have shown the most accurate estimation of dimensionality. The factor forest, an approach combining extensive data simulation and machine learning modeling, showed even higher accuracy across various common data conditions. Because this approach is very computationally costly, we combine the factor forest and the comparison data approach to present the comparison data forest. In an evaluation study, we compared this new method with the common comparison data approach and identified optimal parameter settings for both methods given various data conditions. The new comparison data forest approach achieved slightly higher overall accuracy, though there were some important differences under certain data conditions. The CD approach tended to underfactor and the CDF tended to overfactor, and their results were also complementary in that for the 81.7% of instances when they identified the same number of factors, these results were correct 96.6% of the time.
引用
收藏
页码:1838 / 1851
页数:14
相关论文
共 50 条
  • [41] One Model to Rule Them All? Using Machine Learning Algorithms to Determine the Number of Factors in Exploratory Factor Analysis
    Goretzko, David
    Buhner, Markus
    PSYCHOLOGICAL METHODS, 2020, 25 (06) : 776 - 786
  • [42] A COMPARISON OF VARIMAX, QUARTIMAX, AND GRAPHICAL ROTATIONS OF FACTORS FOR DATA HAVING A GENERAL FACTOR
    MICHAEL, WB
    TENOPYR, ML
    AMERICAN PSYCHOLOGIST, 1963, 18 (07) : 449 - 449
  • [43] Comparison of data mining methods and logistic regression to determine factors associated with death following injury
    Penny, Kay
    Chesney, Thomas
    DATA ANALYSIS, CLASSIFICATION AND THE FORWARD SEARCH, 2006, : 417 - +
  • [44] CONFIRMATORY FACTOR-ANALYSIS WITH PRETERM NBAS DATA - A COMPARISON OF 4 DATA REDUCTION MODELS
    AZUMA, SD
    MALEE, KM
    KAVANAGH, JA
    DEDDISH, RB
    INFANT BEHAVIOR & DEVELOPMENT, 1991, 14 (02): : 209 - 225
  • [45] A comparison of generalised procrustes analysis and multiple factor analysis for projective mapping data
    Tomic, O.
    Berget, I.
    Naes, T.
    FOOD QUALITY AND PREFERENCE, 2015, 43 : 34 - 46
  • [46] EXPLORATORY DATA-ANALYSIS - A COMPARISON OF STATISTICAL-METHODS WITH ARTIFICIAL NEURAL NETWORKS
    JOSEPH, B
    WANG, FH
    SHIEH, DSS
    COMPUTERS & CHEMICAL ENGINEERING, 1992, 16 (04) : 413 - 423
  • [47] Correlation–Comparison Analysis as a New Way of Data-Mining: Application to Neural Data
    Grbatinić I.
    Krstonošić B.
    Srebro D.
    Purić N.
    Dubak M.
    Dušanić V.
    Kostić V.
    Milošević N.
    SN Computer Science, 4 (5)
  • [48] Exploratory Factor Analysis With Small Samples and Missing Data
    McNeish, Daniel
    JOURNAL OF PERSONALITY ASSESSMENT, 2017, 99 (06) : 637 - 652
  • [49] Exploratory tobit factor analysis for multivariate censored data
    Kamakura, WA
    Wedel, M
    MULTIVARIATE BEHAVIORAL RESEARCH, 2001, 36 (01) : 53 - 82
  • [50] New models for the analysis of paired comparison data;: segmentation of the panel
    Qannari, E
    Courcoux, P
    Séménou, M
    FOOD QUALITY AND PREFERENCE, 2000, 11 (1-2) : 71 - 76