Analysis of the Impact of Sample Size, Attribute Variance and Within-Sample Choice Distribution on the Estimation Accuracy of Multinomial Logit Models Using Simulated Data

被引:0
|
作者
Minhui Zeng
Ming Zhong
John Douglas Hunt
机构
[1] Wuhan University of Technology,Engineering Research Center for Transportation safety of MOE
[2] Changsha University of Science & Technology,School of Traffic and Transportation Engineering
[3] National Engineering Research Center for Water Transportation Safety,Department of Civil and Environmental Engineering
[4] University of Waterloo,Department of Civil Engineering
[5] University of Calgary,undefined
关键词
Sample size; attribute variance; within-sample choice distribution; simulated data;
D O I
暂无
中图分类号
学科分类号
摘要
Literature review indicates that sample size, attribute variance and within-sample choice distribution of alternatives are important considerations in the estimation of multinomial logit (MNL) models, but their impacts on the estimation accuracy have not been systematically studied. Therefore, the objective of this paper is to provide an empirical examination to the above issues through a set of simulated discrete choice preference and rank ordered preference datasets. In this paper, the utility coefficients, alternative specific constants (ASCs), and the mean and standard deviation of the four attributes for a set of seven hypothetical alternatives are specified as a priori. Then, synthetic datasets, with varying sample size, attribute variance and within-sample choice distribution are simulated. Based on these datasets, the utility coefficients and ASCs of the specified MNLs are re-estimated and compared with the original values specified as the priori. It is found that (1) the estimation accuracy of utility parameters increases as the sample size increases; (2) the utility coefficients can be re-estimated with reasonable accuracy, but the estimates of the ASCs are confronted with much larger errors; (3) as the variances of the alternative attributes increase, the estimation accuracy improves significantly; and (4) as the distribution of chosen choices becomes more balanced across alternatives within sample datasets, the hit-ratio decreases. The results indicate that (a) under a similar setting presented in this paper, a large sample consisting of a few thousand observations (3000–4000) may be needed in order to provide reasonable estimates for utility coefficients, particularly for ASCs; (b) a larger, but realistic attribute space is preferred in the stated preference survey design; and (c) choice datasets with unbalanced “chosen” choice frequency distribution is preferred, in order to better capture the elasticity between the “perceived utility” associated with alternative’s attributes.
引用
收藏
页码:771 / 789
页数:18
相关论文
共 9 条
  • [1] Analysis of the Impact of Sample Size, Attribute Variance and Within-Sample Choice Distribution on the Estimation Accuracy of Multinomial Logit Models Using Simulated Data
    Zeng, Minhui
    Zhong, Ming
    Hunt, John Douglas
    [J]. JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2018, 27 (06) : 771 - 789
  • [2] Effect of Within-Sample Choice Distribution and Sample Size on the Estimation Accuracy of Logit Model
    Zeng, Minhui
    Zhong, Ming
    Hunt, John Douglas
    [J]. 3RD INTERNATIONAL CONFERENCE ON TRANSPORTATION INFORMATION AND SAFETY (ICTIS 2015), 2015, : 304 - +
  • [3] ESTIMATION OF EFFECTIVE SAMPLE SIZE FOR CATCH-AT-AGE AND CATCH-AT-LENGTH DATA USING SIMULATED DATA FROM THE DIRICHLET-MULTINOMIAL DISTRIBUTION
    Candy, S. G.
    [J]. CCAMLR SCIENCE, 2008, 15 : 115 - 138
  • [4] Bayesian estimation of discrete choice models: a comparative analysis using effective sample size
    Hawkins, Jason
    Habib, Khandker Nurul
    [J]. TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2022, 14 (10): : 1091 - 1099
  • [5] Sample size estimation for recurrent event data using multifrailty and multilevel survival models
    Dinart, Derek
    Bellera, Carine
    Rondeau, Virginie
    [J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2024,
  • [6] Model-based estimates of effective sample size in stock assessment models using the Dirichlet-multinomial distribution
    Thorson, James T.
    Johnson, Kelli F.
    Methot, Richard D.
    Taylor, Ian G.
    [J]. FISHERIES RESEARCH, 2017, 192 : 84 - 93
  • [7] EFFECT SIZE ESTIMATION FOR ONE-SAMPLE MULTIPLE-CHOICE-TYPE DATA - DESIGN, ANALYSIS, AND META-ANALYSIS
    ROSENTHAL, R
    RUBIN, DB
    [J]. PSYCHOLOGICAL BULLETIN, 1989, 106 (02) : 332 - 337
  • [9] Using an EM covariance matrix to estimate structural equation models with missing data: Choosing an adjusted sample size to improve the accuracy of inferences
    Enders, CK
    Peugh, JL
    [J]. STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2004, 11 (01) : 1 - 19