Analysis of the Impact of Sample Size, Attribute Variance and Within-Sample Choice Distribution on the Estimation Accuracy of Multinomial Logit Models Using Simulated Data

被引：0

作者：

Minhui Zeng

Ming Zhong

John Douglas Hunt

机构：

[1] Wuhan University of Technology,Engineering Research Center for Transportation safety of MOE

[2] Changsha University of Science & Technology,School of Traffic and Transportation Engineering

[3] National Engineering Research Center for Water Transportation Safety,Department of Civil and Environmental Engineering

[4] University of Waterloo,Department of Civil Engineering

[5] University of Calgary,undefined

来源：

Journal of Systems Science and Systems Engineering | 2018年 / 27卷

关键词：

Sample size; attribute variance; within-sample choice distribution; simulated data;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Literature review indicates that sample size, attribute variance and within-sample choice distribution of alternatives are important considerations in the estimation of multinomial logit (MNL) models, but their impacts on the estimation accuracy have not been systematically studied. Therefore, the objective of this paper is to provide an empirical examination to the above issues through a set of simulated discrete choice preference and rank ordered preference datasets. In this paper, the utility coefficients, alternative specific constants (ASCs), and the mean and standard deviation of the four attributes for a set of seven hypothetical alternatives are specified as a priori. Then, synthetic datasets, with varying sample size, attribute variance and within-sample choice distribution are simulated. Based on these datasets, the utility coefficients and ASCs of the specified MNLs are re-estimated and compared with the original values specified as the priori. It is found that (1) the estimation accuracy of utility parameters increases as the sample size increases; (2) the utility coefficients can be re-estimated with reasonable accuracy, but the estimates of the ASCs are confronted with much larger errors; (3) as the variances of the alternative attributes increase, the estimation accuracy improves significantly; and (4) as the distribution of chosen choices becomes more balanced across alternatives within sample datasets, the hit-ratio decreases. The results indicate that (a) under a similar setting presented in this paper, a large sample consisting of a few thousand observations (3000–4000) may be needed in order to provide reasonable estimates for utility coefficients, particularly for ASCs; (b) a larger, but realistic attribute space is preferred in the stated preference survey design; and (c) choice datasets with unbalanced “chosen” choice frequency distribution is preferred, in order to better capture the elasticity between the “perceived utility” associated with alternative’s attributes.

引用

页码：771 / 789

页数：18

共 9 条

[1] Analysis of the Impact of Sample Size, Attribute Variance and Within-Sample Choice Distribution on the Estimation Accuracy of Multinomial Logit Models Using Simulated Data
Zeng, Minhui
Zhong, Ming
Hunt, John Douglas
[J]. JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2018, 27 (06) : 771 - 789
[2] Effect of Within-Sample Choice Distribution and Sample Size on the Estimation Accuracy of Logit Model
Zeng, Minhui
Zhong, Ming
Hunt, John Douglas
[J]. 3RD INTERNATIONAL CONFERENCE ON TRANSPORTATION INFORMATION AND SAFETY (ICTIS 2015), 2015, : 304 - +
[3] ESTIMATION OF EFFECTIVE SAMPLE SIZE FOR CATCH-AT-AGE AND CATCH-AT-LENGTH DATA USING SIMULATED DATA FROM THE DIRICHLET-MULTINOMIAL DISTRIBUTION
Candy, S. G.
[J]. CCAMLR SCIENCE, 2008, 15 : 115 - 138
[4] Bayesian estimation of discrete choice models: a comparative analysis using effective sample size
Hawkins, Jason
Habib, Khandker Nurul
[J]. TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2022, 14 (10): : 1091 - 1099
[5] Sample size estimation for recurrent event data using multifrailty and multilevel survival models
Dinart, Derek
Bellera, Carine
Rondeau, Virginie
[J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2024,
[6] Model-based estimates of effective sample size in stock assessment models using the Dirichlet-multinomial distribution
Thorson, James T.
Johnson, Kelli F.
Methot, Richard D.
Taylor, Ian G.
[J]. FISHERIES RESEARCH, 2017, 192 : 84 - 93
[7] EFFECT SIZE ESTIMATION FOR ONE-SAMPLE MULTIPLE-CHOICE-TYPE DATA - DESIGN, ANALYSIS, AND META-ANALYSIS
ROSENTHAL, R
RUBIN, DB
[J]. PSYCHOLOGICAL BULLETIN, 1989, 106 (02) : 332 - 337
[8] EFFECT SIZE ESTIMATION FOR ONE-SAMPLE MULTIPLE-CHOICE-TYPE DATA - DESIGN, ANALYSIS, AND METAANALYSIS - COMMENT
SHAFFER, JP
[J]. PSYCHOLOGICAL BULLETIN, 1991, 109 (02) : 348 - 350
[9] Using an EM covariance matrix to estimate structural equation models with missing data: Choosing an adjusted sample size to improve the accuracy of inferences
Enders, CK
Peugh, JL
[J]. STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2004, 11 (01) : 1 - 19

← 1 →