Recognizing Sample-Selection Bias in Historical Data

Cited by: 2
Authors
Zimran, Ariell [1, 2]
Affiliations
[1] Vanderbilt Univ, 221 Kirkland Hall, Nashville, TN 37235 USA
[2] Natl Bur Econ Res, Cambridge, MA 02138 USA
Keywords
AMERICAN; HEIGHTS;
DOI
10.1017/ssh.2020.11
Chinese Library Classification
K [History, Geography];
Discipline classification code
06;
Abstract
Recent research has ignited a debate in social science history over whether and how to draw conclusions for whole populations from sources that describe only select subsets of these populations. The idiosyncratic availability and survival of historical sources create a threat of sample-selection bias, an error that arises when there are systematic differences between the observed sample and the population of interest. This danger is common in studying trends in health as measured by average stature: scholars can often observe these trends only for soldiers and other similar groups, but whether these patterns are representative of those of the broader population is unclear. This article illustrates that simple patterns in a potentially selected sample can be used to recognize the presence of sample-selection bias in a source, and to understand how such bias might affect conclusions drawn from this source. Applying this intuition to the use of military data to describe stature in the antebellum United States, I present several simple empirical exercises based on these patterns. Finally, I use the results of these exercises to describe how sample-selection bias might affect the use of these data in testing for differences in average stature between the Northeast and the Midwest.
Pages: 525-554
Page count: 30
Related papers
50 in total
  • [21] HOUSEHOLD ALCOHOL AND TOBACCO EXPENDITURES IN TURKEY: A SAMPLE-SELECTION SYSTEM APPROACH
    Bilgic, Abdulbaki
    Yen, Steven T.
    [J]. CONTEMPORARY ECONOMIC POLICY, 2015, 33 (03) : 571 - 585
  • [22] Testing for sample-selection bias due to location in the labour-market behaviour of respondents from the British Household Panel Survey
    Crouchley, R
    Oskrochi, G
    [J]. ENVIRONMENT AND PLANNING A-ECONOMY AND SPACE, 2001, 33 (11) : 1963 - 1984
  • [23] Learning With Imbalanced Noisy Data by Preventing Bias in Sample Selection
    Liu, Huafeng
    Sheng, Mengmeng
    Sun, Zeren
    Yao, Yazhou
    Hua, Xian-Sheng
    Shen, Heng-Tao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7426 - 7437
  • [24] Monotonicity conditions and inequality imputation for sample-selection and non-response problems
    Lee, MJ
    [J]. ECONOMETRIC REVIEWS, 2005, 24 (02) : 175 - 194
  • [25] A multivariate sample-selection model: Estimating cigarette and alcohol demands with zero observations
    Yen, ST
    [J]. AMERICAN JOURNAL OF AGRICULTURAL ECONOMICS, 2005, 87 (02) : 453 - 466
  • [26] TESTING FOR SAMPLE SELECTION BIAS
    MELINO, A
    [J]. REVIEW OF ECONOMIC STUDIES, 1982, 49 (01) : 151 - 153
  • [27] MODELS FOR SAMPLE SELECTION BIAS
    WINSHIP, C
    MARE, RD
    [J]. ANNUAL REVIEW OF SOCIOLOGY, 1992, 18 : 327 - 350
  • [28] Copula-based maximum-likelihood estimation of sample-selection models
    Hasebe, Takuya
    [J]. STATA JOURNAL, 2013, 13 (03) : 547 - 573
  • [29] Correcting Sample Selection Bias of Historical Digital Trace Data: Inverse Probability Weighting (IPW) and Type II Tobit Model
    Pak, Chankyung
    Cotter, Kelley
    Thorson, Kjerstin
    [J]. COMMUNICATION METHODS AND MEASURES, 2022, 16 (02) : 134 - 155
  • [30] Sample-Selection Method for Arbitrary Fading Emulation Using Mode-Stirred Chambers
    Sanchez-Heredia, Juan D.
    Gruden, Mathias
    Valenzuela-Valdes, Juan F.
    Sanchez-Hernandez, David A.
    [J]. IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2010, 9 : 409 - 412