Teacher Bias and Evaluation Differences in Test Scores: Different Methods for Different Questions

被引:0
|
作者
Delaney, Judith M. [1 ,2 ,3 ]
Devereux, Paul J. [3 ,4 ,5 ,6 ,7 ]
机构
[1] Univ Bath, Bath, England
[2] UCL, London, England
[3] Inst Labor Econ IZA, Bonn, Germany
[4] Univ Coll Dublin, Sch Econ, Dublin, Ireland
[5] Univ Coll Dublin, Geary Inst, Dublin, Ireland
[6] CEPR, London, England
[7] NHH, Bergen, Norway
关键词
GENDER GAPS; DISCRIMINATION; ASSESSMENTS; OUTCOMES; IMPACTS; ABILITY; WOMEN;
D O I
10.1111/obes.12657
中图分类号
F [经济];
学科分类号
02 ;
摘要
We study differences in teacher evaluations of student performance relative to those measured by test scores. While much literature is concerned with estimating various types of teacher biases, we show conceptually that there is no single 'teacher bias' effect. Even if teachers have no group bias, teacher evaluation differences by group masystematically deviate from test score differences if the distribution of test scores differs across groups. Commonly used approaches are not equivalent and can lead to different conclusions as they target different estimands. We demonstrate our findings using Monte Carlo simulations and, using two recent UK cohort surveys, we show that these conceptual issues matter in practice when we evaluate whether teachers are likely to over-estimate female performance in English. Finally, we use the methods to examine an issue of substantive importance, gender differences in teacher perceptions in comparative advantage in English relative to mathematics. Our findings suggest that it is unlikely that teacher misperceptions of comparative advantage by gender are an important cause of the gender gap in STEM.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Evaluation of the Different Test Methods of the Concrete Durability for the Persian Gulf Environment
    Hedayat, Amir Ahmad
    Baniasadizade, Maryam
    ADVANCES IN STRUCTURAL ENGINEERING, 2015, 18 (10) : 1575 - 1586
  • [32] Testing the durability of timber above ground: evaluation of different test methods
    Meyer-Veltrup, Linda
    Brischke, Christian
    Kallander, Bjorn
    EUROPEAN JOURNAL OF WOOD AND WOOD PRODUCTS, 2017, 75 (03) : 291 - 304
  • [33] Cable shielding test methods - A comparison of different test methods
    Mueller, Joachim
    2007 IEEE INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY: WORKSHOP AND TUTORIAL NOTES, VOLS 1-3, 2007, : 1178 - 1183
  • [34] BIOLOGICAL EVALUATION OF DENTAL RESTORATIVE MATERIALS - A COMPARISON OF DIFFERENT TEST METHODS
    WENNBERG, A
    MJOR, IA
    HENSTENPETTERSEN, A
    JOURNAL OF BIOMEDICAL MATERIALS RESEARCH, 1983, 17 (01): : 23 - 36
  • [35] Evaluation of the textural properties of melon flesh by different texture test methods
    Liu L.
    Gao X.
    Hua D.
    Liu X.
    Li Z.
    Zhang P.
    Li S.
    Zhang S.
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2016, 49 (08): : 875 - 881
  • [36] Theoretical basis and computational methods for different test-day genetic evaluation methods
    Swalve, HH
    JOURNAL OF DAIRY SCIENCE, 2000, 83 (05) : 1115 - 1124
  • [37] Comparing different explanations of the effect of test anxiety on respondents' test scores
    Sommer, Markus
    Arendasy, Martin E.
    INTELLIGENCE, 2014, 42 : 115 - 127
  • [38] The probability of obtaining two statistically different test scores as a test index
    Muller, Jorg M.
    EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2006, 66 (04) : 601 - 611
  • [39] Effects of different calibration schedules on the test-retest differences of nasalance scores obtained with the nasometer 6450
    Hahm, Jennifer
    Bressmann, Tim
    CLINICAL LINGUISTICS & PHONETICS, 2022, 36 (2-3) : 292 - 300
  • [40] EVALUATION OF BIOAVAILABILITY BY DIFFERENT METHODS
    RITSCHEL, WA
    HUSSAIN, SA
    SCHNEIDER, B
    BETZIEN, G
    KAUFMANN, B
    METHODS AND FINDINGS IN EXPERIMENTAL AND CLINICAL PHARMACOLOGY, 1985, 7 (08): : 439 - 449