Teacher Bias and Evaluation Differences in Test Scores: Different Methods for Different Questions

被引:0
|
作者
Delaney, Judith M. [1 ,2 ,3 ]
Devereux, Paul J. [3 ,4 ,5 ,6 ,7 ]
机构
[1] Univ Bath, Bath, England
[2] UCL, London, England
[3] Inst Labor Econ IZA, Bonn, Germany
[4] Univ Coll Dublin, Sch Econ, Dublin, Ireland
[5] Univ Coll Dublin, Geary Inst, Dublin, Ireland
[6] CEPR, London, England
[7] NHH, Bergen, Norway
关键词
GENDER GAPS; DISCRIMINATION; ASSESSMENTS; OUTCOMES; IMPACTS; ABILITY; WOMEN;
D O I
10.1111/obes.12657
中图分类号
F [经济];
学科分类号
02 ;
摘要
We study differences in teacher evaluations of student performance relative to those measured by test scores. While much literature is concerned with estimating various types of teacher biases, we show conceptually that there is no single 'teacher bias' effect. Even if teachers have no group bias, teacher evaluation differences by group masystematically deviate from test score differences if the distribution of test scores differs across groups. Commonly used approaches are not equivalent and can lead to different conclusions as they target different estimands. We demonstrate our findings using Monte Carlo simulations and, using two recent UK cohort surveys, we show that these conceptual issues matter in practice when we evaluate whether teachers are likely to over-estimate female performance in English. Finally, we use the methods to examine an issue of substantive importance, gender differences in teacher perceptions in comparative advantage in English relative to mathematics. Our findings suggest that it is unlikely that teacher misperceptions of comparative advantage by gender are an important cause of the gender gap in STEM.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] TEST FOR EXPERIMENTER BIAS WITH 2 DIFFERENT EXPERIMENTAL TASKS
    SHAMES, M
    MCGINLEY, H
    MCGINLEY, P
    CANADIAN PSYCHOLOGIST-PSYCHOLOGIE CANADIENNE, 1970, 11 (02): : 201 - &
  • [42] Methods to map protein interactions in mammalian cells: different tools to address different questions
    Eyckerman, S
    Tavernier, J
    EUROPEAN CYTOKINE NETWORK, 2002, 13 (03) : 276 - 284
  • [43] Evaluation of Six Different Soil Test Phosphorus Extraction Methods for Relationship with Cranberry
    Davenport, J. R.
    DeMoranville, C.
    Roper, T.
    IX INTERNATIONAL VACCINIUM SYMPOSIUM, 2009, 810 : 627 - 632
  • [44] Mortars with the addition of bacterial spores: Evaluation of porosity using different test methods
    Schwantes-Cezario, Nicole
    Ferreira Nogueira Camargo, Geovana Souza
    do Couto, Alisson Franco
    Porto, Maria Fernanda
    Cremasco, Lucca Vieira
    Andrello, Avacir Casanova
    Toralles, Berenice Martins
    JOURNAL OF BUILDING ENGINEERING, 2020, 30 (30)
  • [45] LACTOSE LOADING - A SIMPLE TEST FOR DETECTING INTESTINAL LACTASE EVALUATION OF DIFFERENT METHODS
    DESAI, HG
    CHITRE, AV
    JEEJEEB.KN
    GASTROENTEROLOGIA, 1967, 108 (04): : 177 - +
  • [46] A COMPARATIVE EVALUATION OF THE SENSITIVITY OF THE LE CELL TEST PERFORMED SIMULTANEOUSLY BY DIFFERENT METHODS
    DUBOIS, EL
    FREEMAN, V
    BLOOD, 1957, 12 (07) : 657 - 670
  • [47] Comparative evaluation of hydrogen peroxide sporicidal efficacy by different standard test methods
    Sadeghi, Simin
    Abdollahi, Soosan
    Tarighi, Parastoo
    Samadi, Nasrin
    IRANIAN JOURNAL OF MICROBIOLOGY, 2020, 12 (02) : 113 - 120
  • [48] Bias and precision of different sampling methods for GPS positions
    Arnaud, M
    Flori, A
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 1998, 64 (06): : 597 - 600
  • [49] Evaluation of the Generalizability of the Number of Abnormal Scores and the Overall Test Battery Mean as Measures of Performance Validity to a Different Test Battery
    Silk-Eglit, Graham M.
    Miele, Andrea S.
    Stenclik, Jessica H.
    Lynch, Julie K.
    McCaffrey, Robert J.
    APPLIED NEUROPSYCHOLOGY-ADULT, 2015, 22 (06) : 399 - 406
  • [50] Linking scores derived under different modes of test administration
    Eignor, Daniel R.
    Linking and Aligning Scores and Scales, 2007, : 135 - 159