Code reviews in open source projects: how do gender biases affect participation and outcomes?

Cited by: 2
Authors
Sultana, Sayma [1]
Turzo, Asif Kamal [1]
Bosu, Amiangshu [1]
Affiliations
[1] Wayne State Univ, Dept Comp Sci, Detroit, MI 48202 USA
Funding
U.S. National Science Foundation (NSF)
Keywords
Code review; Diversity and inclusion; Pull requests; Gender bias; OPEN-SOURCE SOFTWARE; REGRESSION-MODELS; PREDICTION; DEVELOPERS; PATCHES;
DOI
10.1007/s10664-023-10324-9
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Context: Contemporary software development organizations lack diversity, and the ratios of women in Free and Open-Source Software (FOSS) communities are even lower than the industry average. Although the results of recent studies hint at the existence of biases against women, it is unclear to what extent such biases influence the outcomes of various software development tasks.
Objective: This study conceptually replicates two recent studies by Terrell et al. and Bosu and Sultana that investigated gender biases in FOSS communities. We aim to identify whether the outcomes of or participation in code reviews (or pull requests) are influenced by the gender of a developer. In particular, we focus on two outcome aspects (i.e., code acceptance and review interval) and one participation aspect (i.e., code review participation) of code review processes.
Method: With this goal, this study includes a total of 1010 FOSS projects. Ten out of those projects use Gerrit-based code reviews. The remaining 1000 are randomly selected from the GHTorrent dataset based on a stratified sampling of projects fitting certain criteria. We divided GitHub projects into four groups based on the number of distinct contributors. We developed six regression models for each of the 14 datasets (i.e., 10 Gerrit-based and 4 GitHub-based) to identify if code acceptance, review intervals, and code review participation differ based on the gender and gender-neutral profile of a developer.
Results: Our results find significant gender biases during code acceptance among 13 out of the 14 datasets, with seven favoring men and the remaining six favoring women. We found significant differences between men and women in terms of code review intervals, with women encountering longer delays than men in three cases and the opposite in seven. Our results indicate reviewer selection as one of the most gender-biased aspects, with 12 out of 14 datasets exhibiting bias. A total of 11 out of the 14 cases show women having significantly lower code review participation than their men colleagues. Since most of the review assignments are based on invitations, this result suggests possible affinity biases among the developers. We also noticed a significantly higher likelihood of women using gender-neutral profiles. Supporting Terrell et al.'s claim, women with gender-neutral profiles had higher odds of code acceptance than men among three Gerrit-based projects. However, contradicting their results, we found significantly lower odds of code acceptance for women with gender-neutral profiles across all four GitHub project groups.
Conclusions: Though gender bias exists among many projects, the direction and amplitude of that bias vary based on project size, community, and culture. Similar bias mitigation strategies may not work across all communities, as the characteristics of biases and their underlying causes differ. As women are less likely to be invited for reviews, FOSS projects should take initiatives to ensure the equitable selection of women as reviewers.
Pages: 43