Understanding gender differences in professional European football through machine learning interpretability and match actions data

被引:19
|
作者
Garnica-Caparros, Marc [1 ]
Memmert, Daniel [1 ]
机构
[1] German Sport Univ Cologne, Inst Training & Comp Sci Sport, Sportpk Mungersdorf 6, D-50933 Cologne, Germany
关键词
SOCCER; PERFORMANCE; FEMALE;
D O I
10.1038/s41598-021-90264-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
After the great success of the Women's World Cup in 2019, several platforms have started identifying the reasons for gender inequality in European football. Even though these inequalities emerge from a variety of key aspects in the modern sport, we focused on the game and evaluated the main differential features of European male and female football players in match actions data under the assumption of finding significant differences and established patterns between genders. A methodology for unbiased feature extraction and objective analysis is presented based on data integration and machine learning explainability algorithms. Female (n0=1511) and male (n1=2703) data points were collected from event data and categorized by game period and player position. We set up a supervised classification pipeline to predict the gender of each player by looking at their actions in the game. The comparison methodology did not include any qualitative enrichment or subjective analysis to prevent biased data enhancement or gender-related processing. The pipeline included three representative binary classification models; A logic-based Decision Trees, a probabilistic Logistic Regression and a multilevel perceptron Neural Network. Each model tried to draw the differences between male and female data points, and we extracted the results using machine learning explainability methods to understand the underlying mechanics of the models implemented. The study was able to determine pivotal factors that differentiate each gender performance as well as disseminate unique patterns by gender involving more than one indicator. Data enhancement and critical variables analysis are essential next steps to support this framework and serve as a baseline for further studies and training developments.
引用
收藏
页数:14
相关论文
共 26 条
  • [1] Understanding gender differences in professional European football through machine learning interpretability and match actions data
    Marc Garnica-Caparrós
    Daniel Memmert
    [J]. Scientific Reports, 11
  • [2] Interpreting Interpretability: Understanding Data Scientists' Use of Interpretability Tools for Machine Learning
    Kaur, Harmanpreet
    Nori, Harsha
    Jenkins, Samuel
    Caruana, Rich
    Wallach, Hanna
    Vaughan, Jennifer Wortman
    [J]. PROCEEDINGS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'20), 2020,
  • [3] Factors associated with match outcomes in elite European football - insights from machine learning models
    Settembre, Maxime
    Buchheit, Martin
    Hader, Karim
    Hamill, Ray
    Tarascon, Adrien
    Verheijen, Raymond
    McHugh, Derek
    [J]. JOURNAL OF SPORTS ANALYTICS, 2024, 10 (01) : 1 - 16
  • [4] Understanding Delegation Through Machine Learning: A Method and Application to the European Union
    Anastasopoulos, L. Jason
    Bertelli, Anthony M.
    [J]. AMERICAN POLITICAL SCIENCE REVIEW, 2020, 114 (01) : 291 - 301
  • [5] Predicting transfer fees in professional European football before and during COVID-19 using machine learning
    Yang, Yanxiang
    Koenigstorfer, Joerg
    Pawlowski, Tim
    [J]. EUROPEAN SPORT MANAGEMENT QUARTERLY, 2024, 24 (03) : 603 - 623
  • [6] Understanding destination brand experience through data mining and machine learning
    Calderon-Fajardo, Victor
    Anaya-Sanchez, Rafael
    Molinillo, Sebastian
    [J]. JOURNAL OF DESTINATION MARKETING & MANAGEMENT, 2024, 31
  • [7] Cricket data analytics: Forecasting T20 match winners through machine learning
    Chakraborty, Sanjay
    Mondal, Arnab
    Bhattacharjee, Aritra
    Mallick, Ankush
    Santra, Riju
    Maity, Saikat
    Dey, Lopamudra
    [J]. INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2024, 28 (01) : 73 - 92
  • [8] Understanding Gender Biases and Differences in Web-Based Reviews of Sanctioned Physicians Through a Machine Learning Approach: Mixed Methods Study
    Barnett, Julia
    Bjarnadottir, Margret Vilborg
    Anderson, David
    Chen, Chong
    [J]. JMIR FORMATIVE RESEARCH, 2022, 6 (09)
  • [9] Use of LinkedIn Data and Machine Learning to Analyze Gender Differences in Construction Career Paths
    Hickey, Paul J.
    Erfani, Abdolmajid
    Cui, Qingbin
    [J]. JOURNAL OF MANAGEMENT IN ENGINEERING, 2022, 38 (06)
  • [10] Gender Identification Through Facebook Data Analysis Using Machine Learning Techniques
    Kiratsa, P. I.
    Sidiropoulos, G. K.
    Badeka, E. V.
    Papadopoulou, C. I.
    Nikolaou, A. P.
    Papakostas, G. A.
    [J]. 22ND PAN-HELLENIC CONFERENCE ON INFORMATICS (PCI 2018), 2018, : 117 - 120