Mitigating bias in deep learning: training unbiased models on biased data for the morphological classification of galaxies

被引:0
|
作者
Medina-Rosales, Esteban [1 ,2 ]
Cabrera-Vives, Guillermo [1 ,2 ,3 ,4 ]
Miller, Christopher J. [5 ]
机构
[1] Univ Concepcion, Dept Comp Sci, Edmundo Larenas 219, Concepcion 4070409, Chile
[2] Univ Concepcion, Data Sci Unit, Edmundo Larenas 310, Concepcion 4070415, Chile
[3] Millennium Nucleus Young Exoplanets & Moons YEMS, Concepcion, Chile
[4] Millenium Inst Astrophys MAS, Ave Vicuna Mackenna 4860, Santiago 8331150, Chile
[5] Univ Michigan, Dept Astron, 1085 S Univ, Ann Arbor, MI 48109 USA
关键词
methods: data analysis; methods: statistical; galaxies: statistics; STAR-FORMATION; DATA RELEASE; ZOO; CATALOG; CANDELS;
D O I
10.1093/mnras/stae1088
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
Galaxy morphologies and their relation with physical properties have been a relevant subject of study in the past. Most galaxy morphology catalogues have been labelled by human annotators or by machine learning models trained on human-labelled data. Human-generated labels have been shown to contain biases in terms of the observational properties of the data, such as image resolution. These biases are independent of the annotators, that is, are present even in catalogues labelled by experts. In this work, we demonstrate that training deep learning models on biased galaxy data produces biased models, meaning that the biases in the training data are transferred to the predictions of the new models. We also propose a method to train deep learning models that considers this inherent labelling bias, to obtain a de-biased model even when training on biased data. We show that models trained using our deep de-biasing method are capable of reducing the bias of human-labelled data sets.
引用
收藏
页码:52 / 60
页数:9
相关论文
共 50 条
  • [1] DATA AUGMENTATION IN TRAINING DEEP LEARNING MODELS FOR MALWARE FAMILY CLASSIFICATION
    Ding Yuxin
    Wang Guangbin
    Ma Yubin
    Ding Haoxuan
    PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2021, : 102 - 107
  • [2] Learning Not to Learn: Training Deep Neural Networks with Biased Data
    Kim, Byungju
    Kim, Hyunwoo
    Kim, Kyungsu
    Kim, Sungjin
    Kim, Junmo
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9004 - 9012
  • [3] Transfer learning for collaborative recommendation with biased and unbiased data
    Lin, Zinan
    Liu, Dugang
    Pan, Weike
    Yang, Qiang
    Ming, Zhong
    ARTIFICIAL INTELLIGENCE, 2023, 324
  • [4] Mitigating Bias in Clinical Machine Learning Models
    Julio C. Perez-Downes
    Andrew S. Tseng
    Keith A. McConn
    Sara M. Elattar
    Olayemi Sokumbi
    Ronnie A. Sebro
    Megan A. Allyse
    Bryan J. Dangott
    Rickey E. Carter
    Demilade Adedinsewo
    Current Treatment Options in Cardiovascular Medicine, 2024, 26 : 29 - 45
  • [5] Mitigating Bias in Clinical Machine Learning Models
    Perez-Downes, Julio C.
    Tseng, Andrew S.
    McConn, Keith A.
    Elattar, Sara M.
    Sokumbi, Olayemi
    Sebro, Ronnie A.
    Allyse, Megan A.
    Dangott, Bryan J.
    Carter, Rickey E.
    Adedinsewo, Demilade
    CURRENT TREATMENT OPTIONS IN CARDIOVASCULAR MEDICINE, 2024, 26 (03) : 29 - 45
  • [6] Unbiased training framework on deep reinforcement learning
    Zhang, Huihui
    COMPUTER JOURNAL, 2025,
  • [7] Learning unbiased classifiers from biased data with meta-learning
    Ragonesi, Ruggero
    Morerio, Pietro
    Murino, Vittorio
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW, 2023, : 1 - 9
  • [8] Unbiased Classification Through Bias-Contrastive and Bias-Balanced Learning
    Hong, Youngkyu
    Yang, Eunho
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [9] Data augmentation based morphological classification of galaxies using deep convolutional neural network
    Ansh Mittal
    Anu Soorya
    Preeti Nagrath
    D. Jude Hemanth
    Earth Science Informatics, 2020, 13 : 601 - 617
  • [10] Data augmentation based morphological classification of galaxies using deep convolutional neural network
    Mittal, Ansh
    Soorya, Anu
    Nagrath, Preeti
    Hemanth, D. Jude
    EARTH SCIENCE INFORMATICS, 2020, 13 (03) : 601 - 617