Biases in machine-learning models of human single-cell data

Authors
Theresa Willem [1]
Vladimir A. Shitov [2]
Malte D. Luecken [3]
Niki Kilbertus [4]
Stefan Bauer [3]
Marie Piraud [4]
Alena Buyx [2]
Fabian J. Theis [5]
Affiliations
[1] Technical University of Munich, TUM School for Medicine and Health, Institute of History and Ethics in Medicine
[2] Helmholtz Munich, Department of Computational Health, Institute of Computational Biology
[3] Helmholtz Munich, Comprehensive Pneumology Center (CPC) with the CPC
[4] Helmholtz Munich; Member of the German Center for Lung Research (DZL), M bioArchive and Institute of Lung Health and Immunity (LHI)
[5] Technical University of Munich, School for Computation, Information and Technology
[6] Munich Center for Machine Learning (MCML), School of Life Sciences
[7] Technical University of Munich
DOI
10.1038/s41556-025-01619-8
Abstract
Recent machine-learning (ML)-based advances in single-cell data science have enabled the stratification of human tissue donors at single-cell resolution, promising to provide valuable diagnostic and prognostic insights. However, such insights are susceptible to biases. Here we discuss various biases that emerge along the pipeline of ML-based single-cell analysis, ranging from societal biases affecting whose samples are collected, to clinical and cohort biases that influence the generalizability of single-cell datasets, biases stemming from single-cell sequencing, ML biases specific to (weakly supervised or unsupervised) ML models trained on human single-cell samples and biases during the interpretation of results from ML models. We end by providing methods for single-cell data scientists to assess and mitigate biases, and call for efforts to address the root causes of biases.
Pages: 384-392
Number of pages: 8
Related papers
50 in total
  • [1] Editorial: Machine Learning and Mathematical Models for Single-Cell Data Analysis
    Ou-Yang, Le
    Zhang, Xiao-Fei
    Zhang, Jiajun
    Chen, Jin
    Wu, Min
    FRONTIERS IN GENETICS, 2022, 13
  • [2] The Trifecta of Single-Cell, Systems-Biology, and Machine-Learning Approaches
    Weiskittel, Taylor M.
    Correia, Cristina
    Yu, Grace T.
    Ung, Choong Yong
    Kaufmann, Scott H.
    Billadeau, Daniel D.
    Li, Hu
    GENES, 2021, 12 (07)
  • [3] New interpretable machine-learning method for single-cell data reveals correlates of clinical response to cancer immunotherapy
    Greene, Evan
    Finak, Greg
    D'Amico, Leonard A.
    Bhardwaj, Nina
    Church, Candice D.
    Morishima, Chihiro
    Ramchurren, Nirasha
    Taube, Janis M.
    Nghiem, Paul T.
    Cheever, Martin A.
    Fling, Steven P.
    Gottardo, Raphael
    PATTERNS, 2021, 2 (12)
  • [4] Machine Learning Approaches to Single-Cell Data Integration and Translation
    Uhler, Caroline
    Shivashankar, G. V.
    PROCEEDINGS OF THE IEEE, 2022, 110 (05) : 557 - 576
  • [5] Single-Cell Data Analytics in ScOrange (General Machine Learning)
    Strazar, Martin
    Zagar, Lan
    Kokosar, Jaka
    Tanko, Vesna
    Policar, Pavlin
    Erjavec, Ales
    Pretnar, Ajda
    Staric, Anze
    Menon, Vilas
    Chen, Rui
    Shaulsky, Gad
    Lemire, Andrew
    Parikh, Anup
    Zupan, Blaz
    ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2019, 2019, 11526 : 425 - 426
  • [6] Fairness in the Eyes of the Data: Certifying Machine-Learning Models
    Segal, Shahar
    Adi, Yossi
    Pinkas, Benny
    Baum, Carsten
    Ganesh, Chaya
    Keshet, Joseph
    AIES '21: PROCEEDINGS OF THE 2021 AAAI/ACM CONFERENCE ON AI, ETHICS, AND SOCIETY, 2021, : 926 - 935
  • [7] Single-cell specific and interpretable machine learning models for sparse scChIP-seq data imputation
    Albrecht, Steffen
    Andreani, Tommaso
    Andrade-Navarro, Miguel A.
    Fontaine, Jean Fred
    PLOS ONE, 2022, 17 (07)
  • [8] Statistical and machine learning methods for immunoprofiling based on single-cell data
    Zhang, Jingxuan
    Li, Jia
    Lin, Lin
    HUMAN VACCINES & IMMUNOTHERAPEUTICS, 2023, 19 (02)
  • [9] Certified Machine-Learning Models
    Damiani, Ernesto
    Ardagna, Claudio A.
    SOFSEM 2020: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2020, 12011 : 3 - 15
  • [10] Machine learning for perturbational single-cell omics
    Ji, Yuge
    Lotfollahi, Mohammad
    Wolf, F. Alexander
    Theis, Fabian J.
    CELL SYSTEMS, 2021, 12 (06) : 522 - 537