Learning representations for image-based profiling of perturbations

被引:7
|
作者
Moshkov, Nikita [1 ]
Bornholdt, Michael [2 ]
Benoit, Santiago [2 ,3 ]
Smith, Matthew [2 ,4 ]
Mcquin, Claire [2 ]
Goodman, Allen [2 ]
Senft, Rebecca A. [2 ]
Han, Yu [2 ]
Babadi, Mehrtash [2 ]
Horvath, Peter [1 ]
Cimini, Beth A. [2 ]
Carpenter, Anne E. [2 ]
Singh, Shantanu [2 ]
Caicedo, Juan C. [2 ,5 ,6 ]
机构
[1] HUN REN Biol Res Ctr, 62 Temesvar Krt, H-6726 Szeged, Hungary
[2] Broad Inst MIT & Harvard, 415 Main St, Cambridge, MA 02141 USA
[3] Carnegie Mellon Univ, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
[4] Harvard Univ, 86 Brattle St, Cambridge, MA 02138 USA
[5] Morgridge Inst Res, 330 N Orchard St, Madison, WI 53715 USA
[6] Univ Wisconsin Madison, Dept Biostat & Med Informat, 1300 Univ Ave, Madison, WI 53706 USA
基金
欧盟地平线“2020”;
关键词
ASSAY;
D O I
10.1038/s41467-024-45999-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Measuring the phenotypic effect of treatments on cells through imaging assays is an efficient and powerful way of studying cell biology, and requires computational methods for transforming images into quantitative data. Here, we present an improved strategy for learning representations of treatment effects from high-throughput imaging, following a causal interpretation. We use weakly supervised learning for modeling associations between images and treatments, and show that it encodes both confounding factors and phenotypic features in the learned representation. To facilitate their separation, we constructed a large training dataset with images from five different studies to maximize experimental diversity, following insights from our causal analysis. Training a model with this dataset successfully improves downstream performance, and produces a reusable convolutional network for image-based profiling, which we call Cell Painting CNN. We evaluated our strategy on three publicly available Cell Painting datasets, and observed that the Cell Painting CNN improves performance in downstream analysis up to 30% with respect to classical features, while also being more computationally efficient. Assessing cell phenotypes in image-based assays requires solid computational methods for transforming images into quantitative data. Here, the authors present a strategy for learning representations of treatment effects from high-throughput imaging, following a causal interpretation.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Learning representations for image-based profiling of perturbations
    Nikita Moshkov
    Michael Bornholdt
    Santiago Benoit
    Matthew Smith
    Claire McQuin
    Allen Goodman
    Rebecca A. Senft
    Yu Han
    Mehrtash Babadi
    Peter Horvath
    Beth A. Cimini
    Anne E. Carpenter
    Shantanu Singh
    Juan C. Caicedo
    Nature Communications, 15
  • [2] Applications in image-based profiling of perturbations
    Caicedo, Juan C.
    Singh, Shantanu
    Carpenter, Anne E.
    CURRENT OPINION IN BIOTECHNOLOGY, 2016, 39 : 134 - 142
  • [3] Capturing cell heterogeneity in representations of cell populations for image-based profiling using contrastive learning
    van Dijk, Robert
    Arevalo, John
    Babadi, Mehrtash
    Carpenter, Anne E.
    Singh, Shantanu
    PLoS Computational Biology, 2024, 20 (11)
  • [4] Learning Image-based Representations for Heart Sound Classification
    Ren, Zhao
    Cummins, Nicholas
    Pandit, Vedhas
    Han, Jing
    Qian, Kun
    Schuller, Bjorn
    DH '18: PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON DIGITAL HEALTH, 2018, : 143 - 147
  • [5] Learning Efficient Representations for Image-Based Patent Retrieval
    Wang, Hongsong
    Zhang, Yuqi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VII, 2024, 14431 : 15 - 26
  • [6] Deep Representation Learning for Image-Based Cell Profiling
    Wei, Wenzhao
    Haidinger, Sacha
    Lock, John
    Meijering, Erik
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 : 487 - 497
  • [7] Still Image-based Human Activity Recognition with Deep Representations and Residual Learning
    Siyal, Ahsan Raza
    Bhutto, Zuhaibuddin
    Shah, Syed Muhammad Shehram
    Iqbal, Azhar
    Mehmood, Faraz
    Hussain, Ayaz
    Ahmed, Saleem
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 471 - 477
  • [8] A scalable framework for image-based material representations
    Franke, Tobias
    Fellner, Dieter W.
    WEB3D 2012, 2012, : 83 - 91
  • [9] Object-based and image-based object representations
    Samet, Hanan
    ACM Comput Surv, 1600, 2 (159-217):
  • [10] Object-based and image-based object representations
    Samet, H
    ACM COMPUTING SURVEYS, 2004, 36 (02) : 159 - 217