LLAMA: a robust and scalable machine learning pipeline for analysis of large scale 4D microscopy data: analysis of cell ruffles and filopodia

Cited by: 2
Authors
Lefevre, James G. [1]
Koh, Yvette W. H. [1]
Wall, Adam A. [1]
Condon, Nicholas D. [1]
Stow, Jennifer L. [1]
Hamilton, Nicholas A. [1, 2]
Affiliations
[1] Univ Queensland, Inst Mol Biosci, Brisbane, Qld, Australia
[2] Univ Queensland, Res Comp Ctr, Brisbane, Qld, Australia
Funding
Australian Research Council; Medical Research Council (UK)
Keywords
Machine learning; Semantic segmentation; High performance computing; Object detection and tracking; Macrophage; Ruffles; Filopodia; IMAGE; MACROPINOCYTOSIS; SEGMENTATION;
DOI
10.1186/s12859-021-04324-z
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: With recent advances in microscopy, recordings of cell behaviour can result in terabyte-size datasets. The lattice light sheet microscope (LLSM) images cells at high speed and high 3D resolution, accumulating data at 100 frames/second over hours, presenting a major challenge for interrogating these datasets. The surfaces of vertebrate cells can rapidly deform to create projections that interact with the microenvironment. Such surface projections include spike-like filopodia and wave-like ruffles on the surface of macrophages as they engage in immune surveillance. LLSM imaging has provided new insights into the complex surface behaviours of immune cells, including revealing new types of ruffles. However, full use of these data requires systematic and quantitative analysis of thousands of projections over hundreds of time steps, and an effective system for analysis of individual structures at this scale requires efficient and robust methods with minimal user intervention.
Results: We present LLAMA, a platform to enable systematic analysis of terabyte-scale 4D microscopy datasets. We use a machine learning method for semantic segmentation, followed by a robust and configurable object separation and tracking algorithm, generating detailed object-level statistics. Our system is designed to run on high-performance computing to achieve high throughput, with outputs suitable for visualisation and statistical analysis. Advanced visualisation is a key element of LLAMA: we provide a specialised tool which supports interactive quality control, optimisation, and output visualisation processes to complement the processing pipeline. LLAMA is demonstrated in an analysis of macrophage surface projections, in which it is used to i) discriminate ruffles induced by lipopolysaccharide (LPS) and macrophage colony stimulating factor (CSF-1) and ii) determine the autonomy of ruffle morphologies.
Conclusions: LLAMA provides an effective open-source tool for running a cell microscopy analysis pipeline based on semantic segmentation, object analysis and tracking. Detailed numerical and visual outputs enable effective statistical analysis, identifying distinct patterns of increased activity under the two interventions considered in our example analysis. Our system provides the capacity to screen large datasets for specific structural configurations. LLAMA identified distinct features of LPS- and CSF-1-induced ruffles, and it identified a continuity of behaviour between tent pole ruffling, wave-like ruffling and filopodia deployment.
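The abstract describes a pipeline of semantic segmentation, then object separation and tracking, then object-level statistics. As a rough illustration of the last two stages only, the following is a minimal sketch and not LLAMA's actual implementation. It assumes each time step has already been segmented into a set of foreground `(z, y, x)` voxels, separates objects by 6-connectivity, and links objects between consecutive frames by greatest voxel overlap. The function names (`separate_objects`, `link_frames`, `track`) and the `min_overlap` parameter are hypothetical, chosen for this example.

```python
from collections import deque

# Offsets of the six face-adjacent neighbours of a voxel, in (z, y, x) order.
NEIGHBOURS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def separate_objects(voxels):
    """Split a set of foreground voxels into 6-connected components."""
    remaining = set(voxels)
    objects = []
    while remaining:
        seed = remaining.pop()
        component = {seed}
        queue = deque([seed])
        while queue:  # breadth-first flood fill from the seed voxel
            z, y, x = queue.popleft()
            for dz, dy, dx in NEIGHBOURS:
                neighbour = (z + dz, y + dy, x + dx)
                if neighbour in remaining:
                    remaining.remove(neighbour)
                    component.add(neighbour)
                    queue.append(neighbour)
        objects.append(component)
    objects.sort(key=min)  # deterministic order: by smallest voxel coordinate
    return objects


def link_frames(prev_objects, next_objects, min_overlap=1):
    """For each object in the next frame, return the index of the previous-frame
    object sharing the most voxels with it, or None if the overlap is too small."""
    links = []
    for obj in next_objects:
        best_index, best_overlap = None, 0
        for i, prev in enumerate(prev_objects):
            overlap = len(obj & prev)
            if overlap > best_overlap:
                best_index, best_overlap = i, overlap
        links.append(best_index if best_overlap >= min_overlap else None)
    return links


def track(frames, min_overlap=1):
    """Return, per frame, a list of (track_id, volume_in_voxels) tuples.

    Simplification: if one object splits in two, both parts inherit its track ID.
    """
    next_id = 0
    prev_objects, prev_ids = [], []
    result = []
    for voxels in frames:
        objects = separate_objects(voxels)
        ids = []
        for link in link_frames(prev_objects, objects, min_overlap):
            if link is None:  # no sufficiently overlapping predecessor: new track
                ids.append(next_id)
                next_id += 1
            else:
                ids.append(prev_ids[link])
        result.append([(tid, len(obj)) for tid, obj in zip(ids, objects)])
        prev_objects, prev_ids = objects, ids
    return result


# Toy data: one object that grows and shifts, one that vanishes, one that appears.
frame0 = {(0, 0, 0), (0, 0, 1), (0, 5, 5)}
frame1 = {(0, 0, 1), (0, 0, 2), (0, 0, 3), (0, 9, 9)}
tracks = track([frame0, frame1])
print(tracks)
```

Overlap-based linking is only one simple choice and relies on objects moving less than their own extent between frames; the tracking algorithm in the paper is described as configurable and may work differently.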
Pages: 26
Journal
BMC Bioinformatics, 22