MASI enables fast model-free standardization and integration of single-cell transcriptomics data

被引:1
|
作者
Xu, Yang [1 ,4 ]
Kramann, Rafael [2 ]
McCord, Rachel Patton [3 ]
Hayat, Sikander [2 ]
机构
[1] Univ Tennessee, UT ORNL Grad Sch Genome Sci & Technol, Knoxville, TN 37996 USA
[2] Rhein Westfal TH Aachen, Inst Expt Med & Syst Biol, Aachen, Germany
[3] Univ Tennessee, Dept Biochem & Cellular & Mol Biol, Knoxville, TN 37996 USA
[4] Broad Inst & Harvard, Data Sci Platform, Cambridge, MA 02142 USA
关键词
EXPRESSION; ATLAS; LANDSCAPE; IDENTITY; REVEALS;
D O I
10.1038/s42003-023-04820-3
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
MASI is a computational pipeline that enables the integration and annotation of large single-cell transcriptomic datasets with limited computational resources. Single-cell transcriptomics datasets from the same anatomical sites generated by different research labs are becoming increasingly common. However, fast and computationally inexpensive tools for standardization of cell-type annotation and data integration are still needed in order to increase research inclusivity. To standardize cell-type annotation and integrate single-cell transcriptomics datasets, we have built a fast model-free integration method, named MASI (Marker-Assisted Standardization and Integration). We benchmark MASI with other well-established methods and demonstrate that MASI outperforms other methods, in terms of integration, annotation, and speed. To harness knowledge from single-cell atlases, we demonstrate three case studies that cover integration across biological conditions, surveyed participants, and research groups, respectively. Finally, we show MASI can annotate approximately one million cells on a personal laptop, making large-scale single-cell data integration more accessible. We envision that MASI can serve as a cheap computational alternative for the single-cell research community.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Domain generalization enables general cancer cell annotation in single-cell and spatial transcriptomics
    Zhixing Zhong
    Junchen Hou
    Zhixian Yao
    Lei Dong
    Feng Liu
    Junqiu Yue
    Tiantian Wu
    Junhua Zheng
    Gaoliang Ouyang
    Chaoyong Yang
    Jia Song
    Nature Communications, 15
  • [22] Domain generalization enables general cancer cell annotation in single-cell and spatial transcriptomics
    Zhong, Zhixing
    Hou, Junchen
    Yao, Zhixian
    Dong, Lei
    Liu, Feng
    Yue, Junqiu
    Wu, Tiantian
    Zheng, Junhua
    Ouyang, Gaoliang
    Yang, Chaoyong
    Song, Jia
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [23] Data denoising with transfer learning in single-cell transcriptomics
    Wang, Jingshu
    Agarwal, Divyansh
    Huang, Mo
    Hu, Gang
    Zhou, Zilu
    Ye, Chengzhong
    Zhang, Nancy R.
    NATURE METHODS, 2019, 16 (09) : 875 - +
  • [24] Data denoising with transfer learning in single-cell transcriptomics
    Jingshu Wang
    Divyansh Agarwal
    Mo Huang
    Gang Hu
    Zilu Zhou
    Chengzhong Ye
    Nancy R. Zhang
    Nature Methods, 2019, 16 : 875 - 878
  • [25] Comprehensive Integration of Single-Cell Data
    Stuart, Tim
    Butler, Andrew
    Hoffman, Paul
    Hafemeister, Christoph
    Papalexi, Efthymia
    Mauck, William M., III
    Hao, Yuhan
    Stoeckius, Marlon
    Smibert, Peter
    Satija, Rahul
    CELL, 2019, 177 (07) : 1888 - +
  • [26] Spotiphy enables single-cell spatial whole transcriptomics across an entire section
    Yang, Jiyuan
    Zheng, Ziqian
    Jiao, Yun
    Yu, Kaiwen
    Bhatara, Sheetal
    Yang, Xu
    Natarajan, Sivaraman
    Zhang, Jiahui
    Pan, Qingfei
    Easton, John
    Yan, Koon-Kiu
    Peng, Junmin
    Liu, Kaibo
    Yu, Jiyang
    NATURE METHODS, 2025, 22 (04)
  • [27] Uncovering intestinal macrophages through the integration of single-cell and spatial transcriptomics
    Garrido-Trigo, Alba
    Veny, Marisol
    Salas, Azucena
    GENES AND IMMUNITY, 2024, 25 (03) : 254 - 255
  • [28] stDiff: a diffusion model for imputing spatial transcriptomics through single-cell transcriptomics
    Li, Kongming
    Li, Jiahao
    Tao, Yuhao
    Wang, Fei
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
  • [29] FlowGrid enables fast clustering of very large single-cell RNA-seq data
    Fang, Xiunan
    Ho, Joshua W. K.
    BIOINFORMATICS, 2022, 38 (01) : 282 - 283
  • [30] A joint deep learning model enables simultaneous batch effect correction, denoising, and clustering in single-cell transcriptomics
    Lakkis, Justin
    Wang, David
    Zhang, Yuanchao
    Hu, Gang
    Wang, Kui
    Pan, Huize
    Ungar, Lyle
    Reilly, Muredach P.
    Li, Xiangjie
    Li, Mingyao
    GENOME RESEARCH, 2021, 31 (10) : 1753 - 1766