Faecal microbiome-based machine learning for multi-class disease diagnosis

被引:0
|
作者
Qi Su
Qin Liu
Raphaela Iris Lau
Jingwan Zhang
Zhilu Xu
Yun Kit Yeoh
Thomas W. H. Leung
Whitney Tang
Lin Zhang
Jessie Q. Y. Liang
Yuk Kam Yau
Jiaying Zheng
Chengyu Liu
Mengjing Zhang
Chun Pan Cheung
Jessica Y. L. Ching
Hein M. Tun
Jun Yu
Francis K. L. Chan
Siew C. Ng
机构
[1] Microbiota I-Center (MagIC),Department of Medicine and Therapeutics
[2] The Chinese University of Hong Kong,Li Ka Shing Institute of Health Sciences, State Key Laboratory of Digestive Disease, Institute of Digestive Disease
[3] The Chinese University of Hong Kong,Center for Gut Microbiota Research
[4] Faculty of Medicine,JC School of Public Health and Primary Care
[5] The Chinese University of Hong Kong,undefined
[6] The Chinese University of Hong Kong,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Systemic characterisation of the human faecal microbiome provides the opportunity to develop non-invasive approaches in the diagnosis of a major human disease. However, shared microbial signatures across different diseases make accurate diagnosis challenging in single-disease models. Herein, we present a machine-learning multi-class model using faecal metagenomic dataset of 2,320 individuals with nine well-characterised phenotypes, including colorectal cancer, colorectal adenomas, Crohn’s disease, ulcerative colitis, irritable bowel syndrome, obesity, cardiovascular disease, post-acute COVID-19 syndrome and healthy individuals. Our processed data covers 325 microbial species derived from 14.3 terabytes of sequence. The trained model achieves an area under the receiver operating characteristic curve (AUROC) of 0.90 to 0.99 (Interquartile range, IQR, 0.91–0.94) in predicting different diseases in the independent test set, with a sensitivity of 0.81 to 0.95 (IQR, 0.87–0.93) at a specificity of 0.76 to 0.98 (IQR 0.83–0.95). Metagenomic analysis from public datasets of 1,597 samples across different populations observes comparable predictions with AUROC of 0.69 to 0.91 (IQR 0.79–0.87). Correlation of the top 50 microbial species with disease phenotypes identifies 363 significant associations (FDR < 0.05). This microbiome-based multi-disease model has potential clinical application in disease diagnostics and treatment response monitoring and warrants further exploration.
引用
收藏
相关论文
共 50 条
  • [1] Faecal microbiome-based machine learning for multi-class disease diagnosis
    Su, Qi
    Liu, Qin
    Lau, Raphaela Iris
    Zhang, Jingwan
    Xu, Zhilu
    Yeoh, Yun Kit
    Leung, Thomas W. H.
    Tang, Whitney
    Zhang, Lin
    Liang, Jessie Q. Y.
    Yau, Yuk Kam
    Zheng, Jiaying
    Liu, Chengyu
    Zhang, Mengjing
    Cheung, Chun Pan
    Ching, Jessica Y. L.
    Tun, Hein M.
    Yu, Jun
    Chan, Francis K. L.
    Ng, Siew C.
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [2] Machine learning model for microbiome-based diagnosis of bacterial vaginosis
    Gupta, Somesh
    Challa, Apoorva
    Kachhawa, Garima
    Nagpal, Sunil
    Sood, Seema
    Taneja, Bhupesh
    SEXUALLY TRANSMITTED DISEASES, 2024, 51 (01) : S425 - S425
  • [3] Machine Learning Strategy for Gut Microbiome-Based Diagnostic Screening of Cardiovascular Disease
    Aryal, Sachin
    Alimadadi, Ahmad
    Manandhar, Ishan
    Joe, Bina
    Cheng, Xi
    HYPERTENSION, 2020, 76 (05) : 1555 - 1562
  • [4] Machine Learning-Based Framework for Multi-Class Diagnosis of Neurodegenerative Diseases: A Study on Parkinson's Disease
    Singh, Gurpreet
    Vadera, Meet
    Samavedham, Lakshminarayanan
    Lim, Erle Chuen-Hian
    IFAC PAPERSONLINE, 2016, 49 (07): : 990 - 995
  • [5] Gut microbiome-based supervised machine learning for clinical diagnosis of inflammatory bowel diseases
    Manandhar, Ishan
    Alimadadi, Ahmad
    Aryal, Sachin
    Munroe, Patricia B.
    Joe, Bina
    Cheng, Xi
    AMERICAN JOURNAL OF PHYSIOLOGY-GASTROINTESTINAL AND LIVER PHYSIOLOGY, 2021, 320 (03): : G328 - G337
  • [6] Benchmark of Data Processing Methods and Machine Learning Models for Gut Microbiome-Based Diagnosis of Inflammatory Bowel Disease
    Kubinski, Ryszard
    Djamen-Kepaou, Jean-Yves
    Zhanabaev, Timur
    Hernandez-Garcia, Alex
    Bauer, Stefan
    Hildebrand, Falk
    Korcsmaros, Tamas
    Karam, Sani
    Jantchou, Prevost
    Kafi, Kamran
    Martin, Ryan D.
    FRONTIERS IN GENETICS, 2022, 13
  • [7] Noninvasive, microbiome-based diagnosis of inflammatory bowel disease
    Zheng, Jiaying
    Sun, Qianru
    Zhang, Mengjing
    Liu, Chengyu
    Su, Qi
    Zhang, Lin
    Xu, Zhilu
    Lu, Wenqi
    Ching, Jessica
    Tang, Whitney
    Cheung, Chun Pan
    Hamilton, Amy L.
    O'Brien, Amy L. Wilson
    Wei, Shu Chen
    Bernstein, Charles N.
    Rubin, David T.
    Chang, Eugene B.
    Morrison, Mark
    Kamm, Michael A.
    Chan, Francis K. L.
    Zhang, Jingwan
    Ng, Siew C.
    NATURE MEDICINE, 2024, : 3555 - 3567
  • [8] Microbiome-based Diagnostic Screening Of Cardiovascular Disease Using A Machine Learning Approach.
    Aryal, Sachin
    Alimadadi, Ahmad
    Manandhar, Ishan
    Joe, Bina
    Cheng, Xi
    HYPERTENSION, 2020, 76
  • [9] Power Transformer Fault Diagnosis Based on Multi-class Multi-Kernel Learning Relevance Vector Machine
    Yin, Jinliang
    Zhou, Xuesong
    Ma, Youjie
    Wu, Yanjuan
    Xu, Xiaoning
    2015 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, 2015, : 217 - 221
  • [10] On the Machine Learning-based Multi-class Classification of Microscopic Colitis
    Tara, Vivek
    Mitra, Dipankar
    Muduganti, Aditi
    Mali, Padmavathi
    Maiti, Srabana
    Dey, Shuvashis
    Gomes, Rahul
    2024 IEEE INTERNATIONAL CONFERENCE ON ELECTRO INFORMATION TECHNOLOGY, EIT 2024, 2024, : 38 - 43