Faecal microbiome-based machine learning for multi-class disease diagnosis

被引:0
|
作者
Qi Su
Qin Liu
Raphaela Iris Lau
Jingwan Zhang
Zhilu Xu
Yun Kit Yeoh
Thomas W. H. Leung
Whitney Tang
Lin Zhang
Jessie Q. Y. Liang
Yuk Kam Yau
Jiaying Zheng
Chengyu Liu
Mengjing Zhang
Chun Pan Cheung
Jessica Y. L. Ching
Hein M. Tun
Jun Yu
Francis K. L. Chan
Siew C. Ng
机构
[1] Microbiota I-Center (MagIC),Department of Medicine and Therapeutics
[2] The Chinese University of Hong Kong,Li Ka Shing Institute of Health Sciences, State Key Laboratory of Digestive Disease, Institute of Digestive Disease
[3] The Chinese University of Hong Kong,Center for Gut Microbiota Research
[4] Faculty of Medicine,JC School of Public Health and Primary Care
[5] The Chinese University of Hong Kong,undefined
[6] The Chinese University of Hong Kong,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Systemic characterisation of the human faecal microbiome provides the opportunity to develop non-invasive approaches in the diagnosis of a major human disease. However, shared microbial signatures across different diseases make accurate diagnosis challenging in single-disease models. Herein, we present a machine-learning multi-class model using faecal metagenomic dataset of 2,320 individuals with nine well-characterised phenotypes, including colorectal cancer, colorectal adenomas, Crohn’s disease, ulcerative colitis, irritable bowel syndrome, obesity, cardiovascular disease, post-acute COVID-19 syndrome and healthy individuals. Our processed data covers 325 microbial species derived from 14.3 terabytes of sequence. The trained model achieves an area under the receiver operating characteristic curve (AUROC) of 0.90 to 0.99 (Interquartile range, IQR, 0.91–0.94) in predicting different diseases in the independent test set, with a sensitivity of 0.81 to 0.95 (IQR, 0.87–0.93) at a specificity of 0.76 to 0.98 (IQR 0.83–0.95). Metagenomic analysis from public datasets of 1,597 samples across different populations observes comparable predictions with AUROC of 0.69 to 0.91 (IQR 0.79–0.87). Correlation of the top 50 microbial species with disease phenotypes identifies 363 significant associations (FDR < 0.05). This microbiome-based multi-disease model has potential clinical application in disease diagnostics and treatment response monitoring and warrants further exploration.
引用
收藏
相关论文
共 50 条
  • [21] Online Fault Diagnosis of Photovoltaic Modules Based on Multi-Class Support Vector Machine
    Wang, Lingxiao
    Liu, Jiao
    Guo, Xiaogang
    Yang, Qiang
    Yan, Wenjun
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 4569 - 4574
  • [22] The severity prediction of the binary and multi-class cardiovascular disease-A machine learning-based fusion approach
    Kibria, Hafsa Binte
    Matin, Abdul
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2022, 98
  • [23] Applications of multi-class machine learning models to drug design
    Waldman, Marvin
    Lawless, Michael
    Daga, Pankaj
    Clark, Robert
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [24] Multi-class JPEG Steganalysis Using Extreme Learning Machine
    Bhasin, Veenu
    Bedi, Punam
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1948 - 1952
  • [25] Extreme Learning Machine for Multi-class Sentiment Classification of Tweets
    Wang, Zhaoxia
    Parth, Yogesh
    PROCEEDINGS OF ELM-2015, VOL 1: THEORY, ALGORITHMS AND APPLICATIONS (I), 2016, 6 : 1 - 11
  • [26] AUCμ: A Performance Metric for Multi-Class Machine Learning Models
    Kleiman, Ross S.
    Page, David
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [27] Stator Current Based Multi-Class Fault Diagnosis of Three Phase Induction Motor using Machine Learning Framework
    Maulik, Saubhik
    Konar, Pratyay
    Chattopadhyay, Paramita
    2022 IEEE 6TH INTERNATIONAL CONFERENCE ON CONDITION ASSESSMENT TECHNIQUES IN ELECTRICAL SYSTEMS, CATCON, 2022, : 225 - 229
  • [28] Prostate Disease Diagnosis from CT Images Using Multi-Class Support Vector Machine
    Abbas, Wafaa A.
    Salman, Salema S.
    Ismael, Ban S.
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2019, 19 (01): : 121 - 127
  • [29] EEG Authentication System Based on One- and Multi-Class Machine Learning Classifiers
    Hernandez-Alvarez, Luis
    Barbierato, Elena
    Caputo, Stefano
    Mucchi, Lorenzo
    Hernandez Encinas, Luis
    SENSORS, 2023, 23 (01)
  • [30] Multi-class Classification of Industrial Fall from Height based on Machine Learning Algorithm
    Koo, Bum Mo
    Kim, Jong Man
    Nam, Ye Jin
    Sung, Dong Jin
    Shim, Jae Woo
    Yang, Su Min
    Kim, Young Ho
    TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS A, 2022, 46 (03) : 259 - 265