Exploring non-invasive biomarkers for pulmonary nodule detection based on salivary microbiomics and machine learning algorithms

被引:0
|
作者
Huang, Chunxia [1 ]
Ma, Qiong [1 ]
Zeng, Xiao [1 ]
He, Jiawei [1 ]
You, Fengming [1 ,2 ]
Fu, Xi [1 ,2 ]
Ren, Yifeng [1 ,2 ]
机构
[1] Hosp Chengdu Univ Tradit Chinese Med, Chengdu 610072, Sichuan, Peoples R China
[2] Hosp Chengdu Univ Tradit Chinese Med, TCM Regulating Metab Dis Key Lab Sichuan Prov, Chengdu 610072, Sichuan, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
基金
中国博士后科学基金;
关键词
Salivary microbiota; Pulmonary nodule; Machine learning; SHapley additive additive explanations (SHAP); Non-invasive biomarker; PROBABILITY;
D O I
10.1038/s41598-025-95692-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Microorganisms are one of the most promising biomarkers for cancer, and the relationship between microorganisms and lung cancer occurrence and development provides significant potential for pulmonary nodule (PN) diagnosis from a microbiological perspective. This study aimed to analyze the salivary microbiota features of patients with PN and assess the potential of the salivary microbiota as a non-invasive PN biomarker. We collected saliva smples from 153 patients with PN and 40 controls. Using 16 S rRNA gene sequencing, differences in alpha- and beta-diversity and community composition between the group with PN and controls were analyzed. Subsequently, specific microbial variables were selected using six models were trained on the selected salivary microbial features. The models were evaluated using metrics, such as the area under the receiver operating characteristic curve (AUC), to identify the best-performing model. Furthermore, the Bayesian optimization algorithm was used to optimize this best-performing model. Finally, the SHapley Additive exPlanations (SHAP) interpretability framework was used to interpret the output of the optimal model and identify potential PN biomarkers. Significant differences in alpha- and beta-diversity were observed between the group with PN and controls. Although the predominant genera were consistent between the groups, significant disparities were observed in their relative abundances. By leveraging the random forest algorithm, ten characteristic microbial variables were identified and incorporated into six models, which effectively facilitated PN diagnosis. The XGBoost model demonstrated the best performance. Further optimization of the XGBoost model resulted in a Bayesian Optimization-based XGBoost (BOXGB) model. Based on the BOXGB model, an online saliva microbiota-based PN prediction platform was developed. Lastly, SHAP analysis suggested Defluviitaleaceae_UCG-011, Aggregatibacter, Oribacterium, Bacillus, and Prevotalla are promising non-invasive PN biomarkers. This study proved salivary microbiota as a non-invasive PN biomarker, expanding the clinical diagnostic approaches for PN.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Non-Invasive Early Diagnosis of Obstructive Lung Diseases Leveraging Machine Learning Algorithms
    Rehman, Mujeeb Ur
    Driss, Maha
    Khakimov, Abdukodir
    Khalid, Sohail
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (03): : 5681 - 5697
  • [22] Salivary biomarkers in non-invasive oral cancer diagnostics: a comprehensive review
    Vats, Ravina
    Yadav, Pooja
    Bano, Afsareen
    Wadhwa, Sapna
    Bhardwaj, Rashmi
    JOURNAL OF APPLIED ORAL SCIENCE, 2024, 32
  • [23] The Potential of MicroRNAs as Non-Invasive Prostate Cancer Biomarkers: A Systematic Literature Review Based on a Machine Learning Approach
    Bevacqua, Emilia
    Ammirato, Salvatore
    Cione, Erika
    Curcio, Rosita
    Dolce, Vincenza
    Tucci, Paola
    CANCERS, 2022, 14 (21)
  • [24] Salivary miRNAs as non-invasive biomarkers of hepatocellular carcinoma: a pilot study
    Mariam, Arshiya
    Miller-Atkins, Galen
    Moro, Amika
    Rodarte, Alejandro I.
    Siddiqi, Shirin
    Acevedo-Moreno, Lou-Anne
    Brown, J. Mark
    Allende, Daniela S.
    Aucejo, Federico
    Rotroff, Daniel M.
    PEERJ, 2022, 10
  • [25] Non-invasive diagnosis of the solitary pulmonary nodule: which way is right?
    Divisi, Duilio
    Zaccagna, Gino
    De Sanctis, Stefania
    Marella, Antonio
    De Vico, Andrea
    AME SURGICAL JOURNAL, 2023, 3
  • [26] NON-INVASIVE DETECTION OF PULMONARY-HYPERTENSION
    ALGEO, S
    MORRISON, D
    GOLDMAN, S
    OVITT, T
    CLINICAL RESEARCH, 1983, 31 (01): : A2 - A2
  • [27] A non-invasive approach for calcium deficiency detection in pears using machine learning
    Yogesh
    Dubey, Ashwani Kumar
    Rocha, Alvaro
    NEURAL COMPUTING & APPLICATIONS, 2023, 37 (11): : 7609 - 7618
  • [28] A machine learning model for non-invasive detection of atherosclerotic coronary artery aneurysm
    Ali A. Rostam-Alilou
    Marziyeh Safari
    Hamid R. Jarrah
    Ali Zolfagharian
    Mahdi Bodaghi
    International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 2221 - 2229
  • [29] Non-invasive Diabetes Mellitus Detection System using Machine Learning Techniques
    Prabha, Anju
    Yadav, Jyoti
    Rani, Asha
    Singh, Vijander
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 948 - 953
  • [30] A machine learning model for non-invasive detection of atherosclerotic coronary artery aneurysm
    Rostam-Alilou, Ali A.
    Safari, Marziyeh
    Jarrah, Hamid R.
    Zolfagharian, Ali
    Bodaghi, Mahdi
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (12) : 2221 - 2229