Exploring non-invasive biomarkers for pulmonary nodule detection based on salivary microbiomics and machine learning algorithms

被引:0
|
作者
Huang, Chunxia [1 ]
Ma, Qiong [1 ]
Zeng, Xiao [1 ]
He, Jiawei [1 ]
You, Fengming [1 ,2 ]
Fu, Xi [1 ,2 ]
Ren, Yifeng [1 ,2 ]
机构
[1] Hosp Chengdu Univ Tradit Chinese Med, Chengdu 610072, Sichuan, Peoples R China
[2] Hosp Chengdu Univ Tradit Chinese Med, TCM Regulating Metab Dis Key Lab Sichuan Prov, Chengdu 610072, Sichuan, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
基金
中国博士后科学基金;
关键词
Salivary microbiota; Pulmonary nodule; Machine learning; SHapley additive additive explanations (SHAP); Non-invasive biomarker; PROBABILITY;
D O I
10.1038/s41598-025-95692-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Microorganisms are one of the most promising biomarkers for cancer, and the relationship between microorganisms and lung cancer occurrence and development provides significant potential for pulmonary nodule (PN) diagnosis from a microbiological perspective. This study aimed to analyze the salivary microbiota features of patients with PN and assess the potential of the salivary microbiota as a non-invasive PN biomarker. We collected saliva smples from 153 patients with PN and 40 controls. Using 16 S rRNA gene sequencing, differences in alpha- and beta-diversity and community composition between the group with PN and controls were analyzed. Subsequently, specific microbial variables were selected using six models were trained on the selected salivary microbial features. The models were evaluated using metrics, such as the area under the receiver operating characteristic curve (AUC), to identify the best-performing model. Furthermore, the Bayesian optimization algorithm was used to optimize this best-performing model. Finally, the SHapley Additive exPlanations (SHAP) interpretability framework was used to interpret the output of the optimal model and identify potential PN biomarkers. Significant differences in alpha- and beta-diversity were observed between the group with PN and controls. Although the predominant genera were consistent between the groups, significant disparities were observed in their relative abundances. By leveraging the random forest algorithm, ten characteristic microbial variables were identified and incorporated into six models, which effectively facilitated PN diagnosis. The XGBoost model demonstrated the best performance. Further optimization of the XGBoost model resulted in a Bayesian Optimization-based XGBoost (BOXGB) model. Based on the BOXGB model, an online saliva microbiota-based PN prediction platform was developed. Lastly, SHAP analysis suggested Defluviitaleaceae_UCG-011, Aggregatibacter, Oribacterium, Bacillus, and Prevotalla are promising non-invasive PN biomarkers. This study proved salivary microbiota as a non-invasive PN biomarker, expanding the clinical diagnostic approaches for PN.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Salivary metabolomic biomarkers for non-invasive lung cancer detection
    Kajiwara, Naohiro
    Kakihana, Masatoshi
    Maeda, Junichi
    Kaneko, Miku
    Ota, Sana
    Enomoto, Ayame
    Ikeda, Norihiko
    Sugimoto, Masahiro
    CANCER SCIENCE, 2024, 115 (05) : 1695 - 1705
  • [2] Exploring Non-Invasive Salivary Biomarkers for Acute Pain Diagnostics: A Comprehensive Review
    Stendelyte, Laura
    Malinauskas, Mantas
    Grinkeviciute, Dovile Evalda
    Jankauskaite, Lina
    DIAGNOSTICS, 2023, 13 (11)
  • [3] Non-invasive biomarkers in pulmonary fibrosis
    Prasse, Antje
    Mueller-Quernheim, Joachim
    RESPIROLOGY, 2009, 14 (06) : 788 - 795
  • [4] Near-infrared spectroscopy and machine learning algorithms for rapid and non-invasive detection of Trichuris
    Kariyawasam, Tharanga N.
    Ciocchetta, Silvia
    Visendi, Paul
    Soares Magalhaes, Ricardo J.
    Smith, Maxine E.
    Giacomin, Paul R.
    Sikulu-Lord, Maggy T.
    PLOS NEGLECTED TROPICAL DISEASES, 2023, 17 (11):
  • [5] Non-Invasive Heart Failure Evaluation Using Machine Learning Algorithms
    Victor, Odeh Adeyi
    Chen, Yifan
    Ding, Xiaorong
    SENSORS, 2024, 24 (07)
  • [6] Rapid and non-invasive detection of cystic echinococcosis in sheep based on serum fluorescence spectrum combined with machine learning algorithms
    Xu, Shengke
    Dawuti, Wubulitalifu
    Maimaitiaili, Maierhaba
    Dou, Jingrui
    Aizezi, Malike
    Aimulajiang, Kalibixiati
    Lu, Xiaoyi
    Lu, Guodong
    JOURNAL OF BIOPHOTONICS, 2024, 17 (04)
  • [7] PhotoAgeClock: deep learning algorithms for development of non-invasive visual biomarkers of aging
    Bobrov, Eugene
    Georgievskaya, Anastasia
    Kiselev, Konstantin
    Sevastopolsky, Artem
    Zhavoronkov, Alex
    Gurov, Sergey
    Rudakov, Konstantin
    Tobar, Maria del Pilar Bonilla
    Jaspers, Soren
    Clemann, Sven
    AGING-US, 2018, 10 (11): : 3249 - 3259
  • [8] Non-invasive algorithms for the diagnosis of pulmonary hypertension
    Bonderman, Diana
    Wexberg, Paul
    Heinzl, Harald
    Lang, Irene M.
    THROMBOSIS AND HAEMOSTASIS, 2012, 108 (06) : 1037 - 1041
  • [9] Machine-learning based feature selection for a non-invasive breathing change detection
    Pegoraro, Juliana Alves
    Lavault, Sophie
    Wattiez, Nicolas
    Similowski, Thomas
    Gonzalez-Bermejo, Jesus
    Birmele, Etienne
    BIODATA MINING, 2021, 14 (01)
  • [10] Non-invasive thyroid detection based on electroglottogram signal using machine learning classifiers
    Sai, P. Vijay
    Rajalakshmi, T.
    Snekhalatha, U.
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART H-JOURNAL OF ENGINEERING IN MEDICINE, 2021, 235 (10) : 1128 - 1145