Exploring non-invasive biomarkers for pulmonary nodule detection based on salivary microbiomics and machine learning algorithms

被引:0
|
作者
Huang, Chunxia [1 ]
Ma, Qiong [1 ]
Zeng, Xiao [1 ]
He, Jiawei [1 ]
You, Fengming [1 ,2 ]
Fu, Xi [1 ,2 ]
Ren, Yifeng [1 ,2 ]
机构
[1] Hosp Chengdu Univ Tradit Chinese Med, Chengdu 610072, Sichuan, Peoples R China
[2] Hosp Chengdu Univ Tradit Chinese Med, TCM Regulating Metab Dis Key Lab Sichuan Prov, Chengdu 610072, Sichuan, Peoples R China
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
基金
中国博士后科学基金;
关键词
Salivary microbiota; Pulmonary nodule; Machine learning; SHapley additive additive explanations (SHAP); Non-invasive biomarker; PROBABILITY;
D O I
10.1038/s41598-025-95692-6
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Microorganisms are one of the most promising biomarkers for cancer, and the relationship between microorganisms and lung cancer occurrence and development provides significant potential for pulmonary nodule (PN) diagnosis from a microbiological perspective. This study aimed to analyze the salivary microbiota features of patients with PN and assess the potential of the salivary microbiota as a non-invasive PN biomarker. We collected saliva smples from 153 patients with PN and 40 controls. Using 16 S rRNA gene sequencing, differences in alpha- and beta-diversity and community composition between the group with PN and controls were analyzed. Subsequently, specific microbial variables were selected using six models were trained on the selected salivary microbial features. The models were evaluated using metrics, such as the area under the receiver operating characteristic curve (AUC), to identify the best-performing model. Furthermore, the Bayesian optimization algorithm was used to optimize this best-performing model. Finally, the SHapley Additive exPlanations (SHAP) interpretability framework was used to interpret the output of the optimal model and identify potential PN biomarkers. Significant differences in alpha- and beta-diversity were observed between the group with PN and controls. Although the predominant genera were consistent between the groups, significant disparities were observed in their relative abundances. By leveraging the random forest algorithm, ten characteristic microbial variables were identified and incorporated into six models, which effectively facilitated PN diagnosis. The XGBoost model demonstrated the best performance. Further optimization of the XGBoost model resulted in a Bayesian Optimization-based XGBoost (BOXGB) model. Based on the BOXGB model, an online saliva microbiota-based PN prediction platform was developed. Lastly, SHAP analysis suggested Defluviitaleaceae_UCG-011, Aggregatibacter, Oribacterium, Bacillus, and Prevotalla are promising non-invasive PN biomarkers. This study proved salivary microbiota as a non-invasive PN biomarker, expanding the clinical diagnostic approaches for PN.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Non-Invasive Machine Learning-Based Classification of Bone Health
    Bhise, Sanvi Pranav
    Havaldar, Raviraj
    TRAITEMENT DU SIGNAL, 2022, 39 (05) : 1695 - 1702
  • [32] Decision support detection system for lung nodule abnormalities based on machine learning algorithms
    Alsallal, Muna
    Sharif, Mhd Saeed
    Hadi, Bydaa
    Albadry, Ruwaida
    JOURNAL OF CONTEMPORARY MEDICAL SCIENCES, 2019, 5 (03): : 165 - 169
  • [33] Non-Invasive Biomarkers for Early Lung Cancer Detection
    Saman, Harman
    Raza, Afsheen
    Patil, Kalyani
    Uddin, Shahab
    Crnogorac-Jurcevic, Tatjana
    CANCERS, 2022, 14 (23)
  • [34] Detection of miRNA as Non-Invasive Biomarkers of Colorectal Cancer
    Ren, Albert
    Dong, Yujuan
    Tsoi, Ho
    Yu, Jun
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2015, 16 (02) : 2810 - 2823
  • [35] Non-Invasive Biomarkers for Early Detection of Breast Cancer
    Li, Jiawei
    Guan, Xin
    Fan, Zhimin
    Ching, Lai-Ming
    Li, Yan
    Wang, Xiaojia
    Cao, Wen-Ming
    Liu, Dong-Xu
    CANCERS, 2020, 12 (10) : 1 - 28
  • [36] Smartphone based non-invasive salivary glucose biosensor
    Soni, Anuradha
    Jha, Sandeep Kumar
    ANALYTICA CHIMICA ACTA, 2017, 996 : 54 - 63
  • [37] Non-invasive prediction mechanism for COVID-19 disease using machine learning algorithms
    Bhardwaj, Arnav
    Agarwal, Hitesh
    Rani, Anuj
    Srivastava, Prakash
    Kumar, Manoj
    Gupta, Sunil
    INTERNATIONAL JOURNAL OF CRITICAL INFRASTRUCTURES, 2024, 20 (02) : 111 - 124
  • [38] Comparing machine learning algorithms for non-invasive detection and classification of failure in piezoresistive bone cement via electrical impedance tomography
    Keiderling, L.
    Rosendorf, J.
    Owens, C. E.
    Varadarajan, K. M.
    Hart, A. J.
    Schwab, J.
    Tallman, T. N.
    Ghaednia, H.
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2023, 94 (12):
  • [39] Rapid and non-invasive diagnosis of hyperkalemia in patients with systolic myocardial failure using a model based on machine learning algorithms
    Torshizi, Hamid M.
    Khorgami, Mohammad R.
    Omidi, Negar
    Khalaj, Fattaneh
    Ahmadi, Mohsen
    JOURNAL OF FAMILY MEDICINE AND PRIMARY CARE, 2024, 13 (08) : 3393 - 3397
  • [40] Non-invasive Jaundice Detection using Machine Vision
    Laddi, Amit
    Kumar, Sanjeev
    Sharma, Shashi
    Kumar, Amod
    IETE JOURNAL OF RESEARCH, 2013, 59 (05) : 591 - 596