Supervised machine learning-based classification of oral malodor based on the microbiota in saliva samples

被引:17
|
作者
Nakano, Yoshio [1 ]
Takeshita, Toru [2 ]
Kamio, Noriaki [2 ]
Shiota, Susumu [2 ]
Shibata, Yukie [2 ]
Suzuki, Nao [3 ]
Yoneda, Masahiro [4 ]
Hirofuji, Takao [3 ]
Yamashita, Yoshihisa [2 ]
机构
[1] Nihon Univ, Sch Dent, Dept Chem, Chuo Ku, Tokyo 1018310, Japan
[2] Kyushu Univ, Fac Dent Sci, Div Oral Hlth Growth & Dev, Sect Prevent & Publ Hlth Dent,Higashi Ku, Fukuoka 8128582, Japan
[3] Fukuoka Dent Coll, Dept Gen Dent, Sect Gen Dent, Sawara Ku, Fukuoka 8140193, Japan
[4] Fukuoka Dent Coll, Ctr Oral Dis, Hakata Ku, Fukuoka 8120011, Japan
关键词
Support vector machines; Neural networks; Oral malodor classification; METHYL MERCAPTAN; DIVERSITY;
D O I
10.1016/j.artmed.2013.12.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: This study presents an effective method of classifying oral malodor from oral microbiota in saliva by using a support vector machine (SVM), an artificial neural network (ANN), and a decision tree. This approach uses concentrations of methyl mercaptan in mouth air as an indicator of oral malodor, and peak areas of terminal restriction fragment (T-RF) length polyrnorphisms (T-RFLPs) of the 16S rRNA gene as data for supervised machine-learning methods, without identifying specific species producing oral malodorous compounds. Methods: 1 6S rRNA genes were amplified from saliva samples from 309 subjects, and T-RFLP analysis was carried out with the DNA fragments. T-RFLP analysis provides information on microbiota consisting of fragment lengths and peak areas corresponding to bacterial strains. The peak area is equivalent to the frequency of a specific fragment when one molecule is selected from terminal fragments. Another frequency is obtained by dividing the number of species-containing samples by the total number of samples. An SVM, an ANN, and a decision tree were trained based on these two frequencies in 308 samples and classified the presence or absence of methyl mercaptan in mouth air from the remaining subject. Results: The proportion that trained SVM expressed as entropy achieved the highest classification accuracy, with a sensitivity of 51.1% and specificity of 95.0%. The ANN and decision tree provided lower classification accuracies, and only classification by the ANN was improved by weighting with entropy from the frequency of appearance in samples, which increased the accuracy to 81.9% with a sensitivity of 60.2% and a specificity of 90.5%. The decision tree showed low classification accuracy under all conditions. Conclusions: Using T-RF proportions and frequencies, models to classify the presence of methyl mercaptan, a volatile sulfur-containing compound that causes oral malodor, were developed. SVM classifiers successfully classified the presence of methyl mercaptan with high specificity, and this classification is expected to be useful for screening saliva for oral malodor before visits to specialist clinics. Classification by a SVM and an ANN does not require the identification of the oral microbiota species responsible for the malodor, and the ANN also does not require the proportions of T-RFs. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:97 / 101
页数:5
相关论文
共 50 条
  • [41] Supervised machine learning-based classification scheme to segment the brainstem on MRI in multicenter brain tumor treatment context
    Dolz, Jose
    Laprie, Anne
    Ken, Soleakhena
    Leroy, Henri-Arthur
    Reyns, Nicolas
    Massoptier, Laurent
    Vermandel, Maximilien
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2016, 11 (01) : 43 - 51
  • [42] Supervised machine learning-based classification scheme to segment the brainstem on MRI in multicenter brain tumor treatment context
    Jose Dolz
    Anne Laprie
    Soléakhéna Ken
    Henri-Arthur Leroy
    Nicolas Reyns
    Laurent Massoptier
    Maximilien Vermandel
    International Journal of Computer Assisted Radiology and Surgery, 2016, 11 : 43 - 51
  • [43] Partially supervised classification - Based on weighted unlabeled samples support vector machine
    Liu, ZG
    Shi, WZ
    Li, DR
    Qin, QQ
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 118 - 129
  • [44] Machine learning-based radar waveform classification for cognitive EW
    Adnan Orduyilmaz
    Ersin Yar
    Mehmet Burak Kocamis
    Mahmut Serin
    Murat Efe
    Signal, Image and Video Processing, 2021, 15 : 1653 - 1662
  • [45] Machine learning-based new classification for immune infiltration of gliomas
    Yuan, Feng
    Wang, Yingshuai
    Yuan, Lei
    Ye, Lei
    Hu, Yangchun
    Cheng, Hongwei
    Li, Yan
    PLOS ONE, 2024, 19 (10):
  • [46] A review on machine learning-based approaches for Internet traffic classification
    Salman, Ola
    Elhajj, Imad H.
    Kayssi, Ayman
    Chehab, Ali
    ANNALS OF TELECOMMUNICATIONS, 2020, 75 (11-12) : 673 - 710
  • [47] Machine learning-based classification of petrofacies in fine laminated limestones
    Genesis, Gallileu
    Gomes, Igor F.
    Barbosa, Jose Antonio
    De Araujo, Araly Fabiana L.
    Ramos, Germano Mario S.
    ANAIS DA ACADEMIA BRASILEIRA DE CIENCIAS, 2024, 96 (01): : e20230041
  • [48] Machine Learning-Based Elephant Flow Classification on the First Packet
    Jurkiewicz, Piotr
    Kadziolka, Bartosz
    Kantor, Miroslaw
    Domzal, Jerzy
    Wojcik, Robert
    IEEE ACCESS, 2024, 12 : 105744 - 105760
  • [49] Machine Learning-based Detection and Classification of Walnut Fungi Diseases
    Khan, Muhammad Alyas
    Ali, Mushtaq
    Shah, Mohsin
    Mahmood, Toqeer
    Ahmad, Muneer
    Jhanjhi, N. Z.
    Bhuiyan, Mohammad Arif Sobhan
    Jaha, Emad Sami
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2021, 30 (03): : 771 - 785
  • [50] Machine learning-based radar waveform classification for cognitive EW
    Orduyilmaz, Adnan
    Yar, Ersin
    Kocamis, Mehmet Burak
    Serin, Mahmut
    Efe, Murat
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (08) : 1653 - 1662