Diagnosis of Obstructive Sleep Apnea Using Feature Selection, Classification Methods, and Data Grouping Based Age, Sex, and Race

被引:1
|
作者
Sheta, Alaa [1 ]
Thaher, Thaer [2 ]
Surani, Salim R. [3 ]
Turabieh, Hamza [4 ]
Braik, Malik [5 ]
Too, Jingwei [6 ]
Abu-El-Rub, Noor [7 ]
Mafarjah, Majdi [8 ]
Chantar, Hamouda [9 ]
Subramanian, Shyam [10 ]
机构
[1] Southern Connecticut State Univ, Comp Sci Dept, New Haven, CT 06514 USA
[2] Arab Amer Univ, Dept Comp Syst Engn, POB 240, Jenin, Palestine
[3] Texas A&M Univ, Dept Pulm Crit Care & Sleep Med, College Stn, TX 77843 USA
[4] Univ Missouri, Sch Med, Hlth Management & Informat Dept, Columbia, MO 65212 USA
[5] Al Balqa Appl Univ, Dept Comp Sci, Salt 19117, Jordan
[6] Univ Teknikal Malaysia Melaka, Fac Elect Engn, Hang Tuah Jaya, Durian Tunggal 76100, Melaka, Malaysia
[7] Univ Kansas, Ctr Med Informat & Enterprise Analyt, Med Ctr, Kansas City, KS 66160 USA
[8] Birzeit Univ, Dept Comp Sci, POB 14, Birzeit, Palestine
[9] Sebha Univ, Fac Informat Technol, Sebha 18758, Libya
[10] Sutter Hlth, Pulm Crit Care & Sleep Med, Tracy, CA 95376 USA
关键词
obstructive sleep apnea; grouping; feature selection; machine learning; OPTIMIZATION ALGORITHM; GENDER; HYPERTENSION; MODEL;
D O I
10.3390/diagnostics13142417
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Obstructive sleep apnea (OSA) is a prevalent sleep disorder that affects approximately 3-7% of males and 2-5% of females. In the United States alone, 50-70 million adults suffer from various sleep disorders. OSA is characterized by recurrent episodes of breathing cessation during sleep, thereby leading to adverse effects such as daytime sleepiness, cognitive impairment, and reduced concentration. It also contributes to an increased risk of cardiovascular conditions and adversely impacts patient overall quality of life. As a result, numerous researchers have focused on developing automated detection models to identify OSA and address these limitations effectively and accurately. This study explored the potential benefits of utilizing machine learning methods based on demographic information for diagnosing the OSA syndrome. We gathered a comprehensive dataset from the Torr Sleep Center in Corpus Christi, Texas, USA. The dataset comprises 31 features, including demographic characteristics such as race, age, sex, BMI, Epworth score, M. Friedman tongue position, snoring, and more. We devised a novel process encompassing pre-processing, data grouping, feature selection, and machine learning classification methods to achieve the research objectives. The classification methods employed in this study encompass decision tree (DT), naive Bayes (NB), k-nearest neighbor (kNN), support vector machine (SVM), linear discriminant analysis (LDA), logistic regression (LR), and subspace discriminant (Ensemble) classifiers. Through rigorous experimentation, the results indicated the superior performance of the optimized kNN and SVM classifiers for accurately classifying sleep apnea. Moreover, significant enhancements in model accuracy were observed when utilizing the selected demographic variables and employing data grouping techniques. For instance, the accuracy percentage demonstrated an approximate improvement of 4.5%, 5%, and 10% with the feature selection approach when applied to the grouped data of Caucasians, females, and individuals aged 50 or below, respectively. Furthermore, a comparison with prior studies confirmed that effective data grouping and proper feature selection yielded superior performance in OSA detection when combined with an appropriate classification method. Overall, the findings of this research highlight the importance of leveraging demographic information, employing proper feature selection techniques, and utilizing optimized classification models for accurate and efficient OSA diagnosis.
引用
收藏
页数:28
相关论文
共 50 条