A Bag of Wavelet Features for Snore Sound Classification

被引:0
|
作者
Kun Qian
Maximilian Schmitt
Christoph Janott
Zixing Zhang
Clemens Heiser
Winfried Hohenhorst
Michael Herzog
Werner Hemmert
Björn Schuller
机构
[1] Technische Universität München,Machine Intelligence & Signal Processing Group, MMK
[2] Universität Augsburg,ZD.B Chair of Embedded Intelligence for Health Care & Wellbeing
[3] Technische Universität München,Munich School of Bioengineering
[4] Imperial College London,GLAM – Group on Language, Audio & Music, Department of Computing
[5] audEERING GmbH,Department of Otorhinolaryngology/Head and Neck Surgery, Klinikum rechts der Isar
[6] Technische Universität München,Department of Otorhinolaryngology/Head and Neck Surgery
[7] Alfried Krupp Krankenhaus,Department of Otorhinolaryngology/Head and Neck Surgery
[8] Carl-Thiem-Klinikum Cottbus,undefined
来源
关键词
Snore sound; Obstructive sleep apnea; Drug-induced sleep endoscopy; Wavelets; Bag-of-audio-words;
D O I
暂无
中图分类号
学科分类号
摘要
Snore sound (SnS) classification can support a targeted surgical approach to sleep related breathing disorders. Using machine listening methods, we aim to find the location of obstruction and vibration within a subject’s upper airway. Wavelet features have been demonstrated to be efficient in the recognition of SnSs in previous studies. In this work, we use a bag-of-audio-words approach to enhance the low-level wavelet features extracted from SnS data. A Naïve Bayes model was selected as the classifier based on its superiority in initial experiments. We use SnS data collected from 219 independent subjects under drug-induced sleep endoscopy performed at three medical centres. The unweighted average recall achieved by our proposed method is 69.4%, which significantly (p<0.005,\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p<0.005,$$\end{document} one-tailed z-test) outperforms the official baseline (58.5%), and beats the winner (64.2%) of the INTERSPEECH ComParE Challenge 2017 Snoring sub-challenge. In addition, the conventionally used features like formants, mel-scale frequency cepstral coefficients, subband energy ratios, spectral frequency features, and the features extracted by the openSMILE toolkit are compared with our proposed feature set. The experimental results demonstrate the effectiveness of the proposed method in SnS classification.
引用
收藏
页码:1000 / 1011
页数:11
相关论文
共 50 条
  • [1] A Bag of Wavelet Features for Snore Sound Classification
    Qian, Kun
    Schmitt, Maximilian
    Janott, Christoph
    Zhang, Zixing
    Heiser, Clemens
    Hohenhorst, Winfried
    Herzog, Michael
    Hemmert, Werner
    Schuller, Bjoern
    [J]. ANNALS OF BIOMEDICAL ENGINEERING, 2019, 47 (04) : 1000 - 1011
  • [2] WAVELET FEATURES FOR CLASSIFICATION OF VOTE SNORE SOUNDS
    Qian, Kun
    Janott, Christoph
    Zhang, Zixing
    Heiser, Clemens
    Schuller, Bjoern
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 221 - 225
  • [3] Snore Sound Classification Using Image-based Deep Spectrum Features
    Amiriparian, Shahin
    Gerczuk, Maurice
    Ottl, Sandra
    Cummins, Nicholas
    Freitag, Michael
    Pugachevskiy, Sergey
    Baird, Alice
    Schuller, Bjoern
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3512 - 3516
  • [4] Unsupervised Classification of Respiratory Sound Signal into Snore/No-Snore classes
    Azarbarzin, Ali
    Moussavi, Zahra
    [J]. 2010 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2010, : 3666 - 3669
  • [5] Snore-GANs: Improving Automatic Snore Sound Classification With Synthesized Data
    Zhang, Zixing
    Han, Jing
    Qian, Kun
    Janott, Christoph
    Guo, Yanan
    Schuller, Bjoern
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (01) : 300 - 310
  • [6] GAMMATONE WAVELET FEATURES FOR SOUND CLASSIFICATION IN SURVEILLANCE APPLICATIONS
    Valero, Xavier
    Alias, Francesc
    [J]. 2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 1658 - 1662
  • [7] Low Level Texture Features for Snore Sound Discrimination
    Demir, Fatih
    Sengur, Abdulkadir
    Cummins, Nicholas
    Amiriparian, Shahin
    Schuller, Bjoern
    [J]. 2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 413 - 416
  • [8] An 'End-to-Evolution' Hybrid Approach for Snore Sound Classification
    Freitag, Michael
    Amiriparian, Shahin
    Cummins, Nicholas
    Gerczuk, Maurice
    Schuller, Bjoern
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3507 - 3511
  • [9] Improving Classification Accuracy of Heart Sound Recordings by Wavelet Filter and Multiple Features
    Wang, Xinpei
    Lie, Yuanyang
    [J]. 2016 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), VOL 43, 2016, 43 : 1149 - 1152
  • [10] Heart Sound Classification from Wavelet Decomposed Signal using Morphological and Statistical Features
    Munia, Tamanna T. K.
    Tavakolian, Kouhyar
    Verma, Ajay K.
    Zakeri, Vahid
    Khosrow-Khavar, Farzad
    Fazel-Rezai, Reza
    Akhbardeh, Alireza
    [J]. 2016 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), VOL 43, 2016, 43 : 597 - 600