Voice Pathology Classification by Using Features from High-Speed Videos

被引:0
|
作者
Voigt, Daniel [1 ]
Doellinger, Michael [1 ]
Yang, Anxiong [1 ]
Eysholdt, Ulrich [1 ]
Lohscheller, Joerg [1 ]
机构
[1] Univ Hosp Erlangen, Dept Phoniatr & Pediat Audiol, D-91054 Erlangen, Germany
关键词
VOCAL FOLD VIBRATIONS; VIDEOKYMOGRAPHY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For the diagnosis of pathological voices it; is of particular importance to examine the dynamic properties of the underlying vocal fold (VF) movements occurring at a fundamental frequency of 100-300 Hz. To this end, a patient's laryngeal oscillation patterns are captured with state-of-the-art endoscopic high-speed (HS) camera systems capable of recording 4000 frames/second. To date the clinical analysis of these HS videos is commonly performed in a subjective manner via slow-motion playback. Hence, the resulting diagnoses are inherently error-prone, exhibiting high inter-rater variability. In this paper an objective method for overcoming this drawback is presented which employs a quantitative description and classification approach based on a. novel image analysis,strategy called Phonovibrography. By extracting the relevant VF, movement, information from HS videos the spatio-temporal patterns of laryngeal activity are captured using a set of specialized features. As reference for performance, conventional voice analysis features are also computed. The derived features are analyzed with different machine learning (ML) algorithms regarding clinically meaningful classification tasks. The applicability of the approach is demonstrated using a clinical data. set comprising individuals with normophonic and paralytic voices. The results indicate that the presented approach holds a lot of promise for providing reliable diagnosis support, in the future.
引用
收藏
页码:315 / 324
页数:10
相关论文
共 50 条
  • [21] An automated method for collecting biomechanical data from high-speed videos of fish feeding
    Fiedler, K.
    Cooper, W. J.
    INTEGRATIVE AND COMPARATIVE BIOLOGY, 2019, 59 : E70 - E70
  • [22] DETECTION AND CLASSIFICATION OF VOICE PATHOLOGY USING FEATURE SELECTION
    Al Mojaly, Malak
    Muhammad, Ghulam
    Alsulaiman, Mansour
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 571 - 577
  • [23] High-speed action recognition and localization in compressed domain videos
    Yeo, Chuohao
    Ahammad, Parvez
    Ramchandran, Karman
    Sastry, S. Shankar
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2008, 18 (08) : 1006 - 1015
  • [24] High-Speed and Accurate Laser Scan Matching Using Classified Features
    Shu, Lei
    Xu, Hu
    Huang, May
    2013 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS (ROSE 2013), 2013,
  • [25] High-speed object matching and localization using gradient orientation features
    Xu, Xinyu
    van Beek, Peter
    Feng, Xiaofan
    INTELLIGENT ROBOTS AND COMPUTER VISION XXXI: ALGORITHMS AND TECHNIQUES, 2014, 9025
  • [26] HIGH-SPEED EVENT COUNTING AND CLASSIFICATION USING A DICTIONARY HASH TECHNIQUE
    MCKENNEY, PE
    PROCEEDINGS OF THE 1989 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, VOL 3: ALGORITHMS AND APPLICATIONS, 1989, : 71 - 75
  • [27] Using ALISA for high-speed classification of the components and their concentrations in mixtures of radioisotopes
    Portnoy, D
    Bock, P
    Heimberg, P
    Moore, E
    PENETRATING RADIATION SYSTEMS AND APPLICATIONS VI, 2004, 5541 : 1 - 10
  • [28] Voice Pathology Detection and Classification Using MPEG-7 Audio Low-Level Features
    Muhammad, Ghulam
    Melhem, Moutasem
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3594 - 3598
  • [29] Visualization of high-speed phenomena using high-speed infrared camera
    Yaoita, T.
    Marcotte, F.
    SELECTED PAPERS FROM THE 31ST INTERNATIONAL CONGRESS ON HIGH-SPEED IMAGING AND PHOTONICS, 2017, 10328
  • [30] Voice Pathology Detection and Classification Using Auto-Correlation and Entropy Features in Different Frequency Regions
    Al-Nasheri, Ahmed
    Muhammad, Ghulam
    Alsulaiman, Mansour
    Ali, Zulfiqar
    Malki, Khalid H.
    Mesallam, Tamer A.
    Ibrahim, Mohamed Farahat
    IEEE ACCESS, 2018, 6 : 6961 - 6974