SPEECH INTELLIGIBILITY PREDICTION AS A CLASSIFICATION PROBLEM

被引:0
|
作者
Andersen, Asger Heidemann [1 ]
Schoenmaker, Esther [2 ]
van de Par, Steven [2 ]
机构
[1] Oticon AS, DK-2765 Smorum, Denmark
[2] Carl von Ossietzky Univ Oldenburg, Cluster Excellence Hearing4all, Dept Med Phys & Acoust, D-26111 Oldenburg, Germany
关键词
Speech intelligibility prediction; speech enhancement; binary classification; applications of machine learning; RECEPTION THRESHOLD; NOISE; RECOGNITION; PERCEPTION; INDEX;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech Intelligibility Prediction (SIP) algorithms are becoming increasingly popular for objective evaluation of speech processing algorithms and transmission systems. Most often, SIP algorithms aim to predict the average intelligibility of an average listener in some specific listening condition. In the present work, we instead consider the aim of predicting the intelligibility of singlewords. I.e. we attempt to predict whether or not a subject in a listening experiment was able to correctly repeat a particular word. We base the prediction on a noisy and potentially processed/degraded recording of the spoken word (as presented to a subject), as well as a clean reference recording of the spoken word. The problem can be treated as a supervised binary classification problem of predicting whether a specific word will or will not be understood. We investigate a number of different ways to extract features from the degraded and clean speech samples. The classification is carried out by means of Fisher discriminant analysis. Despite the large variability of speech intelligibility experiments, it is possible to obtain a considerable degree of predictive power.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] PREDICTION OF SPEECH INTELLIGIBILITY IN NOISE
    PICKETT, JM
    KRYTER, KD
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1954, 26 (05): : 952 - 953
  • [2] Prediction of Arabic speech intelligibility for speech hall
    El Awady, R
    El Malawany, AI
    El Messiry, MA
    Fayed, HS
    2002 IEEE PROCEEDINGS OF THE NINETEENTH NATIONAL RADIO SCIENCE CONFERENCE, VOLS 1 AND 2, 2002, : 214 - 223
  • [3] NON-INTRUSIVE BINAURAL PREDICTION OF SPEECH INTELLIGIBILITY BASED ON PHONEME CLASSIFICATION
    Rossbach, Jana
    Roettges, Saskia
    Hauth, Christopher F.
    Brand, Thomas
    Meyer, Bernd T.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 396 - 400
  • [4] PREDICTION OF INTELLIGIBILITY OF NONLINEARLY PROCESSED SPEECH
    LUDVIGSEN, C
    ELBERLING, C
    KEIDSER, G
    POULSEN, T
    ACTA OTO-LARYNGOLOGICA, 1990, : 190 - 195
  • [5] Classification of speech intelligibility in Parkinson's disease
    Khan, Taha
    Westin, Jerker
    Dougherty, Mark
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2014, 34 (01) : 35 - 45
  • [6] PREDICTION OF SPEECH INTELLIGIBILITY AT HIGH NOISE LEVELS
    PICKETT, JM
    POLLACK, I
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1957, 29 (11): : 1262 - 1263
  • [7] Modified ESTOI for improving speech intelligibility prediction
    Alghamdi, Ahmed
    Chan, Wai-Yip
    2020 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2020,
  • [8] An improved speech transmission index for intelligibility prediction
    Schwerin, Belinda
    Paliwal, Kuldip
    SPEECH COMMUNICATION, 2014, 65 : 9 - 19
  • [9] NUMERICAL PREDICTION OF ECHOGRAMS AND OF INTELLIGIBILITY OF SPEECH IN ROOMS
    SANTON, F
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 59 (06): : 1399 - 1405
  • [10] Speech Intelligibility Prediction Based on Mutual Information
    Jensen, Jesper
    Taal, Cees H.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 430 - 440