Prediction of Speech Intelligibility Using a Neurogram Orthogonal Polynomial Measure (NOPM)

被引:20
|
作者
Mamun, Nursadul [1 ]
Jassim, Wissam A. [1 ]
Zilany, Muhammad S. A. [1 ]
机构
[1] Univ Malaya, Dept Biomed Engn, Kuala Lumpur 50603, Malaysia
关键词
Auditory-nerve model; neurogram; orthogonal moment; sensorineural hearing loss; speech intelligibility; AUDITORY-NERVE FIBERS; IMAGE QUALITY ASSESSMENT; TEMPORAL FINE-STRUCTURE; PHENOMENOLOGICAL MODEL; FREQUENCY-MODULATION; RECEPTION THRESHOLD; WORD RECOGNITION; VOWEL EPSILON; NOISE; RESPONSES;
D O I
10.1109/TASLP.2015.2401513
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Sensorineural hearing loss (SNHL) is an increasingly prevalent condition, resulting from damage to the inner ear and causing a reduction in speech intelligibility. This paper proposes a new speech intelligibility prediction metric, the neurogram orthogonal polynomial measure (NOPM). This metric applies orthogonal moments to the auditory neurogram to predict speech intelligibility for listeners with and without hearing loss. The model simulates the responses of auditory-nerve fibers to speech signals under quiet and noisy conditions. Neurograms were created using a physiologically based computational model of the auditory periphery. A well-known orthogonal polynomial measure, Krawtchouk moments, was applied to extract features from the auditory neurogram. The predicted intelligibility scores were compared to subjective results, and NOPM showed a good fit with the subjective scores for normal listeners and also for listeners with hearing loss. The proposed metric has a realistic and wider dynamic range than corresponding existing metrics, such as mean structural similarity index measure and neurogram similarity index measure, and the predicted scores are also well-separated as a function of hearing loss. The application of this metric could be extended for assessing hearing-aid and speech-enhancement algorithms.
引用
收藏
页码:760 / 773
页数:14
相关论文
共 50 条
  • [1] Speech intelligibility prediction using a Neurogram Similarity Index Measure
    Hines, Andrew
    Harte, Naomi
    SPEECH COMMUNICATION, 2012, 54 (02) : 306 - 320
  • [2] Reference-Free Assessment of Speech Intelligibility Using Bispectrum of an Auditory Neurogram
    Hossain, Mohammad E.
    Jassim, Wissam A.
    Zilany, Muhammad S. A.
    PLOS ONE, 2016, 11 (03):
  • [3] Speech quality assessment using 2D neurogram orthogonal moments
    Jassim, Wissam A.
    Zilany, Muhammad S. A.
    SPEECH COMMUNICATION, 2016, 80 : 34 - 48
  • [4] On the feasibility of using a bispectral measure as a nonintrusive predictor of speech intelligibility
    Hossain, Md Ekramul
    Zilany, Muhammad S. A.
    Davies-Venn, Evelyn
    COMPUTER SPEECH AND LANGUAGE, 2019, 57 : 59 - 80
  • [5] Using Acoustic Parameters for Intelligibility Prediction of Reverberant Speech
    Alghamdi, Ahmed
    Chan, Wai-Yip
    Fogerty, Daniel
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2534 - 2538
  • [6] ON MUTUAL INFORMATION AS A MEASURE OF SPEECH INTELLIGIBILITY
    Taghia, Jalal
    Martin, Rainer
    Hendriks, Richard C.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 65 - 68
  • [7] PREDICTION OF SPEECH INTELLIGIBILITY IN NOISE
    PICKETT, JM
    KRYTER, KD
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1954, 26 (05): : 952 - 953
  • [8] Prediction of Arabic speech intelligibility for speech hall
    El Awady, R
    El Malawany, AI
    El Messiry, MA
    Fayed, HS
    2002 IEEE PROCEEDINGS OF THE NINETEENTH NATIONAL RADIO SCIENCE CONFERENCE, VOLS 1 AND 2, 2002, : 214 - 223
  • [9] Using Automatic Speech Recognition to Measure the Intelligibility of Speech Synthesized from Brain Signals
    Varshney, Suvi
    Farias, Dana
    Brandman, David M.
    Stavisky, Sergey D.
    Miller, Lee M.
    2023 11TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING, NER, 2023,
  • [10] Nonintrusive Speech Intelligibility Prediction Using Convolutional Neural Networks
    Andersen, Asger Heidemann
    de Haan, Jan Mark
    Tan, Zheng-Hua
    Jensen, Jesper
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (10) : 1925 - 1939