Intelligibility predictors and neural representation of speech

被引:3
|
作者
Lobdell, B. E. [1 ]
Allen, J. B. [1 ]
Hasegawa-Johnson, M. A. [1 ]
机构
[1] Univ Illinois, Beckman Inst, Urbana, IL 61820 USA
基金
美国国家科学基金会;
关键词
Speech perception; Articulation; Index; Speech recognition; Speech representation; ARTICULATION INDEX; <M>-<N> DISTINCTION; WORD RECOGNITION; PERCEPTION; NOISE; MODEL; IDENTIFICATION; PLACE; CUES; INTEGRATION;
D O I
10.1016/j.specom.2010.08.016
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Intelligibility predictors tell us a great deal about human speech perception, in particular which acoustic factors strongly effect human behavior and which do not A particular intelligibility predictor, the Articulation Index (AI), is interesting because it models human behavior in noise, and its form has implications about representation of speech in the brain Specifically, the Articulation Index implies that a listener pre-consciously estimates the making noise distribution and uses it to classify time/frequency samples as speech or non-speech We classify consonants using representations of speech and noise which are consistent with this hypothesis and determine whether their error rate and error patterns are more or less consistent with human behavior than representations typical of automatic speech recognition systems The new representations resulted in error patterns more similar to humans in cases where the testing and training data sets do not have the same masking noise spectrum (C) 2010 Elsevier B V All rights reserved
引用
收藏
页码:185 / 194
页数:10
相关论文
共 50 条
  • [21] Binaural Speech Intelligibility Estimation Using Deep Neural Networks
    Kondo, Kazuhiro
    Taira, Kazuya
    Kobayashi, Yosuke
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1858 - 1862
  • [22] Neural Decoding of the Speech Envelope: Effects of Intelligibility and Spectral Degradation
    Macintyre, Alexis Deighton
    Carlyon, Robert P.
    Goehring, Tobias
    TRENDS IN HEARING, 2024, 28
  • [23] Neural tracking of speech envelope does not unequivocally reflect intelligibility
    Koesem, Anne
    Dai, Bohan
    McQueen, James M.
    Hagoort, Peter
    NEUROIMAGE, 2023, 272
  • [24] Nonintrusive Speech Intelligibility Prediction Using Convolutional Neural Networks
    Andersen, Asger Heidemann
    de Haan, Jan Mark
    Tan, Zheng-Hua
    Jensen, Jesper
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (10) : 1925 - 1939
  • [25] Neural speech tracking shifts from the syllabic to the modulation rate of speech as intelligibility decreases
    Schmidt, Fabian
    Chen, Ya-Ping
    Keitel, Anne
    Roesch, Sebastian
    Hannemann, Ronny
    Serman, Maja
    Hauswald, Anne
    Weisz, Nathan
    PSYCHOPHYSIOLOGY, 2023, 60 (11)
  • [26] SPEECH BREATHING AND SPEECH INTELLIGIBILITY
    Hudgins, Clarence V.
    VOLTA REVIEW, 1946, 48 (11) : 642 - 644
  • [27] The Representation of Speech in Deep Neural Networks
    Scharenborg, Odette
    van der Gouw, Nikki
    Larson, Martha
    Marchiori, Elena
    MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 194 - 205
  • [28] Neural representation of amplified speech sounds
    Tremblay, KL
    Billings, CJ
    Friesen, LM
    Souza, PE
    EAR AND HEARING, 2006, 27 (02): : 93 - 103
  • [29] Acoustic Predictors of Intelligibility for Segmentally Interrupted Speech: Temporal Envelope, Voicing, and Duration
    Fogerty, Daniel
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2013, 56 (05): : 1402 - 1408
  • [30] Neural Network Based Curve Fitting to Enhance the Intelligibility of Dysarthric Speech
    Pavithra, K. S.
    Chandrashekar, H. M.
    Karjigi, Veena
    SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 545 - 553