Speech recognition under conditions of frequency-place compression and expansion

Cited by: 80
Authors
Baskent, D
Shannon, RV
Affiliations
[1] House Ear Res Inst, Dept Auditory Implants & Percept, Los Angeles, CA 90057 USA
[2] Univ So Calif, Dept Biomed Engn, Los Angeles, CA 90089 USA
DOI
10.1121/1.1558357
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
In normal acoustic hearing the mapping of acoustic frequency information onto the appropriate cochlear place is a natural biological function, but in cochlear implants it is controlled by the speech processor. The cochlear tonotopic range of the implant is determined by the length and insertion depth of the electrode array. Conventional cochlear implant electrode arrays are designed for an insertion of 25 mm inside the round window and the active electrodes occupy 16 mm, which would place the electrodes in a cochlear region corresponding to an acoustic frequency range of 500-6000 Hz. However, some implant speech processors map an acoustic frequency range from 150 to 10 000 Hz onto these electrodes. While this mapping preserves the entire range of acoustic frequency information, it also results in a compression of the tonotopic pattern of speech information delivered to the brain. The present study measured the effects of such a compression of frequency-to-place mapping on speech recognition using acoustic simulations. Also measured were the effects of an expansion of the frequency-to-place mapping, which produces an expanded representation of speech in the cochlea. Such an expanded representation might improve speech recognition by improving the relative spatial (tonotopic) resolution, like an "acoustic fovea." Phoneme and sentence recognition was measured as a function of linear (in terms of cochlear distance) frequency-place compression and expansion. These conditions were presented to normal-hearing listeners using a noise-band vocoder, simulating cochlear implant electrodes with different insertion depths and different numbers of electrode channels. The cochlear tonotopic range was held constant by employing the same noise carrier bands for each condition, while the analysis frequency range was either compressed or expanded relative to the carrier frequency range. For each condition, the result was compared to that of the perfect frequency-place match, where the carrier and the analysis bands were perfectly matched. Speech recognition in the matched conditions was generally better than any condition of frequency-place expansion and compression, even when the matched condition eliminated a considerable amount of acoustic information. This result suggests that speech recognition, at least without training, is dependent on the mapping of acoustic frequency information onto the appropriate cochlear place. © 2003 Acoustical Society of America.
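The place-to-frequency arithmetic in the abstract can be checked with a short sketch. The abstract does not say which cochlear map it relies on, so the Greenwood (1990) function with its commonly quoted human constants (A = 165.4, a = 0.06 per mm, k = 0.88, 35 mm cochlear length) is an assumption here, as are the function names and the reading that the 16 mm of active electrodes sit at the apical end of the 25 mm insertion.

```python
import math

# ASSUMPTION: Greenwood (1990) human frequency-position constants.
# The abstract does not name the map used; these values are illustrative.
A_HZ = 165.4               # scaling constant in Hz
ALPHA_PER_MM = 0.06        # slope per mm of cochlear distance
K = 0.88                   # low-frequency correction term
COCHLEA_LENGTH_MM = 35.0   # assumed average human cochlear length


def place_to_frequency(mm_from_apex: float) -> float:
    """Characteristic frequency (Hz) at a distance (mm) from the apex."""
    return A_HZ * (10 ** (ALPHA_PER_MM * mm_from_apex) - K)


def frequency_to_place(freq_hz: float) -> float:
    """Inverse map: distance from the apex (mm) for a frequency (Hz)."""
    return math.log10(freq_hz / A_HZ + K) / ALPHA_PER_MM


def array_span_from_apex(insertion_mm: float, active_mm: float) -> tuple:
    """Apical and basal ends of the active array, in mm from the apex.

    ASSUMPTION: insertion depth is measured from the base (round window)
    and the active electrodes occupy the deepest part of the insertion.
    """
    apical_end = COCHLEA_LENGTH_MM - insertion_mm
    return apical_end, apical_end + active_mm


if __name__ == "__main__":
    apical, basal = array_span_from_apex(insertion_mm=25.0, active_mm=16.0)
    print(f"Electrode span: {apical:.1f} to {basal:.1f} mm from apex")
    print(f"Matched range: {place_to_frequency(apical):.0f} to "
          f"{place_to_frequency(basal):.0f} Hz")

    # Cochlear extent that a 150 to 10 000 Hz analysis range would need.
    needed_mm = frequency_to_place(10000.0) - frequency_to_place(150.0)
    print(f"Analysis range spans {needed_mm:.1f} mm, "
          f"delivered over {basal - apical:.1f} mm")
```

Under these assumptions the electrode span lands close to the 500-6000 Hz region quoted in the abstract, and the 150 to 10 000 Hz analysis range corresponds to a noticeably longer stretch of cochlea than the 16 mm array, which is the compression the study examines.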
Pages: 2064-2076
Page count: 13