Contribution of frequency compressed temporal fine structure cues to the speech recognition in noise: An implication in cochlear implant signal processing

被引:2
|
作者
Poluboina, Venkateswarlu [1 ]
Pulikala, Aparna [1 ]
Muthu, Arivudai Nambi Pitchai [2 ]
机构
[1] Natl Inst Technol Karnataka, Dept Elect & Commun, Mangalore 575025, Karnataka, India
[2] Dept Audiol & Speech Language Pathol, Mangalore 575001, Karnataka, India
关键词
Cochlear implant signal processing; Temporal fine structure; Proportional frequency compression; Vocoder simulation; Speech recognition; PERFORMANCE; HEARING; ENCODER; PITCH;
D O I
10.1016/j.apacoust.2021.108616
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The study investigated the effect of proportionally frequency compressed encoding of temporal fine structure information on speech perception in noise using vocoder simulations of cochlear implant signal processing. The study proposed a pitch synchronous overlap-add algorithm (PSOLA) for downward frequency shifting of TFS. The speech recognition scores (SRS) were measured at-10 dB, 0 dB, and +10 dB for eight signal processing conditions corresponding to sinewave vocoder without TFS (NOTFS), four unshifted TFS conditions including full band TFS, TFS up to 2000, 1000, and 600 Hz, and three conditions with PSOLA which shifted 2000, 1000 and 600 Hz TFS to 1000, 500 and 300 Hz respectively. The original envelope was unchanged across the conditions. SRS at +10 dB and-10 dB SNR reached ceiling and floor respectively, in most conditions. Hence, SRS at 0 dB SNR was compared across the conditions. The results showed that the SRS was highest with full band TFS and lowest for the NO-TFS condition.The SRS for TFS 600 Hz shifted to 300 Hz through PSOLA was higher than the NO-TFS condition. Study findings suggest that encoding TFS by proportional frequency compression results in better speech perception in noise compared to NO-TFS. An important observation of this current study is that the speech recognition was better than the sine wave vocoder for all TFS conditions including frequency compressed 600 Hz TFS.(c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:5
相关论文
共 47 条
  • [41] Relations between frequency selectivity, temporal fine-structure processing, and speech reception in impaired hearing
    Strelcyk, Olaf
    Dau, Torsten
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (05): : 3328 - 3345
  • [42] Temporal fine structure: associations with cognition and speech-in-noise recognition in adults with normal hearing or hearing impairment
    Ellis, Rachel J.
    Ronnberg, Jerker
    INTERNATIONAL JOURNAL OF AUDIOLOGY, 2022, 61 (09) : 778 - 786
  • [43] Effects of steep high-frequency hearing loss on speech recognition using temporal fine structure in low-frequency region
    Li, Bei
    Hou, Limin
    Xu, Li
    Wang, Hui
    Yang, Guang
    Yin, Shankai
    Feng, Yanmei
    HEARING RESEARCH, 2015, 326 : 66 - 74
  • [44] Comparison of the fine structure processing (FSP) strategy and the CIS strategy used in the MED-EL cochlear implant system: Speech intelligibility and music sound quality
    Magnusson, Lennart
    INTERNATIONAL JOURNAL OF AUDIOLOGY, 2011, 50 (04) : 279 - 287
  • [45] Effects of spectral smearing and temporal fine-structure distortion on the fluctuating-masker benefit for speech at a fixed signal-to-noise ratio
    Bernstein, Joshua G. W.
    Brungart, Douglas S.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 130 (01): : 473 - 488
  • [46] Low-frequency fine-structure cues allow for the online use of lexical stress during spoken-word recognition in spectrally degraded speech
    Kong, Ying-Yee
    Jesse, Alexandra
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (01): : 373 - 382
  • [47] Musical Sound Quality in Cochlear Implant Users: A Comparison in Bass Frequency Perception Between Fine Structure Processing and High-Definition Continuous Interleaved Sampling Strategies
    Roy, Alexis T.
    Carver, Courtney
    Jiradejvong, Patpong
    Limb, Charles J.
    EAR AND HEARING, 2015, 36 (05): : 582 - 590