Contribution of frequency compressed temporal fine structure cues to the speech recognition in noise: An implication in cochlear implant signal processing

被引:2
|
作者
Poluboina, Venkateswarlu [1 ]
Pulikala, Aparna [1 ]
Muthu, Arivudai Nambi Pitchai [2 ]
机构
[1] Natl Inst Technol Karnataka, Dept Elect & Commun, Mangalore 575025, Karnataka, India
[2] Dept Audiol & Speech Language Pathol, Mangalore 575001, Karnataka, India
关键词
Cochlear implant signal processing; Temporal fine structure; Proportional frequency compression; Vocoder simulation; Speech recognition; PERFORMANCE; HEARING; ENCODER; PITCH;
D O I
10.1016/j.apacoust.2021.108616
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The study investigated the effect of proportionally frequency compressed encoding of temporal fine structure information on speech perception in noise using vocoder simulations of cochlear implant signal processing. The study proposed a pitch synchronous overlap-add algorithm (PSOLA) for downward frequency shifting of TFS. The speech recognition scores (SRS) were measured at-10 dB, 0 dB, and +10 dB for eight signal processing conditions corresponding to sinewave vocoder without TFS (NOTFS), four unshifted TFS conditions including full band TFS, TFS up to 2000, 1000, and 600 Hz, and three conditions with PSOLA which shifted 2000, 1000 and 600 Hz TFS to 1000, 500 and 300 Hz respectively. The original envelope was unchanged across the conditions. SRS at +10 dB and-10 dB SNR reached ceiling and floor respectively, in most conditions. Hence, SRS at 0 dB SNR was compared across the conditions. The results showed that the SRS was highest with full band TFS and lowest for the NO-TFS condition.The SRS for TFS 600 Hz shifted to 300 Hz through PSOLA was higher than the NO-TFS condition. Study findings suggest that encoding TFS by proportional frequency compression results in better speech perception in noise compared to NO-TFS. An important observation of this current study is that the speech recognition was better than the sine wave vocoder for all TFS conditions including frequency compressed 600 Hz TFS.(c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:5
相关论文
共 47 条
  • [1] Contribution of frequency compressed temporal fine structure cues to the speech recognition in noise: An implication in cochlear implant signal processing (vol 189, 108616, 2022)
    Poluboina, Venkateswarlu
    Pulikala, Aparna
    Pitchaimuthu, Arivudai Nambi
    APPLIED ACOUSTICS, 2022, 192
  • [2] Contributions of Temporal Fine Structure Cues to Chinese Speech Recognition in Cochlear Implant Simulation
    Yang, Lin
    Zhang, Jianping
    Yan, Yonghong
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 765 - 768
  • [3] Temporal processing and speech recognition in cochlear implant users
    Fu, QJ
    NEUROREPORT, 2002, 13 (13) : 1635 - 1639
  • [4] Temporal envelope cues and simulations of cochlear implant signal processing
    Goldsworthy, Raymond L.
    SPEECH COMMUNICATION, 2019, 109 : 24 - 33
  • [5] Effects of temporal fine structure stimulation on Mandarin speech recognition in cochlear implant users
    Qi, Beier
    Krenmayr, Andreas
    Zhang, Ning
    Dong, Ruijuan
    Chen, Xueqing
    Schatzer, Reinhold
    Zierhofer, Clemens
    Liu, Bo
    Han, Demin
    ACTA OTO-LARYNGOLOGICA, 2012, 132 (11) : 1183 - 1191
  • [6] Temporal Fine Structure Processing, Pitch, and Speech Perception in Adult Cochlear Implant Recipients
    D'Alessandro, Hilal Dincer
    Ballantyne, Deborah
    Boyle, Patrick J.
    De Seta, Elio
    DeVincentiis, Marco
    Mancini, Patrizia
    EAR AND HEARING, 2018, 39 (04): : 679 - 686
  • [7] Digisonic® cochlear implant signal processing for speech intelligibility improvement in noise
    Wable, J
    Gallego, S
    Chouard, CH
    Meyer, B
    COCHLEAR IMPLANTS - AN UPDATE, 2002, : 159 - 164
  • [8] Speech recognition outcomes in Mandarin-speaking cochlear implant users with fine structure processing
    Qi, Beier
    Liu, Ziye
    Gu, Xin
    Liu, Bo
    ACTA OTO-LARYNGOLOGICA, 2017, 137 (03) : 286 - 292
  • [9] Contribution of noise reduction pre-processing and microphone directionality strategies in the speech recognition in noise in adult cochlear implant users
    Goffi-Gomez, Maria Valeria Schmidt
    Muniz, Lilian
    Wiemes, Gislaine
    Onuki, Lucia Cristina
    Calonga, Luciane
    Osterne, Francisco Jose
    Kos, Maria Isabel
    Caldas, Fernanda Ferreira
    Cardoso, Carolina
    Cagnacci, Byanka
    EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2021, 278 (08) : 2823 - 2828
  • [10] Contribution of noise reduction pre-processing and microphone directionality strategies in the speech recognition in noise in adult cochlear implant users
    Maria Valeria Schmidt Goffi-Gomez
    Lilian Muniz
    Gislaine Wiemes
    Lucia Cristina Onuki
    Luciane Calonga
    Francisco José Osterne
    Maria Isabel Kós
    Fernanda Ferreira Caldas
    Carolina Cardoso
    Byanka Cagnacci
    European Archives of Oto-Rhino-Laryngology, 2021, 278 : 2823 - 2828