Real-Time Contrast Enhancement to Improve Speech Recognition

被引:11
|
作者
Alexander, Joshua M. [1 ]
Jenison, Rick L. [2 ]
Kluender, Keith R. [2 ]
机构
[1] Purdue Univ, Dept Speech Language & Hearing Sci, W Lafayette, IN 47907 USA
[2] Univ Wisconsin, Dept Psychol, Madison, WI 53706 USA
来源
PLOS ONE | 2011年 / 6卷 / 09期
基金
美国国家卫生研究院;
关键词
HEARING-IMPAIRED LISTENERS; STOP-CONSONANT PERCEPTION; AUDITORY FILTER SHAPES; FREQUENCY-SELECTIVITY; SPECTRAL ENHANCEMENT; COCHLEAR IMPAIRMENTS; PSYCHOPHYSICAL DATA; PRECEDING LIQUID; TUNING CURVES; AID DELAYS;
D O I
10.1371/journal.pone.0024630
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
An algorithm that operates in real-time to enhance the salient features of speech is described and its efficacy is evaluated. The Contrast Enhancement (CE) algorithm implements dynamic compressive gain and lateral inhibitory sidebands across channels in a modified winner-take-all circuit, which together produce a form of suppression that sharpens the dynamic spectrum. Normal-hearing listeners identified spectrally smeared consonants (VCVs) and vowels (hVds) in quiet and in noise. Consonant and vowel identification, especially in noise, were improved by the processing. The amount of improvement did not depend on the degree of spectral smearing or talker characteristics. For consonants, when results were analyzed according to phonetic feature, the most consistent improvement was for place of articulation. This is encouraging for hearing aid applications because confusions between consonants differing in place are a persistent problem for listeners with sensorineural hearing loss.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Lightweight Real-Time Recurrent Models for Speech Enhancement and Automatic Speech Recognition
    Dhahbi, Sami
    Saleem, Nasir
    Gunawan, Teddy Surya
    Bourouis, Sami
    Ali, Imad
    Trigui, Aymen
    Algarni, Abeer D.
    [J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2024, 8 (06):
  • [2] REAL-TIME SPEECH RECOGNITION
    CAELEN, J
    CASTAN, S
    PERENNOU, G
    [J]. AUTOMATISME, 1972, 17 (03): : 87 - &
  • [3] REAL-TIME ADAPTIVE CONTRAST ENHANCEMENT
    NARENDRA, PM
    FITCH, RC
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1981, 3 (06) : 655 - 661
  • [4] The Recognition of Whispered Speech in Real-Time
    Hendrickson, Kristi
    Ernest, Danielle
    [J]. EAR AND HEARING, 2022, 43 (02): : 554 - 562
  • [5] Real-time advanced contrast enhancement algorithm
    Kim, TC
    Huh, CW
    Kim, MJ
    Chung, BY
    Kim, SW
    [J]. COMPUTER AND INFORMATION SCIENCES - ISCIS 2003, 2003, 2869 : 691 - 698
  • [6] A FLEXIBLE ARCHITECTURE FOR REAL-TIME SPEECH RECOGNITION
    MORENO, F
    ALEXANDRES, S
    MENESES, J
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1993, 37 (1-5): : 69 - 72
  • [7] Real-time recognition of broadcast radio speech
    Cook, GD
    Christie, JD
    Clarkson, PR
    Hochberg, MM
    Logan, BT
    Robinson, AJ
    Seymour, CW
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 141 - 144
  • [8] Real-time Speech Enhancement with GCC-NMF
    Wood, Sean U. N.
    Rouat, Jean
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2665 - 2669
  • [9] REAL-TIME SPEECH ENHANCEMENT USING EQUILIBRIATED RNN
    Takeuchi, Daiki
    Yatabe, Kohei
    Koizumi, Yuma
    Oikawa, Yasuhiro
    Harada, Noboru
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 851 - 855
  • [10] REAL-TIME PROCESSING OF LOCAL CONTRAST ENHANCEMENT ON FPGA
    Kokufuta, Kentaro
    Maruyama, Tsutomu
    [J]. FPL: 2009 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, 2009, : 288 - 293