Real-Time Contrast Enhancement to Improve Speech Recognition

被引：11

作者：

Alexander, Joshua M. ^{[1
]}

Jenison, Rick L. ^{[2
]}

Kluender, Keith R. ^{[2
]}

机构：

[1] Purdue Univ, Dept Speech Language & Hearing Sci, W Lafayette, IN 47907 USA

[2] Univ Wisconsin, Dept Psychol, Madison, WI 53706 USA

来源：

PLOS ONE | 2011年 / 6卷 / 09期

基金：

美国国家卫生研究院;

关键词：

HEARING-IMPAIRED LISTENERS; STOP-CONSONANT PERCEPTION; AUDITORY FILTER SHAPES; FREQUENCY-SELECTIVITY; SPECTRAL ENHANCEMENT; COCHLEAR IMPAIRMENTS; PSYCHOPHYSICAL DATA; PRECEDING LIQUID; TUNING CURVES; AID DELAYS;

D O I：

10.1371/journal.pone.0024630

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

An algorithm that operates in real-time to enhance the salient features of speech is described and its efficacy is evaluated. The Contrast Enhancement (CE) algorithm implements dynamic compressive gain and lateral inhibitory sidebands across channels in a modified winner-take-all circuit, which together produce a form of suppression that sharpens the dynamic spectrum. Normal-hearing listeners identified spectrally smeared consonants (VCVs) and vowels (hVds) in quiet and in noise. Consonant and vowel identification, especially in noise, were improved by the processing. The amount of improvement did not depend on the degree of spectral smearing or talker characteristics. For consonants, when results were analyzed according to phonetic feature, the most consistent improvement was for place of articulation. This is encouraging for hearing aid applications because confusions between consonants differing in place are a persistent problem for listeners with sensorineural hearing loss.

引用

页数：11

共 50 条

[1] Lightweight Real-Time Recurrent Models for Speech Enhancement and Automatic Speech Recognition
Dhahbi, Sami
Saleem, Nasir
Gunawan, Teddy Surya
Bourouis, Sami
Ali, Imad
Trigui, Aymen
Algarni, Abeer D.
[J]. INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2024, 8 (06):
[2] REAL-TIME SPEECH RECOGNITION
CAELEN, J
CASTAN, S
PERENNOU, G
[J]. AUTOMATISME, 1972, 17 (03): : 87 - &
[3] REAL-TIME ADAPTIVE CONTRAST ENHANCEMENT
NARENDRA, PM
FITCH, RC
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1981, 3 (06) : 655 - 661
[4] The Recognition of Whispered Speech in Real-Time
Hendrickson, Kristi
Ernest, Danielle
[J]. EAR AND HEARING, 2022, 43 (02): : 554 - 562
[5] Real-time advanced contrast enhancement algorithm
Kim, TC
Huh, CW
Kim, MJ
Chung, BY
Kim, SW
[J]. COMPUTER AND INFORMATION SCIENCES - ISCIS 2003, 2003, 2869 : 691 - 698
[6] A FLEXIBLE ARCHITECTURE FOR REAL-TIME SPEECH RECOGNITION
MORENO, F
ALEXANDRES, S
MENESES, J
[J]. MICROPROCESSING AND MICROPROGRAMMING, 1993, 37 (1-5): : 69 - 72
[7] Real-time recognition of broadcast radio speech
Cook, GD
Christie, JD
Clarkson, PR
Hochberg, MM
Logan, BT
Robinson, AJ
Seymour, CW
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 141 - 144
[8] Real-time Speech Enhancement with GCC-NMF
Wood, Sean U. N.
Rouat, Jean
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2665 - 2669
[9] REAL-TIME SPEECH ENHANCEMENT USING EQUILIBRIATED RNN
Takeuchi, Daiki
Yatabe, Kohei
Koizumi, Yuma
Oikawa, Yasuhiro
Harada, Noboru
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 851 - 855
[10] REAL-TIME PROCESSING OF LOCAL CONTRAST ENHANCEMENT ON FPGA
Kokufuta, Kentaro
Maruyama, Tsutomu
[J]. FPL: 2009 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, 2009, : 288 - 293

← 1 2 3 4 5 →