SPEAKER-INDEPENDENT BRAIN ENHANCED SPEECH DENOISING

Cited by: 5

Authors:

Hosseini, Maryam [1]

Celotti, Luca [1]

Plourde, Eric [1]

Affiliations:

[1] Univ Sherbrooke, NECOTIS, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada

Source:

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021

Keywords:

speech enhancement; deep learning; EEG signals;

DOI:

10.1109/ICASSP39728.2021.9414969

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

The auditory system is extremely efficient at extracting attended auditory information in the presence of competing speakers. Single-channel speech enhancement algorithms, however, largely lack this ability. In this paper, we propose a novel deep learning method, referred to as the Brain Enhanced Speech Denoiser (BESD), that takes advantage of the attended auditory information present in the listener's brain activity to denoise multi-talker speech. We use this information to modulate the features learned from the sound and the brain activity in order to perform speech enhancement. We show that our method successfully enhances a speech mixture, without prior information about the attended speaker, using electroencephalography (EEG) signals recorded from the listener. This makes it a strong candidate for realistic applications where no prior information about the attended speaker is available, such as hearing aids or cell phones.


Pages: 1310-1314

Number of pages: 5
