SPEAKER-INDEPENDENT BRAIN ENHANCED SPEECH DENOISING

Cited by: 5
Authors
Hosseini, Maryam [1]
Celotti, Luca [1]
Plourde, Eric [1]
Affiliations
[1] Univ Sherbrooke, NECOTIS, Dept Elect & Comp Engn, Sherbrooke, PQ, Canada
Keywords
speech enhancement; deep learning; EEG signals
DOI
10.1109/ICASSP39728.2021.9414969
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The auditory system is extremely efficient in extracting attended auditory information in the presence of competing speakers. Single-channel speech enhancement algorithms, however, greatly lack this efficacy. In this paper, we propose a novel deep learning method, referred to as the Brain Enhanced Speech Denoiser (BESD), that takes advantage of the attended auditory information present in the brain activity of the listener to denoise multi-talker speech. We use this information to modulate the features learned from the sound and the brain activity, in order to perform speech enhancement. We show that our method successfully enhances a speech mixture, without prior information about the attended speaker, using electroencephalography (EEG) signals recorded from the listener. This makes it a great candidate for realistic applications where no prior information about the attended speaker is available, such as hearing aids or cell phones.
Pages: 1310-1314
Number of pages: 5