Speech Recognition Using Blind Source Separation and Dereverberation Method for Mixed Sound of Speech and Music

被引:0
|
作者
Wang, Longbiao [1 ]
Odani, Kyohei [2 ]
Kai, Atsuhiko [2 ]
Li, Weifeng [3 ]
机构
[1] Nagaoka Univ Technol, Nagaoka, Niigata 9402188, Japan
[2] Shizuoka Univ, Grad Sch Engn, Hamamatsu, Shizuoka 4328561, Japan
[3] Tsinghua Univ, Shenzhen 100084, Peoples R China
关键词
hands-free speech recognition; blind dereverberation; blind source separation; multi-channel least mean square; generalized spectral subtraction; INDEPENDENT COMPONENT ANALYSIS; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a method for performing a non-stationary noise reduction and dereverberation method. We use a blind dereverberation method based on spectral subtraction using a multi-channel least mean square algorithm has been proposed in our previous study. To suppress the non-stationary noise, we used a blind source separation based on an efficient fast independent component analysis algorithm. This method is evaluated using a mixed sound of speech and music, and achieves an average relative word error reduction rate of 41.9% and 7.9% compared with a baseline method and the state-of-the-art multi-step linear prediction-based dereverberation, respectively, in a real environment.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] The Influence of Blind Source Separation on Mixed Audio Speech and Music Emotion Recognition
    Laugs, Casper
    Koops, Hendrik Vincent
    Odijk, Daan
    Kaya, Heysem
    Volk, Anja
    [J]. COMPANION PUBLICATON OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION), 2020, : 67 - 71
  • [2] A Semi-blind Source Separation Approach for Speech Dereverberation
    Wang, Ziteng
    Na, Yueyue
    Liu, Zhang
    Li, Yun
    Tian, Biao
    Fu, Qiang
    [J]. INTERSPEECH 2020, 2020, : 3925 - 3929
  • [3] Blind Speech Separation and Dereverberation using neural beamforming
    Pfeifenberger, Lukas
    Pernkopf, Franz
    [J]. SPEECH COMMUNICATION, 2022, 140 : 29 - 41
  • [4] JOINT BLIND DEREVERBERATION AND SEPARATION OF SPEECH MIXTURES
    Jan, Tariqullah
    Wang, Wenwu
    [J]. 2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2343 - 2347
  • [5] Online blind source separation and dereverberation of speech based on a joint diagonalizability constraint
    Yu, Ho-Gun
    Kim, Do-Hui
    Song, Min-Hwan
    Park, Hyung-Min
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 503 - 514
  • [6] Blind Source Separation of Noisy Mixed Speech Signals
    Li, Huiya
    Shi, Jianying
    Men, Jinxi
    [J]. SENSORS, MEASUREMENT AND INTELLIGENT MATERIALS II, PTS 1 AND 2, 2014, 475-476 : 291 - +
  • [7] Blind Separation and Dereverberation of Speech Mixtures by Joint Optimization
    Yoshioka, Takuya
    Nakatani, Tomohiro
    Miyoshi, Masato
    Okuno, Hiroshi G.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (01): : 69 - 84
  • [8] Bayesian Integration of Sound Source Separation and Speech Recognition: A New Approach to Simultaneous Speech Recognition
    Itakura, Kousuke
    Nishimuta, Izaya
    Bando, Yoshiaki
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 736 - 740
  • [9] Over-determined Speech Source Separation and Dereverberation
    Togami, Masahito
    Scheibler, Robin
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 705 - 710
  • [10] Sound Source Separation and Automatic Speech Recognition for Moving Sources
    Nakadai, Kazuhiro
    Nakajima, Hirofumi
    Ince, Goekhan
    Hasegawa, Yuji
    [J]. IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 976 - 981