Exploiting known sound source signals to improve ICA-based robot audition in speech separation and recognition

被引:0
|
作者
Takeda, Ryu [1 ]
Nakadai, Kazuhiro [2 ]
Komatani, Kazunori [1 ]
Ogata, Tetsuya [1 ]
Okuno, Hiroshi G. [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
[2] Honda Res Inst, Wako, Saitama 3510114, Japan
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes a new semi-blind source separation (semi-BSS) technique with independent component analysis (ICA) for enhancing a target source of interest and for suppressing other known interference sources. The semi-BSS technique is necessary for double-talk free robot audition systems in order to utilize known sound source signals such as self speech, music, or TV-sound, through a line-in or ubiquitous network. Unlike the conventional semi-BSS with ICA, we use the time-frequency domain convolution model to describe the reflection of the sound and a new mixing process of sounds for ICA. In other words, we consider that reflected sounds during some delay time are different from the original. ICA then separates the reflections as other interference sources. The model enables us to eliminate the frame size limitations of the frequency-domain ICA, and ICA can separate the known sources under a highly reverberative environment. Experimental results show that our method outperformed the conventional semi-BSS using ICA under simulated normal and highly reverberative environments.
引用
收藏
页码:1763 / +
页数:2
相关论文
共 50 条
  • [21] Low-cost design and implementation of an ICA-based blind source separation algorithm
    Sattar, F
    Charayaphan, C
    [J]. 15TH ANNUAL IEEE INTERNATIONAL ASIC/SOC CONFERENCE, PROCEEDINGS, 2002, : 15 - 19
  • [22] MAICA: an ICA-based method for source separation in a low-channel EEG recording
    Rejer, Izabela
    Gorski, Pawel
    [J]. JOURNAL OF NEURAL ENGINEERING, 2019, 16 (05)
  • [23] Two-stage blind source separation based on ICA and binary masking for real-time robot audition system
    Saruwatari, H
    Mori, Y
    Hiekata, T
    Morita, T
    [J]. 2005 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2005, : 209 - 214
  • [24] Sound source separation for a robot based on pitch
    Heckmann, M
    Joublin, F
    Körner, E
    [J]. 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, Vols 1-4, 2005, : 203 - 208
  • [25] Blind dereverberation of single-channel speech signals using an ICA-based generative model
    Lee, JH
    Oh, SH
    Lee, SY
    [J]. NEURAL INFORMATION PROCESSING, 2004, 3316 : 1070 - 1075
  • [26] Sound Source Separation and Automatic Speech Recognition for Moving Sources
    Nakadai, Kazuhiro
    Nakajima, Hirofumi
    Ince, Goekhan
    Hasegawa, Yuji
    [J]. IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010), 2010, : 976 - 981
  • [27] Bayesian Integration of Sound Source Separation and Speech Recognition: A New Approach to Simultaneous Speech Recognition
    Itakura, Kousuke
    Nishimuta, Izaya
    Bando, Yoshiaki
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 736 - 740
  • [28] Order and phase ambiguities correction in the ICA based separation of speech signals
    Uddin, Z.
    Orakzai, F. A.
    Qamar, A.
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (02) : 459 - 469
  • [29] Order and phase ambiguities correction in the ICA based separation of speech signals
    Z. Uddin
    F. A. Orakzai
    A. Qamar
    [J]. International Journal of Speech Technology, 2020, 23 : 459 - 469
  • [30] ICA-based clustering for resolving permutation ambiguity in frequency-domain convolutive source separation
    Kim, Minje
    Choi, Seungjin
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2006, : 950 - +