Exploiting known sound source signals to improve ICA-based robot audition in speech separation and recognition

被引:0
|
作者
Takeda, Ryu [1 ]
Nakadai, Kazuhiro [2 ]
Komatani, Kazunori [1 ]
Ogata, Tetsuya [1 ]
Okuno, Hiroshi G. [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
[2] Honda Res Inst, Wako, Saitama 3510114, Japan
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes a new semi-blind source separation (semi-BSS) technique with independent component analysis (ICA) for enhancing a target source of interest and for suppressing other known interference sources. The semi-BSS technique is necessary for double-talk free robot audition systems in order to utilize known sound source signals such as self speech, music, or TV-sound, through a line-in or ubiquitous network. Unlike the conventional semi-BSS with ICA, we use the time-frequency domain convolution model to describe the reflection of the sound and a new mixing process of sounds for ICA. In other words, we consider that reflected sounds during some delay time are different from the original. ICA then separates the reflections as other interference sources. The model enables us to eliminate the frame size limitations of the frequency-domain ICA, and ICA can separate the known sources under a highly reverberative environment. Experimental results show that our method outperformed the conventional semi-BSS using ICA under simulated normal and highly reverberative environments.
引用
收藏
页码:1763 / +
页数:2
相关论文
共 50 条
  • [1] A new geometrical ICA-based method for blind separation of speech signals
    Rodríguez-Alvarez, M
    Rojas, F
    Lang, EW
    Rojas, I
    [J]. ARTIFICIAL NEURAL NETS PROBLEM SOLVING METHODS, PT II, 2003, 2687 : 281 - 288
  • [2] Improvement of robot audition by interfacing sound source separation and automatic speech recognition with missing feature theory
    Yamamoto, S
    Nakadai, K
    Tsujino, H
    Yokoyama, T
    Okuno, HG
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, : 1517 - 1523
  • [3] SOUND SOURCE SEPARATION OF MOVING SPEAKERS FOR ROBOT AUDITION
    Nakadai, Kazuhiro
    Nakajima, Hirofumi
    Hasegawa, Yuji
    Tsujino, Hiroshi
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3685 - 3688
  • [4] ICA-based Noise Reduction Algorithm for Speech Recognition
    Xu, Yang
    Liu, Ting
    [J]. INTERNATIONAL CONFERENCE ON FRONTIERS OF ENERGY, ENVIRONMENTAL MATERIALS AND CIVIL ENGINEERING (FEEMCE 2013), 2013, : 779 - 784
  • [5] ICA-Based Receiver for An Optimal Separation of CDMA Signals
    Hamza, Abdelkrim
    Chitroub, Salim
    [J]. AFRICAN REVIEW OF PHYSICS, 2008, 2 : 50 - 52
  • [6] Sound Source Separation for Robot Audition using Deep Learning
    Noda, Kuniaki
    Hashimoto, Naoya
    Nakadai, Kazuhiro
    Ogata, Tetsuya
    [J]. 2015 IEEE-RAS 15TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2015, : 389 - 394
  • [7] Blind source separation of speech signals based on an ICA geometric procedure
    Rodríguez-Alvarez, M
    Rojas, F
    Salmerón, M
    Rojas, I
    Ros, E
    Puntonet, CG
    [J]. PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 631 - 636
  • [8] Recognition of simultaneous speech by estimating reliability of separated signals for robot audition
    Yamamoto, Shun'ichi
    Takeda, Ryu
    Nakadai, Kazuhiro
    Nakano, Mikio
    Tsujino, Hiroshi
    Valin, Jean-Marc
    Komatani, Kazunori
    Ogata, Tetsuya
    Okuno, Hiroshi G.
    [J]. PRICAI 2006: TRENDS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4099 : 484 - 494
  • [9] An ICA-Based Method for Blind Source Separation in Sparse Domains
    Nadalin, Everton Z.
    Suyama, Ricardo
    Attux, Romis
    [J]. INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 597 - +
  • [10] ICA-BASED EFFICIENT BLIND DEREVERBERATION AND ECHO CANCELLATION METHOD FOR BARGE-IN-ABLE ROBOT AUDITION
    Takeda, Ryu
    Nakadai, Kazuhiro
    Takahashi, Toru
    Komatani, Kazunori
    Ogata, Telsuya
    Okuno, Hiroshi G.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3677 - +