Underdetermined blind source separation using CapsNet

被引:6
|
作者
Kumar, M. [1 ]
Jayanthi, V. E. [2 ]
机构
[1] Chettinad Coll Engn & Technol, Karurtrichy Highways,Puliyur CF PO, Karur, Tamil Nadu, India
[2] PSNA Coll Engn & Technol, Dindigul, Tamil Nadu, India
关键词
Array signal processing; Blind source separation; Capsule networks; Speech recognition; Time-frequency masking; CONVOLUTIVE MIXTURES;
D O I
10.1007/s00500-019-04430-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we consider the problem of separating the speech source signal from the underdetermined convolutive mixture signals using capsule network (CapsNet). The objective of this paper is twofold. They are (1) to improve the underdetermined convolutive blind source separation algorithm in terms of signal-to-distortion ratio, signal-to-interference ratio and signal-to-artifact ratio; (2) to minimize the computational burden of the algorithm so that it is useful for applications like speech recognition system. The time-frequency points of the observed mixture signals are input to the first layer of CapsNet. In the first layer, single-source active point (SSP) is calculated using the ratio of mixtures. These SSPs are lower-level capsules in our system. In the second layer, we find a cluster center using a dynamic routing algorithm and these clusters are used to construct a binary mask. Finally, the algorithm solves the permutation problem by determining the correlation between the amplitudes of adjacent frequency bins. We test our algorithm on the live recording mixture signals obtained in the real environment and synthetically convoluted mixture signals. The test result shows the effectiveness of the proposed method when compared with the existing algorithms in terms of computational load, signal-to-distortion ratio and signal-to-interference ratio.
引用
收藏
页码:9011 / 9019
页数:9
相关论文
共 50 条
  • [1] Underdetermined blind source separation using CapsNet
    M. Kumar
    V. E. Jayanthi
    [J]. Soft Computing, 2020, 24 : 9011 - 9019
  • [2] Underdetermined blind source separation using SVM
    Yang Zuyuan
    Luo Shiguang
    Chen Caiyun
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 3, PROCEEDINGS, 2007, 4493 : 803 - +
  • [3] Underdetermined Blind Source Separation Using Linear Separation System
    Cermak, Jan
    Smekal, Zdenek
    [J]. MULTIMODAL SIGNAL: COGNITIVE AND ALGORITHMIC ISSUES, 2009, 5398 : 300 - +
  • [4] Underdetermined blind source separation using sparse representations
    Bofill, P
    Zibulevsky, M
    [J]. SIGNAL PROCESSING, 2001, 81 (11) : 2353 - 2362
  • [5] Underdetermined Blind Source Separation Using Sparse Coding
    Zhen, Liangli
    Peng, Dezhong
    Yi, Zhang
    Xiang, Yong
    Chen, Peng
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (12) : 3102 - 3108
  • [6] Underdetermined blind source separation in echoic environments using DESPRIT
    Melia, Thomas
    Rickard, Scott
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)
  • [7] Underdetermined Blind Audio Source Separation Using Modal Decomposition
    Aissa-El-Bey, Abdeldjalil
    Abed-Meraim, Karim
    Grenier, Yves
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)
  • [8] Underdetermined Blind Audio Source Separation Using Modal Decomposition
    Abdeldjalil Aïssa-El-Bey
    Karim Abed-Meraim
    Yves Grenier
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2007
  • [9] Underdetermined Blind Source Separation in Echoic Environments Using DESPRIT
    Thomas Melia
    Scott Rickard
    [J]. EURASIP Journal on Advances in Signal Processing, 2007
  • [10] Active source selection using gap statistics for underdetermined blind source separation
    Luo, Y
    Chambers, J
    [J]. SEVENTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOL 1, PROCEEDINGS, 2003, : 137 - 140