Underdetermined blind source separation using CapsNet

被引:6
|
作者
Kumar, M. [1 ]
Jayanthi, V. E. [2 ]
机构
[1] Chettinad Coll Engn & Technol, Karurtrichy Highways,Puliyur CF PO, Karur, Tamil Nadu, India
[2] PSNA Coll Engn & Technol, Dindigul, Tamil Nadu, India
关键词
Array signal processing; Blind source separation; Capsule networks; Speech recognition; Time-frequency masking; CONVOLUTIVE MIXTURES;
D O I
10.1007/s00500-019-04430-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we consider the problem of separating the speech source signal from the underdetermined convolutive mixture signals using capsule network (CapsNet). The objective of this paper is twofold. They are (1) to improve the underdetermined convolutive blind source separation algorithm in terms of signal-to-distortion ratio, signal-to-interference ratio and signal-to-artifact ratio; (2) to minimize the computational burden of the algorithm so that it is useful for applications like speech recognition system. The time-frequency points of the observed mixture signals are input to the first layer of CapsNet. In the first layer, single-source active point (SSP) is calculated using the ratio of mixtures. These SSPs are lower-level capsules in our system. In the second layer, we find a cluster center using a dynamic routing algorithm and these clusters are used to construct a binary mask. Finally, the algorithm solves the permutation problem by determining the correlation between the amplitudes of adjacent frequency bins. We test our algorithm on the live recording mixture signals obtained in the real environment and synthetically convoluted mixture signals. The test result shows the effectiveness of the proposed method when compared with the existing algorithms in terms of computational load, signal-to-distortion ratio and signal-to-interference ratio.
引用
收藏
页码:9011 / 9019
页数:9
相关论文
共 50 条
  • [21] Underdetermined blind source separation of temporomandibular joint sounds
    Took, Clive Cheong
    Sanei, Saeid
    Chambers, Jonathon
    Dunne, Stephen
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2006, 53 (10) : 2123 - 2126
  • [22] Underdetermined Blind Source Separation Based Condition Monitoring
    Vinaya, Anindita Adikaputri
    Arifianto, Dhany
    [J]. 2015 INTERNATIONAL CONFERENCE ON SCIENCE IN INFORMATION TECHNOLOGY (ICSITECH), 2015, : 47 - 52
  • [23] Underdetermined Blind Source Separation Based on Subspace Representation
    Kim, SangGyun
    Yoo, Chang D.
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (07) : 2604 - 2614
  • [24] Underdetermined Sparse Blind Source Separation by Clustering on Hyperplanes
    Tan Beihai
    Zhao Min
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, VOL I, 2009, : 270 - 274
  • [25] Underdetermined Blind Source Separation in Single Mixtures Signal
    Cheng Xie-feng
    Tao Ye-wei
    Zhang Shao-bai
    Li Jian-yin
    Ma Yong-hua
    [J]. PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4273 - +
  • [26] Underdetermined blind source separation based on sparse representation
    Li, YQ
    Amari, SI
    Cichocki, A
    Ho, DWC
    Xie, SL
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (02) : 423 - 437
  • [27] Underdetermined Blind Source Separation Based on Sparse Component
    Ren, Ming-rong
    Wang, Pu
    [J]. ICECT: 2009 INTERNATIONAL CONFERENCE ON ELECTRONIC COMPUTER TECHNOLOGY, PROCEEDINGS, 2009, : 174 - 177
  • [28] Underdetermined Joint Blind Source Separation of Multiple Datasets
    Zou, Liang
    Chen, Xun
    Ji, Xiangyang
    Wang, Z. Jane
    [J]. IEEE ACCESS, 2017, 5 : 7474 - 7487
  • [29] AN ONLINE ADAPTIVE ALGORITHM FOR UNDERDETERMINED BLIND SOURCE SEPARATION
    Zhang, Ye
    Wu, Kangrui
    Tan, Gangyan
    Wu, Jianhua
    [J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 467 - 472
  • [30] Underdetermined blind separation of source using lp-norm diversity measures
    Xie, Yuan
    Xie, Kan
    Xie, Shengli
    [J]. NEUROCOMPUTING, 2020, 411 : 259 - 267