Audio-Visual Underdetermined Blind Source Separation Algorithm Based on Gaussian Potential Function

被引:0
|
作者
ZHANG Ye [1 ]
CAO Kang [1 ]
WU Kangrui [1 ]
YU Tenglong [1 ]
ZHOU Nanrun [1 ,2 ]
机构
[1] Department of Electronic Information Engineering, Nanchang University
[2] National Engineering Laboratory for Disaster Backup and Recovery, Beijing University of Posts and Telecommunications
基金
中国国家自然科学基金;
关键词
underdetermined blind source separation; interaural time difference; interaural level difference; visual information; Gaussian potential function;
D O I
暂无
中图分类号
TN911.7 [信号处理];
学科分类号
0711 ; 080401 ; 080402 ;
摘要
Most existing algorithms for the underdetermined blind source separation(UBSS) problem are two-stage algorithm, i.e., mixing parameters estimation and sources estimation. In the mixing parameters estimation, the previously proposed traditional clustering algorithms are sensitive to the initializations of the mixing parameters. To reduce the sensitiveness to the initialization, we propose a new algorithm for the UBSS problem based on anechoic speech mixtures by employing the visual information, i.e., the interaural time difference(ITD) and the interaural level difference(ILD), as the initializations of the mixing parameters. In our algorithm, the video signals are utilized to estimate the distances between microphones and sources, and then the estimations of the ITD and ILD can be obtained. With the sparsity assumption in the time-frequency domain, the Gaussian potential function algorithm is utilized to estimate the mixing parameters by using the ITDs and ILDs as the initializations of the mixing parameters. And the time-frequency masking is used to recover the sources by evaluating the various ITDs and ILDs. Experimental results demonstrate the competitive performance of the proposed algorithm compared with the baseline algorithms.
引用
收藏
页码:71 / 80
页数:10
相关论文
共 50 条
  • [31] A COMPRESSED SENSING APPROACH FOR UNDERDETERMINED BLIND AUDIO SOURCE SEPARATION WITH SPARSE REPRESENTATION
    Xu, Tao
    Wang, Wenwu
    [J]. 2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 493 - 496
  • [32] Mixing Matrix Estimation in Underdetermined Blind Source Separation Based on Objective Function and Artificial Bee Colony Algorithm
    Chen, Yongqiang
    Li, Yingxiang
    Zhou, Juan
    [J]. 2018 IEEE 18TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2018, : 1063 - 1066
  • [33] Underdetermined Blind Source Separation Based Condition Monitoring
    Vinaya, Anindita Adikaputri
    Arifianto, Dhany
    [J]. 2015 INTERNATIONAL CONFERENCE ON SCIENCE IN INFORMATION TECHNOLOGY (ICSITECH), 2015, : 47 - 52
  • [34] Underdetermined Blind Source Separation Based on Subspace Representation
    Kim, SangGyun
    Yoo, Chang D.
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2009, 57 (07) : 2604 - 2614
  • [35] Underdetermined Blind Source Separation Based on Sparse Component
    Ren, Ming-rong
    Wang, Pu
    [J]. ICECT: 2009 INTERNATIONAL CONFERENCE ON ELECTRONIC COMPUTER TECHNOLOGY, PROCEEDINGS, 2009, : 174 - 177
  • [36] Underdetermined blind source separation based on sparse representation
    Li, YQ
    Amari, SI
    Cichocki, A
    Ho, DWC
    Xie, SL
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (02) : 423 - 437
  • [37] Gaussian Processes for Underdetermined Source Separation
    Liutkus, Antoine
    Badeau, Roland
    Richard, Gaeel
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2011, 59 (07) : 3155 - 3167
  • [38] Underdetermined blind speech source separation based on deep nearest neighbor clustering algorithm
    Mengdie Niu
    Ye Zhang
    [J]. Multimedia Tools and Applications, 2023, 82 : 1171 - 1183
  • [39] Underdetermined blind speech source separation based on deep nearest neighbor clustering algorithm
    Niu, Mengdie
    Zhang, Ye
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (01) : 1171 - 1183
  • [40] Blind Source Separation in Underdetermined Model based on Local Mean Decomposition and AMUSE Algorithm
    Li Wei
    Yang Huizhong
    [J]. 2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 7206 - 7211