Audio-Visual Underdetermined Blind Source Separation Algorithm Based on Gaussian Potential Function

Cited by: 0
Authors
Zhang Ye [1]
Cao Kang [1]
Wu Kangrui [1]
Yu Tenglong [1]
Zhou Nanrun [1, 2]
Affiliations
[1] Nanchang Univ, Dept Elect Informat Engn, Nanchang 330031, Peoples R China
[2] Beijing Univ Posts & Telecommun, Natl Engn Lab Disaster Backup & Recovery, Beijing 100876, Peoples R China
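Funding
National Natural Science Foundation of China
Keywords
underdetermined blind source separation; interaural time difference; interaural level difference; visual information; Gaussian potential function; SPEECH; SIGNALS
DOI
Not available
Chinese Library Classification number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract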
Most existing algorithms for the underdetermined blind source separation (UBSS) problem are two-stage algorithms, i.e., mixing-parameter estimation followed by source estimation. In the mixing-parameter estimation stage, previously proposed traditional clustering algorithms are sensitive to the initialization of the mixing parameters. To reduce this sensitivity, we propose a new algorithm for the UBSS problem with anechoic speech mixtures that employs visual information to initialize the mixing parameters, i.e., the interaural time difference (ITD) and the interaural level difference (ILD). In our algorithm, the video signals are used to estimate the distances between the microphones and the sources, from which estimates of the ITD and ILD are obtained. Under the sparsity assumption in the time-frequency domain, the Gaussian potential function algorithm then estimates the mixing parameters, using these ITD and ILD estimates as initializations. Finally, time-frequency masking is used to recover the sources by evaluating the various ITDs and ILDs. Experimental results demonstrate that the proposed algorithm performs competitively with the baseline algorithms.
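The abstract gives no formulas, so the following is only a minimal sketch of the initialization idea under stated assumptions, not the authors' implementation. It assumes free-field (anechoic) propagation with a 1/r amplitude law and a sound speed of 343 m/s to turn two microphone-to-source distances into an ITD and an ILD. It then assumes a generic weighted Gaussian potential over observed ITD values and climbs it with a mean-shift style fixed-point update, a technique substituted here for illustration. All function names and parameters are hypothetical.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value, not taken from the paper


def itd_ild_from_distances(d1, d2, c=SPEED_OF_SOUND):
    """Return (itd_seconds, ild_db) for a source at distances d1 and d2
    (metres) from microphones 1 and 2.

    Assumes anechoic free-field propagation with 1/r amplitude decay.
    Sign convention (an assumption): values describe microphone 2
    relative to microphone 1.
    """
    if d1 <= 0 or d2 <= 0:
        raise ValueError("distances must be positive")
    itd = (d2 - d1) / c               # extra travel time to microphone 2
    ild = 20.0 * math.log10(d1 / d2)  # level at mic 2 relative to mic 1, in dB
    return itd, ild


def gaussian_potential(candidate, observations, weights, sigma):
    """Weighted sum of Gaussian kernels centred on the observations."""
    return sum(
        w * math.exp(-((candidate - x) ** 2) / (2.0 * sigma ** 2))
        for x, w in zip(observations, weights)
    )


def refine_estimate(initial, observations, weights, sigma, iterations=50):
    """Climb the Gaussian potential from `initial` to a nearby local maximum
    using a mean-shift style fixed-point update (illustrative choice)."""
    estimate = initial
    for _ in range(iterations):
        kernels = [
            w * math.exp(-((estimate - x) ** 2) / (2.0 * sigma ** 2))
            for x, w in zip(observations, weights)
        ]
        total = sum(kernels)
        if total == 0.0:
            break  # no observation close enough to pull the estimate
        estimate = sum(k * x for k, x in zip(kernels, observations)) / total
    return estimate
```

Because the update only moves toward observations that lie within a few `sigma` of the current estimate, the peak it reaches depends on the starting value. That is the sensitivity the abstract refers to, and it is why a distance-derived ITD or ILD is a plausible starting point for each source.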
Pages: 71 - 80
Number of pages: 10
Related papers
50 in total
  • [1] Audio-Visual Underdetermined Blind Source Separation Algorithm Based on Gaussian Potential Function
    ZHANG Ye
    CAO Kang
    WU Kangrui
    YU Tenglong
    ZHOU Nanrun
    [J]. China Communications, 2014, 11 (06) : 71 - 80
  • [2] An Efficient Algorithm for Underdetermined Blind Source Separation of Audio Mixtures
    Dutta, Malay Kishore
    Gupta, Phalguni
    Pathak, Vinay K.
    [J]. 2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 136 - +
  • [3] Developing an audio-visual speech source separation algorithm
    Sodoyer, D
    Girin, L
    Jutten, C
    Schwartz, JL
    [J]. SPEECH COMMUNICATION, 2004, 44 (1-4) : 113 - 125
  • [4] Source recovery of underdetermined blind source separation based on SCMP algorithm
    Fu, Weihong
    Chen, Jiehu
    Yang, Bo
    [J]. IET SIGNAL PROCESSING, 2017, 11 (07) : 877 - 883
  • [5] Underdetermined blind source separation algorithm based on A-DBSCAN
    Ji, Ce
    Mu, Wenhuan
    Geng, Rong
    [J]. Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (12) : 2676 - 2683
  • [6] Audio-Visual Based Online Multi-Source Separation
    Ong, Jonah
    Vo, Ba Tuong
    Nordholm, Sven
    Vo, Ba-Ngu
    Moratuwage, Diluka
    Shim, Changbeom
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1219 - 1234
  • [7] Underdetermined Blind Audio Source Separation Using Modal Decomposition
    Aissa-El-Bey, Abdeldjalil
    Abed-Meraim, Karim
    Grenier, Yves
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)
  • [8] Underdetermined Blind Audio Source Separation Using Modal Decomposition
    Abdeldjalil Aïssa-El-Bey
    Karim Abed-Meraim
    Yves Grenier
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2007
  • [9] Algorithm for source recovery in underdetermined blind source separation based on plane pursuit
    FU Weihong
    WEI Juan
    LIU Naian
    CHEN Jiehu
    [J]. Journal of Systems Engineering and Electronics, 2018, 29 (02) : 223 - 228
  • [10] Algorithm for source recovery in underdetermined blind source separation based on plane pursuit
    Fu Weihong
    Wei Juan
    Liu Naian
    Chen Jiehu
    [J]. JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2018, 29 (02) : 223 - 228