A Signal Processing Approach for Speaker Separation using SFF Analysis

被引:0
|
作者
Chennupati, Nivedita [1 ]
Murthy, B. H. V. S. Narayana [1 ,2 ]
Yegnanarayana, B. [1 ]
机构
[1] Int Inst Informat Technol, Speech Proc Lab, Hyderabad, Telangana, India
[2] Res Ctr Imarat, Hyderabad, Telangana, India
关键词
Multi-speaker separation; single frequency filtering (SFF); time delay estimation; binary mask; SPEECH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-speaker separation is necessary to increase intelligibility of speech signals or to improve accuracy of speech recognition systems. Ideal binary mask (IBM) has set a gold standard for speech separation by suppressing the undesired speakers and also by increasing intelligibility of the desired speech. In this work, single frequency filtering (SFF) analysis is used to estimate the mask closer to IBM for speaker separation. The SFF analysis gives good temporal resolution for extracting features such as glottal closure instants (GCIs), and high spectral resolution for resolving harmonics. The temporal resolution in SFF gives impulse locations, which are used to calculate the time delay. The delay compensation between two microphone signals reinforces the impulses corresponding to one of the speakers. The spectral resolution of the SFF is exploited to estimate the masks using the SFF magnitude spectra on the enhanced impulse-like sequence corresponding to one of the speakers. The estimated mask is used to refine the SFF magnitude. The refined SFF magnitude along with the phase of the mixed microphone signal is used to obtain speaker separation. Performance of proposed algorithm is demonstrated using multi-speaker data collected in a real room environment.
引用
收藏
页码:2034 / 2035
页数:2
相关论文
共 50 条
  • [41] A Statistical Signal Processing Approach in Wireless Network Traffic Analysis
    Chowdhury, Sajib
    Paul, Swagata
    Chatterjee, Debraj
    Mukherjee, Somenath
    Ghosal, Sandipan
    Goswami, Radha Tamal
    2018 INTERNATIONAL CONFERENCE ON COMPUTING, POWER AND COMMUNICATION TECHNOLOGIES (GUCON), 2018, : 70 - 73
  • [42] ANALYSIS OF COAL AND MINERAL PROCESSING CIRCUITS, SIGNAL FLOWGRAPH APPROACH
    SALAMA, AIA
    MIKHAIL, MW
    CIM BULLETIN, 1987, 80 (901): : 51 - 53
  • [43] Signal processing in movement analysis (a state-space approach)
    Fioretti, S
    HUMAN MOVEMENT SCIENCE, 1996, 15 (03) : 389 - 410
  • [44] Audio signal processing via harmonic separation using variable Laguerre filters
    Tay, DBH
    Abeysekera, SS
    Balasuriya, AP
    PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL III: GENERAL & NONLINEAR CIRCUITS AND SYSTEMS, 2003, : 558 - 561
  • [45] Development of Pile-up Separation Method Using Digital Signal Processing
    Oishi, Takuji
    Baba, Mamoru
    JOURNAL OF NUCLEAR SCIENCE AND TECHNOLOGY, 2008, : 375 - 378
  • [46] A Study on Continuous Phase Signal Separation and Demodulation Method Using Stored Data Batch Signal Processing
    Hikasa, Tomofumi
    Hirakawa, Takuyuki
    Nakaie, Syo
    Tomisato, Shigeru
    Denno, Satoshi
    Uehara, Kazuhiro
    IEICE COMMUNICATIONS EXPRESS, 2022, 11 (12): : 734 - 740
  • [47] User Identification System Using Biometrics Speaker Recognition by MFCC and DTW along with signal processing package
    Muttaqi, Tazwar
    Mousavinezhad, S. Hossein
    Mahamud, Shaikh
    2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2018, : 79 - 83
  • [48] Spectral analysis techniques using Prism signal processing
    Henry, Manus
    MEASUREMENT, 2021, 169
  • [49] Analysis of Finger Thermoregulation by Using Signal Processing Techniques
    Henao Higuita, Maria Camila
    Hernandez Fernandez, Macheily
    Aristizabal Martinez, Delio
    Fandino Toro, Hermes
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2019), PT II, 2019, 11466 : 537 - 549
  • [50] Monaural Speech Separation Using Speaker Embedding From Preliminary Separation
    Byun, Jaeuk
    Shin, Jong Won
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2753 - 2763