A Signal Processing Approach for Speaker Separation using SFF Analysis

被引:0
|
作者
Chennupati, Nivedita [1 ]
Murthy, B. H. V. S. Narayana [1 ,2 ]
Yegnanarayana, B. [1 ]
机构
[1] Int Inst Informat Technol, Speech Proc Lab, Hyderabad, Telangana, India
[2] Res Ctr Imarat, Hyderabad, Telangana, India
关键词
Multi-speaker separation; single frequency filtering (SFF); time delay estimation; binary mask; SPEECH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-speaker separation is necessary to increase intelligibility of speech signals or to improve accuracy of speech recognition systems. Ideal binary mask (IBM) has set a gold standard for speech separation by suppressing the undesired speakers and also by increasing intelligibility of the desired speech. In this work, single frequency filtering (SFF) analysis is used to estimate the mask closer to IBM for speaker separation. The SFF analysis gives good temporal resolution for extracting features such as glottal closure instants (GCIs), and high spectral resolution for resolving harmonics. The temporal resolution in SFF gives impulse locations, which are used to calculate the time delay. The delay compensation between two microphone signals reinforces the impulses corresponding to one of the speakers. The spectral resolution of the SFF is exploited to estimate the masks using the SFF magnitude spectra on the enhanced impulse-like sequence corresponding to one of the speakers. The estimated mask is used to refine the SFF magnitude. The refined SFF magnitude along with the phase of the mixed microphone signal is used to obtain speaker separation. Performance of proposed algorithm is demonstrated using multi-speaker data collected in a real room environment.
引用
收藏
页码:2034 / 2035
页数:2
相关论文
共 50 条
  • [21] ECG signal processing using multiresolution analysis
    Department of Electronics, Sciences Engineering Faculty, Abou-Bekr Belkaïd University, BP 230, Tlemcen 13000, Algeria
    J. Med. Eng. Technol., 2008, 6 (466-478):
  • [22] Unsupervised Speech Separation Using Statistical, Auditory and Signal Processing Approaches
    Hemavathi, R.
    Swamy, R. Kumara
    2018 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2018,
  • [23] Signal processing for state estimation during sensor failure using mixed signal separation technique
    Ravi, S
    Jayasingh, T
    ICCS 2002: 8TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2002, : 765 - 768
  • [24] On a decoupled approach to adaptive signal separation using an antenna array
    Larsson, EG
    Stoica, P
    Li, J
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2002, 51 (06) : 1681 - 1685
  • [25] A decoupled approach to adaptive signal separation using an antenna array
    Ranheim, A
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 1999, 48 (03) : 676 - 682
  • [26] Blind source separation using the maximum signal fraction approach
    Hundley, DR
    Kirby, MJ
    Anderle, M
    SIGNAL PROCESSING, 2002, 82 (10) : 1505 - 1508
  • [27] An investigation into front-end signal processing for speaker normalization
    Umesh, S
    Sinha, R
    Kumar, SVB
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 345 - 348
  • [28] Application of Blind-Signal-Processing Algorithm in Image Separation Blind-Signal-Processing in Image Separation
    Chen, Chuen-Yau
    Lin, Cheng-Yuan
    Liu, Wei-Ching
    Chen, Yen-Ting
    2019 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2019, : 158 - 159
  • [29] Signal separation method using independent component analysis
    Yoshioka, M
    Omatu, S
    ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 891 - 894
  • [30] Signal separation method using independent component analysis
    Yoshioka, M
    Omatu, S
    ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 753 - 756