Single channel speech separation in modulation frequency domain based on a novel pitch range estimation method

Cited by: 0
Authors
Azar Mahmoodzadeh
Hamid Reza Abutalebi
Hamid Soltanian-Zadeh
Hamid Sheikhzadeh
Affiliations
[1] Yazd University, Speech Processing Research Lab (SPRL), Electrical and Computer Engineering Department
[2] University of Tehran, Control and Intelligent Processing Center of Excellence (CIPCE), School of Electrical and Computer Engineering
[3] Henry Ford Health System, Image Analysis Laboratory, Department of Radiology
[4] Amirkabir University of Technology, Electrical Engineering Department
Keywords
acoustic frequency; modulation frequency; onset and offset algorithm; pitch range estimation; speech separation
DOI
Not available
Abstract
Computational Auditory Scene Analysis (CASA) has been a focus of recent literature on speech separation from monaural mixtures. The performance of current CASA systems on voiced speech separation depends strongly on the robustness of the pitch frequency estimation algorithm. We propose a new system that estimates the pitch (frequency) range of a target utterance and separates the voiced portions of the target speech. The algorithm first estimates the pitch range of the target speech in each frame of data in the modulation frequency domain, and then uses the estimated pitch range to segregate the target speech. Pitch range estimation is based on an onset and offset algorithm. Speech separation is performed by filtering the mixture signal with a mask extracted from the modulation spectrogram. A systematic evaluation shows that the proposed system extracts the majority of the target speech signal with minimal interference and outperforms previous systems in both pitch extraction and voiced speech separation.
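The two stages named in the abstract (estimate a pitch range in the modulation frequency domain, then derive a mask from it) can be illustrated with a minimal sketch. This is not the authors' method: the abstract does not detail the onset and offset algorithm, so simple peak picking with a relative threshold is swapped in here as a stand-in. The function names, the 80 to 400 Hz search bounds, the 0.5 threshold, and the synthetic subband envelope are all assumptions made for illustration.

```python
import cmath
import math


def modulation_spectrum(envelope, rate_hz):
    """Magnitude DFT of one subband envelope: (freqs_hz, magnitudes)."""
    n = len(envelope)
    mean = sum(envelope) / n
    centered = [x - mean for x in envelope]  # remove the DC component
    freqs, mags = [], []
    for k in range(n // 2 + 1):
        acc = sum(centered[t] * cmath.exp(-2j * math.pi * k * t / n)
                  for t in range(n))
        freqs.append(k * rate_hz / n)
        mags.append(abs(acc))
    return freqs, mags


def estimate_pitch_range(freqs, mags, lo_hz=80.0, hi_hz=400.0, rel_threshold=0.5):
    """Hypothetical stand-in for the paper's onset/offset algorithm.

    Finds the strongest modulation peak inside [lo_hz, hi_hz] and grows a
    contiguous band around it while the magnitude stays above
    rel_threshold * peak.
    """
    candidates = [i for i, f in enumerate(freqs) if lo_hz <= f <= hi_hz]
    peak = max(candidates, key=lambda i: mags[i])
    floor = rel_threshold * mags[peak]
    left = right = peak
    while left - 1 >= candidates[0] and mags[left - 1] >= floor:
        left -= 1
    while right + 1 <= candidates[-1] and mags[right + 1] >= floor:
        right += 1
    return freqs[left], freqs[right]


def binary_mask(freqs, pitch_range):
    """1 for modulation bins inside the estimated pitch range, else 0."""
    lo, hi = pitch_range
    return [1 if lo <= f <= hi else 0 for f in freqs]


# Synthetic envelope (assumed data): a 150 Hz "target" modulation plus a
# weaker 310 Hz "interferer", sampled at 2000 Hz for 0.1 s.
rate_hz = 2000.0
n = 200
envelope = [1.0
            + 0.6 * math.cos(2 * math.pi * 150.0 * t / rate_hz)
            + 0.3 * math.cos(2 * math.pi * 310.0 * t / rate_hz)
            for t in range(n)]

freqs, mags = modulation_spectrum(envelope, rate_hz)
pitch_range = estimate_pitch_range(freqs, mags)
mask = binary_mask(freqs, pitch_range)
print("estimated pitch range (Hz):", pitch_range)
print("bins kept by mask:", sum(mask))
```

In a full system the mask would be applied to the modulation spectrogram of the mixture, frame by frame and across acoustic subbands, before resynthesis; the sketch stops at the mask for a single envelope.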
Source
EURASIP Journal on Advances in Signal Processing, 2012