Monaural Voiced Speech Segregation Based on Dynamic Harmonic Function

Authors
Xueliang Zhang
Wenju Liu
Bo Xu
Affiliations
[1] Chinese Academy of Sciences, National Laboratory of Pattern Recognition (NLPR), Institute of Automation
[2] Inner Mongolia University, Computer Science Department
Keywords
Harmonic Order; Clean Speech; Complex Tone; Pitch Contour; Pitch Period
DOI
Not available
Abstract
The correlogram is an important representation for periodic signals and is widely used in pitch estimation and source separation. For these applications, its major problems are low resolution and redundant information. This paper proposes a voiced speech segregation system based on a newly introduced concept called the dynamic harmonic function (DHF). In the proposed system, conventional correlograms are further processed by replacing the autocorrelation function (ACF) with the DHF. The DHF has two advantages: 1) the peak width is adjustable by controlling the variance of the Gaussian function, and 2) the invalid peaks of the ACF, those not at the pitch period, tend to be suppressed. Based on the DHF, pitch detection and effective source segregation algorithms are proposed. The system is systematically evaluated and compared with the correlogram-based system. Both the signal-to-noise ratio results and the perceptual evaluation of speech quality scores show that the proposed system yields substantially better performance.
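For readers unfamiliar with the ACF that the abstract takes as its starting point, the following is a minimal sketch of autocorrelation-based pitch-period estimation on a single synthetic complex tone. It is not the DHF or the system described in the paper; the function names, the 80-400 Hz search range, and the test signal are assumptions chosen only for illustration. It also shows that the ACF has a strong peak at twice the pitch period, which is the kind of peak away from the pitch period that the abstract describes as invalid.

```python
import numpy as np

def normalized_acf(frame, max_lag):
    """Biased autocorrelation of one frame for lags 0..max_lag, scaled so lag 0 equals 1."""
    frame = frame - np.mean(frame)
    energy = np.dot(frame, frame)
    if energy == 0.0:
        return np.zeros(max_lag + 1)
    n = len(frame)
    acf = np.array([np.dot(frame[:n - lag], frame[lag:]) for lag in range(max_lag + 1)])
    return acf / energy

def estimate_pitch_period(frame, fs, f_min=80.0, f_max=400.0):
    """Pick the lag with the highest ACF value inside an assumed pitch range (hypothetical helper)."""
    min_lag = int(fs / f_max)
    max_lag = int(fs / f_min)
    acf = normalized_acf(frame, max_lag)
    lag = min_lag + int(np.argmax(acf[min_lag:max_lag + 1]))
    return lag, acf

# Synthetic complex tone: five harmonics of 200 Hz with 1/k amplitudes (assumed test signal).
fs = 16000
f0 = 200.0
t = np.arange(int(0.04 * fs)) / fs
tone = sum(np.sin(2 * np.pi * k * f0 * t) / k for k in range(1, 6))

lag, acf = estimate_pitch_period(tone, fs)
print("estimated pitch period (samples):", lag)
print("estimated pitch (Hz):", fs / lag)
print("ACF at twice the pitch period:", acf[2 * lag])
```

In a correlogram, an ACF of this kind is computed per frequency channel and per frame; the sketch covers only one frame of one signal.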
Related papers
  • [1] Monaural Voiced Speech Segregation Based on Dynamic Harmonic Function
    Zhang, Xueliang
    Liu, Wenju
    Xu, Bo
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2010,
  • [2] Monaural voiced speech segregation based on elaborate harmonic grouping strategies
    Liu WenJu
    Zhang XueLiang
    Jiang Wei
    Li Peng
    Xu Bo
    [J]. SCIENCE CHINA-INFORMATION SCIENCES, 2011, 54 (12) : 2471 - 2480
  • [3] Monaural voiced speech segregation based on elaborate harmonic grouping strategies
    WenJu Liu
    XueLiang Zhang
    Wei Jiang
    Peng Li
    Bo Xu
    [J]. Science China Information Sciences, 2011, 54 : 2471 - 2480
  • [4] Monaural voiced speech segregation based on elaborate harmonic grouping strategies
    LIU WenJu
    (affiliation) Digital Media Content Technology Research Center
    [J]. Science China(Information Sciences), 2011, 54 (12) : 2491 - 2500
  • [5] MONAURAL VOICED SPEECH SEGREGATION BASED ON ELABORATE HARMONIC GROUPING STRATEGY
    Zhang, Xueliang
    Liu, Wenju
    Li, Peng
    Xu, Bo
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009, : 4661 - +
  • [6] Monaural Voiced Speech Segregation Based on Pitch and Comb Filter
    Zhang, Xueliang
    Liu, Wenju
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1752 - +
  • [7] Monaural Segregation of Voiced Speech using Discriminative Random Fields
    Prabhavalkar, Rohit
    Jin, Zhaozhang
    Fosler-Lussier, Eric
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 864 - 867
  • [8] Improving Speech Intelligibility in Monaural Segregation System by Fusing Voiced and Unvoiced Speech Segments
    Shoba, S.
    Rajavel, R.
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (08) : 3573 - 3590
  • [9] Improving Speech Intelligibility in Monaural Segregation System by Fusing Voiced and Unvoiced Speech Segments
    S. Shoba
    R. Rajavel
    [J]. Circuits, Systems, and Signal Processing, 2019, 38 : 3573 - 3590
  • [10] Accurate Labeling of Time-Frequency Units in Monaural Voiced Speech Segregation
    Shamlou, Sanam Imani
    Geravanchizadeh, Masoud
    [J]. 2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 902 - 906