Generalized likelihood ratio test for voiced-unvoiced decision in noisy speech using the harmonic model

被引:34
|
作者
Fisher, E
Tabrikian, J
Dubnov, S
机构
[1] Ben Gurion Univ Negev, Dept Elect & Comp Engn, IL-84105 Beer Sheva, Israel
[2] Univ Calif San Diego, Dept Mus, La Jolla, CA 92093 USA
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2006年 / 14卷 / 02期
基金
美国国家科学基金会;
关键词
generalized likelihood ratio test (GLRT); harmonic model; likelihood ratio test (LRT); maximum a-posteriori probability; noisy speech; pitch tracking; voice activity detection (VAD); voiced-unvoiced decision;
D O I
10.1109/TSA.2005.857806
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a novel method for voiced-unvoiced decision within a pitch tracking algorithm is presented. Voiced-unvoiced decision is required for many applications, including modeling for analysis/synthesis, detection of model changes for segmentation purposes and signal characterization for indexing and recognition applications. The proposed method is based on the generalized likelihood ratio test (GLRT) and assumes colored Gaussian noise with unknown covariance. Under voiced hypothesis, a harmonic plus noise model is assumed. The derived method is combined with a maximum a-posteriori probability (MAP) scheme to obtain a pitch and voicing tracking algorithm. The performance of the proposed method is tested using several speech databases for different levels of additive noise and phone speech conditions. Results show that the GLRT is robust to speaker and environmental conditions and performs better than existing algorithms.
引用
收藏
页码:502 / 510
页数:9
相关论文
共 50 条
  • [1] Generalized likelihood ratio test for voiced/unvoiced decision using the harmonic plus noise model
    Fisher, E
    Tabrikian, J
    Dubnov, S
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 440 - 443
  • [2] Speech enhancement based on a voiced-unvoiced speech model
    Goh, Z
    Tan, KC
    Tan, BTG
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 401 - 404
  • [3] Global Soft Decision Based Speech Enhancement Using Voiced-Unvoiced Uncertainty and Harmonic Phase Decomposition Technique
    Samui, Suman
    Chakrabarti, Indrajit
    Ghosh, Soumya Kanti
    2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [4] VOICED-UNVOICED CLASSIFICATION OF SPEECH USING AUTOCORRELATION MATRIX
    Senturk, Zekeriya
    Yetgin, Omer Emre
    Salor, Ozgul
    2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 1802 - 1805
  • [5] Towards a robust/fast continuous speech recognition system using a voiced-unvoiced decision
    O'Shaughnessy, Douglas
    Tolba, Hesham
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 413 - 416
  • [6] A multifeature voiced/unvoiced decision algorithm for noisy speech
    Shahnaz, C.
    Zhu, W. -P.
    Ahmad, M. O.
    2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 2525 - +
  • [7] Towards a robust/fast continuous speech recognition system using a voiced-unvoiced decision
    O'Shaughnessy, D
    Tolba, H
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 413 - 416
  • [8] A pitch determination and voiced/unvoiced decision algorithm for noisy speech
    Rouat, J
    Liu, YC
    Morissette, D
    SPEECH COMMUNICATION, 1997, 21 (03) : 191 - 207
  • [9] Kalman-filtering speech enhancement method based on a voiced-unvoiced speech model
    Goh, Z
    Tan, KC
    Tan, BTG
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (05): : 510 - 524
  • [10] Voiced-Unvoiced Classification of Speech Using a Neural Network Trained with LPC Coefficients
    Struwe, Kevin
    2017 INTERNATIONAL CONFERENCE ON CONTROL, ARTIFICIAL INTELLIGENCE, ROBOTICS & OPTIMIZATION (ICCAIRO), 2017, : 56 - 59