HMM-Based Cue Parameters Estimation for Speech Enhancement

Cited by: 0

Authors:

Deng, Feng [1]

Bao, Chang-chun [1]

Jia, Mao-shen [1]

Affiliations:

[1] Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Source:

2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2016

Funding:

National Natural Science Foundation of China

Keywords:

speech enhancement; HMM; cue parameters; priori information; NOISE;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory, methods]

Discipline classification code:

081202

Abstract:

In this paper, a hidden Markov model (HMM)-based method for estimating cue parameters is proposed for single-channel speech enhancement, in which the cue parameters of binaural cue coding (BCC) are applied to a single-channel speech enhancement system. First, the clean speech and noise signals are treated as the left and right channels of a stereo signal, respectively, and the noisy speech is treated as the down-mixed mono signal of the BCC method. The clean cue parameters are extracted from the clean speech and noise data set, and the pre-enhanced cue parameters are extracted from the corresponding noisy speech data set. A cue HMM is then trained offline to capture the a priori information about the clean cue parameters and the pre-enhanced cue parameters. Next, the trained cue HMM is used online to estimate the clean cue parameters from noisy speech. Finally, following the synthesis principle of BCC cue parameters, a speech estimator is constructed to enhance the noisy speech. Test results show that the proposed method outperforms the reference methods in terms of segmental signal-to-noise ratio (SNR), log spectral distortion, and PESQ.
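The stereo analogy in the abstract can be illustrated with a minimal sketch. It is not the authors' estimator: it assumes the cue is a per-band inter-channel level difference in dB, that speech and noise are uncorrelated (so the down-mix power is the sum of the two channel powers), and that the clean cue is known, whereas the paper estimates it from noisy speech with the trained cue HMM. The function names and toy values are hypothetical.

```python
import numpy as np

def level_difference_db(speech_power, noise_power, eps=1e-12):
    """Per-band level difference (dB), with speech as 'left' and noise as 'right'."""
    return 10.0 * np.log10((speech_power + eps) / (noise_power + eps))

def speech_gain_from_cue(level_diff_db):
    """Gain that maps the down-mix magnitude back to the 'left' (speech) channel.

    Assumes uncorrelated channels: mix power = speech power + noise power.
    """
    ratio = 10.0 ** (level_diff_db / 10.0)
    return np.sqrt(ratio / (1.0 + ratio))

# Toy per-band powers (illustrative values only).
speech_power = np.array([4.0, 1.0, 0.25])
noise_power = np.array([1.0, 1.0, 1.0])

# "Clean" cue computed from the separate channels (oracle, for illustration).
cue_db = level_difference_db(speech_power, noise_power)

# Noisy speech plays the role of the down-mixed mono signal.
noisy_magnitude = np.sqrt(speech_power + noise_power)

# Synthesis step: scale the down-mix by the cue-derived gain.
estimated_speech_magnitude = speech_gain_from_cue(cue_db) * noisy_magnitude
```

With an exact cue, this gain recovers the speech magnitude in each band under the stated assumptions; in practice the quality of the enhanced speech would depend on how well the cue is estimated from the noisy input.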

Pages: 4
