HMM-based speech enhancement using sub-word models and noise adaptation

被引：2

作者：

Kato, Akihiro ^{[1
]}

Milner, Ben ^{[1
]}

机构：

[1] Univ East Anglia, Norwich, Norfolk, England

来源：

17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES | 2016年

关键词：

speech enhancement; HMMs; STRAIGHT; noise adaptation; FREQUENCY;

D O I：

10.21437/Interspeech.2016-928

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This work proposes a method of speech enhancement that uses a network of HMMs to first decode noisy speech and to then synthesise a set of features that enables a clean speech signal to be reconstructed. Different choices of acoustic model (whole-word, monophone and triphone) and grammars (highly constrained to no constraints) are considered and the effects of introducing or relaxing acoustic and grammar constraints investigated. For robust operation in noisy conditions it is necessary for the HMMs to model noisy speech and consequently noise adaptation is investigated along with its effect on the reconstructed speech. Speech quality and intelligibility analysis find triphone models with no grammar, combined with noise adaptation, gives highest performance that outperforms conventional methods of enhancement at low signal-to-noise ratios.

引用

页码：3748 / 3752

页数：5

共 50 条

[1] HMM-based noise estimator for speech enhancement
许春冬
夏日升
应冬文
李军锋
颜永红
[J]. Journal of Beijing Institute of Technology, 2014, 23 (04) : 549 - 556
[2] HMM-based noise estimator for speech enhancement
[J]. Xia, Ri-Sheng (xiarisheng@hccl.ioa.ac.cn), 1600, Beijing Institute of Technology (23):
[3] HMM-based gain modeling for enhancement of speech in noise
Zhao, David Y.
Kleijn, W. Bastiaan
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 882 - 892
[4] HMM-based speech enhancement using harmonic modeling
Deisher, ME
Spanias, AS
[J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1175 - 1178
[5] HMM-Based strategies for enhancement of speech signals embedded in nonstationary noise
Sameti, H
Sheikhzadeh, H
Deng, L
Brennan, RL
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (05): : 445 - 455
[6] An acoustic model adaptation using hmm-based speech synthesis
Tanaka, K
Kuroiwa, S
Tsuge, S
Ren, F
[J]. 2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 368 - 373
[7] Unsupervised adaptation for HMM-based speech synthesis
King, Simon
Tokuda, Keiichi
Zen, Heiga
Yamagishi, Junichi
[J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1869 - +
[8] HMM-based speech enhancement using explicit gain modeling
Zhao, David Y.
Kleijn, W. Bastiaan
[J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 161 - 164
[9] Noise in HMM-Based Speech Synthesis Adaptation: Analysis, Evaluation Methods and Experiments
Karhila, Reima
Remes, Ulpu
Kurimo, Mikko
[J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2014, 8 (02) : 285 - 295
[10] A cepstrum domain HMM-based speech enhancement method applied to nonstationary noise
Nilsson, M
Dahl, M
Claesson, I
[J]. SIGNAL PROCESSING FOR TELECOMMUNICATIONS AND MULTIMEDIA, 2005, 27 : 1 - 13

← 1 2 3 4 5 →