Block-based bandwidth extension of narrowband speech signal by using CDHMM

被引:0
|
作者
Yao, S [1 ]
Chan, CF [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Engn & Informat Technol, Kowloon, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a block-based bandwidth extension system to enhance the quality of narrowband speech signal (0-4 kHz). In memoryless bandwidth extension systems, the missing high-band components are estimated from narrowband speech using the current frame only. As the narrowband-to-wideband mapping is a one-to-many problem, this memoryless system is likely to cause hissing and whistling artifacts in the reproduced speech. Our method estimates high-band components via narrowband-to-wideband state sequence mapping using continuous density hidden Markov model (CDHMM) on a block basis. The speech block is either one word or a sequence of words in narrowband utterance. CDHMM estimation method avoids the one-to-many property of low-band and high-band dependency. Both subjective and objective evaluations show that hissing and whistling artifacts are reduced and the spectrally extended wideband speech (0-8 kHz) is pleasant to listen.
引用
收藏
页码:793 / 796
页数:4
相关论文
共 50 条
  • [1] Narrowband Speech Signal Bandwidth Extension for Intelligible Speech Communication
    Ganesh, Mirishkar Sai
    Patnaik, Bijayananda
    Karthik, M. L. N. S.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNIQUES IN CONTROL, OPTIMIZATION AND SIGNAL PROCESSING (INCOS), 2017,
  • [2] Bandwidth extension of narrowband speech using cepstral analysis
    Soon, IY
    Yeo, CK
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 242 - 245
  • [3] Bandwidth extension of narrowband speech using integer wavelet transform
    Nizampatnam, Prasad
    Tappeta, Kishore Kumar
    [J]. IET SIGNAL PROCESSING, 2017, 11 (04) : 437 - 445
  • [4] Bandwidth Extension of Narrowband Speech Based on Hidden Markov Model
    Yong, Zhang
    Yi, Liu
    [J]. 2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 372 - 376
  • [5] Mapping Neural Networks for Bandwidth Extension of Narrowband Speech
    Shahina, A.
    Yegnanarayana, B.
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1435 - 1438
  • [6] Combining equalization and estimation for bandwidth extension of narrowband speech
    Qian, YS
    Kabal, P
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 713 - 716
  • [7] Bandwidth extension of narrowband speech in log spectra domain using neural network
    Pourmohammadi, Sara
    Vali, Mansour
    Ghadyani, Mohsen
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2015, 23 (02) : 433 - 446
  • [8] Frequency extension of telephone narrowband speech signal using neural networks
    Botinhao, Cassia V.
    Carlos, Bruno S.
    Caloba, Luiz P.
    Petraglia, Mariane R.
    [J]. 2006 IMACS: MULTICONFERENCE ON COMPUTATIONAL ENGINEERING IN SYSTEMS APPLICATIONS, VOLS 1 AND 2, 2006, : 1576 - +
  • [9] COMBINING FRONTEND-BASED MEMORY WITH MFCC FEATURES FOR BANDWIDTH EXTENSION OF NARROWBAND SPEECH
    Nour-Eldin, Amr H.
    Kabal, Peter
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4001 - 4004
  • [10] Mel-Frequency Cepstral Coefficient-Based Bandwidth Extension of Narrowband Speech
    Nour-Eldin, Amr H.
    Kabal, Peter
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 53 - 56