Artificial Bandwidth Extension Using H∞ Optimization and Speech Production Model

被引:0
|
作者
Gupta, Deepika [1 ]
Shekhawat, H. S. [1 ]
机构
[1] Indian Inst Technol Guwahati, Elect & Elect Engn, Gauhati, India
关键词
H-infinity system norm; speech production filter; signal model; codebook; lifting; SPECTRAL ENVELOPE;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This work presents a new method for artificial bandwidth extension (ABE) in narrowband telephonic communication. In this regard, we use signal model and H-infinity optimization to obtain a synthesis filter for representing the wideband information of a signal. We need to estimate the high-band information in narrowband communication. Hence, we construct a high-band filter which retains the high-band information of the synthesis filter. Signal models may not be the same for different speech signals because of their non-stationary (time-varying) behavior. Hence, a short time processing (framing) is applied to speech signals for converting them into the stationary frames. Signal models of stationary frames may be different. As a result, their high-band filters will vary. So, a Gaussian mixture modelling (GMM) codebook approach is used to store the high-band filters information along with their narrowband information (narrowband feature). This approach is also used to estimate the high-band filter information for a given narrowband feature of the signal. Performance analysis is done for the two types of narrowband information representations.
引用
收藏
页码:181 / 186
页数:6
相关论文
共 50 条
  • [41] Speech Quality Evaluation of Artificial Bandwidth Extension: Comparing Subjective Judgments and Instrumental Predictions
    Pulakka, Hannu
    Myllyla, Ville
    Ramo, Anssi
    Alku, Paavo
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2583 - 2587
  • [42] ON THE POTENTIAL FOR ARTIFICIAL BANDWIDTH EXTENSION OF BONE AND TISSUE CONDUCTED SPEECH: MUTUAL INFORMATION STUDY
    Bouserhal, Rachel E.
    Falk, Tiago H.
    Voix, Jeremie
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5108 - 5112
  • [43] A Novel Method of Artificial Bandwidth Extension Using Deep Architecture
    Liu, Bin
    Tao, Jianhua
    Wen, Zhengqi
    Li, Ya
    Bukhari, Danish
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2598 - 2602
  • [44] SPEECH BANDWIDTH EXTENSION USING GAUSSIAN MIXTURE MODEL-BASED ESTIMATION OF THE HIGHBAND MEL SPECTRUM
    Pulakka, Hannu
    Remes, Ulpu
    Palomaki, Kalle
    Kurimo, Mikko
    Alku, Paavo
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5100 - 5103
  • [45] Bandwidth extension of narrow band speech using cepstral linear prediction
    Jaisimha, S
    Soon, IY
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1404 - 1407
  • [46] Low-Frequency Bandwidth Extension of Telephone Speech Using Sinusoidal Synthesis and Gaussian Mixture Model
    Pulakka, Hannu
    Remes, Ulpu
    Yrttiaho, Santeri
    Palomaki, Kalle
    Kurimo, Mikko
    Alku, Paavo
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1188 - +
  • [47] Speech Bandwidth Extension Using Recurrent Temporal Restricted Boltzmann Machines
    Wang, Yingxue
    Zhao, Shenghui
    Li, Jianxin
    Kuang, Jingming
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (12) : 1877 - 1881
  • [48] Speech bandwidth extension using transform-domain data hiding
    Kurada, Phaneendra
    Maruvada, Sailaja
    Sanagapallea, Koteswara Rao
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (02) : 305 - 312
  • [49] Generally Applicable Deep Speech Inpainting Using the Example of Bandwidth Extension
    Thieling, Lars
    Jax, Peter
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 451 - 455
  • [50] Speech bandwidth extension using transform-domain data hiding
    Phaneendra Kurada
    Sailaja Maruvada
    Koteswara Rao Sanagapallea
    International Journal of Speech Technology, 2019, 22 : 305 - 312