Artificial bandwidth extension using deep neural network-based spectral envelope estimation and enhanced excitation estimation

被引:20
|
作者
Li, Yaxing [1 ]
Kang, Sangwon [1 ]
机构
[1] Hanyang Univ, Dept Elect & Commun Engn, Ansan 426791, South Korea
关键词
speech synthesis; neural nets; filtering theory; speech coding; artificial bandwidth extension; deep neural network-based spectral envelope estimation; enhanced excitation estimation; narrowband speech signal quality; enhanced spectrum envelope; excitation estimation; whitening filter; adaptive spectral double shifting method; adaptive multirate codec; log spectral distortion; perceptual evaluation;
D O I
10.1049/iet-spr.2015.0375
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The authors propose a robust artificial bandwidth extension (ABE) technique to improve narrowband (NB) speech signal quality using an enhanced spectrum envelope and excitation estimation. For envelope estimation, they propose an enhanced envelope estimation method using a deep neural network with multiple layers. For excitation estimation, they use a whitened NB excitation signal that is generated by passing the excitation signal through a whitening filter. An adaptive spectral double shifting method is introduced to obtain an enhanced wideband (WB) excitation signal. The proposed ABE system is applied to the decoded output of an adaptive multi-rate (AMR) codec at 12.2 kbps. They evaluate its performance using log spectral distortion, a WB perceptual evaluation of speech quality, and a formal listening test. The objective and subjective evaluations confirm that the proposed ABE system provides better speech quality than AMR at the same bit rate.
引用
收藏
页码:422 / 427
页数:6
相关论文
共 50 条
  • [21] An Artificial Neural Network-Based Model for Effective Software Development Effort Estimation
    Rashid, Junaid
    Kanwal, Sumera
    Nisar, Muhammad Wasif
    Kim, Jungeun
    Hussain, Amir
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (02): : 1309 - 1324
  • [22] Comparative study of conventional and artificial neural network-based ETo estimation models
    M. Kumar
    A. Bandyopadhyay
    N. S. Raghuwanshi
    R. Singh
    Irrigation Science, 2008, 26 : 531 - 545
  • [23] An artificial neural network-based earthquake casualty estimation model for Istanbul city
    Gul, Muhammet
    Guneri, Ali Fuat
    NATURAL HAZARDS, 2016, 84 (03) : 2163 - 2178
  • [24] A probabilistic artificial neural network-based procedure for variance change point estimation
    Amiri, Amirhossein
    Niaki, S. T. A.
    Moghadam, Alireza Taheri
    SOFT COMPUTING, 2015, 19 (03) : 691 - 700
  • [25] An Artificial Neural Network-Based Estimation of Bremsstarahlung Photon Flux Calculated by MCNPX
    Tekin, H. O.
    Manici, T.
    Altunsoy, E. E.
    Yilancioglu, K.
    Yilmaz, B.
    ACTA PHYSICA POLONICA A, 2017, 132 (03) : 967 - 969
  • [26] An artificial neural network-based earthquake casualty estimation model for Istanbul city
    Muhammet Gul
    Ali Fuat Guneri
    Natural Hazards, 2016, 84 : 2163 - 2178
  • [27] Artificial Neural Network-Based Activities Classification, Gait Phase Estimation, and Prediction
    Shuangyue Yu
    Jianfu Yang
    Tzu-Hao Huang
    Junxi Zhu
    Christopher J. Visco
    Farah Hameed
    Joel Stein
    Xianlian Zhou
    Hao Su
    Annals of Biomedical Engineering, 2023, 51 : 1471 - 1484
  • [28] Neural Network-Based Estimation for OFDM Channels
    Cheng, Chia-Hsin
    Huang, Yung-Fa
    Huang, Yao-Hung
    Chen, Hsing-Chung
    Yao, Tsung-Yu
    2015 IEEE 29th International Conference on Advanced Information Networking and Applications (IEEE AINA 2015), 2015, : 600 - 604
  • [29] Neural network-based ATM QoS estimation
    Sheng, WB
    Rueda, J
    Blight, D
    IEEE WESCANEX 97 COMMUNICATIONS, POWER AND COMPUTING CONFERENCE PROCEEDINGS, 1997, : 1 - 6
  • [30] Performance estimation of a neural network-based controller
    Schumann, Johann
    Liu, Yan
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 2, PROCEEDINGS, 2006, 3972 : 981 - 990