Artificial bandwidth extension using deep neural network-based spectral envelope estimation and enhanced excitation estimation

被引:20
|
作者
Li, Yaxing [1 ]
Kang, Sangwon [1 ]
机构
[1] Hanyang Univ, Dept Elect & Commun Engn, Ansan 426791, South Korea
关键词
speech synthesis; neural nets; filtering theory; speech coding; artificial bandwidth extension; deep neural network-based spectral envelope estimation; enhanced excitation estimation; narrowband speech signal quality; enhanced spectrum envelope; excitation estimation; whitening filter; adaptive spectral double shifting method; adaptive multirate codec; log spectral distortion; perceptual evaluation;
D O I
10.1049/iet-spr.2015.0375
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The authors propose a robust artificial bandwidth extension (ABE) technique to improve narrowband (NB) speech signal quality using an enhanced spectrum envelope and excitation estimation. For envelope estimation, they propose an enhanced envelope estimation method using a deep neural network with multiple layers. For excitation estimation, they use a whitened NB excitation signal that is generated by passing the excitation signal through a whitening filter. An adaptive spectral double shifting method is introduced to obtain an enhanced wideband (WB) excitation signal. The proposed ABE system is applied to the decoded output of an adaptive multi-rate (AMR) codec at 12.2 kbps. They evaluate its performance using log spectral distortion, a WB perceptual evaluation of speech quality, and a formal listening test. The objective and subjective evaluations confirm that the proposed ABE system provides better speech quality than AMR at the same bit rate.
引用
收藏
页码:422 / 427
页数:6
相关论文
共 50 条
  • [31] Deep convolutional neural network-based Bernoulli heatmap for head pose estimation
    Hu, Zhongxu
    Xing, Yang
    Lv, Chen
    Hang, Peng
    Liu, Jie
    NEUROCOMPUTING, 2021, 436 : 198 - 209
  • [32] DEEP NEURAL NETWORK-BASED NOISY PIXEL ESTIMATION FOR BREAST ULTRASOUND SEGMENTATION
    Jin, Songbai
    Lu, Wenkai
    Monkam, Patrice
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1776 - 1780
  • [33] Artificial Bandwidth Extension Using H∞ Optimization, Deep Neural Network, and Speech Production Model
    Gupta, Deepika
    Shekhawat, Hanumant Singh
    2022 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, SPCOM, 2022,
  • [34] Artificial neural network-based estimation of the eyeball position using the magnetic contact lens sensing technique
    Lee, Seung Yup
    Kim, Keun Young
    Kim, Hee Chan
    2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 7766 - 7768
  • [35] Deep neural network-based bandwidth enhancement of photoacoustic data
    Gutta, Sreedevi
    Kadimesetty, Venkata Suryanarayana
    Kalva, Sandeep Kumar
    Pramanik, Manojit
    Ganapathy, Sriram
    Yalavarthy, Phaneendra K.
    JOURNAL OF BIOMEDICAL OPTICS, 2017, 22 (11)
  • [36] Sensor Network-based Wind Field Estimation using Deep Learning
    Lee, Daniel
    Cisek, Daniel
    Yoo, Shinjae
    2017 NEW YORK SCIENTIFIC DATA SUMMIT (NYSDS), 2017,
  • [37] DoA Estimation Using Neural Network-Based Covariance Matrix Reconstruction
    Barthelme, Andreas
    Utschick, Wolfgang
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 783 - 787
  • [38] Neural Network-Based Approach for ATC Estimation Using Distributed Computing
    Pandey, Seema N.
    Pandey, Nirved K.
    Tapaswi, Shashikala
    Srivastava, Laxmi
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2010, 25 (03) : 1291 - 1300
  • [39] Enhanced prediction using deep neural network-based image classification
    Ramalakshmi, K.
    Raghavan, V. Srinivasa
    IMAGING SCIENCE JOURNAL, 2023, 71 (05): : 472 - 483
  • [40] Neural Network-based Available Bandwidth Estimation from TCP Sender-side Measurements
    Khangura, Sukhpreet Kaur
    2019 8TH INTERNATIONAL CONFERENCE ON PERFORMANCE EVALUATION AND MODELING IN WIRED AND WIRELESS NETWORKS (PEMWN), 2019,