Postfiltering Using Log-Magnitude Spectrum for Speech and Audio Coding

被引:4
|
作者
Das, Sneha [1 ]
Backstrom, Tom [1 ]
机构
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
关键词
Quantization noise; Speech modelling; postfiltering; noise filling; Time-Frequency correlation;
D O I
10.21437/Interspeech.2018-1027
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Advanced coding algorithms yield high quality signals with good coding efficiency within their target bit-rate ranges, but their performance suffer outside the target range. At lower bitrates, the degradation in performance is because the decoded signals are sparse, which gives a perceptually muffled and distorted characteristic to the signal. Standard codecs reduce such distortions by applying noise filling and post-filtering methods. In this paper, we propose a post-processing method based on modeling the inherent time-frequency correlation in the log-magnitude spectrum. The goal is to improve the perceptual SNR of the decoded signals and, to reduce the distortions caused by signal sparsity. Objective measures show an average improvement of 1.5 dB for input perceptual SNR in range 4 to 18 dB. The improvement is especially prominent in components which had been quantized to zero.
引用
收藏
页码:3543 / 3547
页数:5
相关论文
共 50 条
  • [31] Sound specific modelling and synthesis with a new postfiltering in low bit rate speech coding
    de Lamare, RC
    da Silva, LM
    Alcaim, A
    2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL III, PROCEEDINGS, 2002, : 843 - 846
  • [32] CODING OF SPEECH AND WIDE-BAND AUDIO
    JAYANT, NS
    LAWRENCE, VB
    PREZAS, DP
    AT&T TECHNICAL JOURNAL, 1990, 69 (05): : 25 - 41
  • [33] Wideband speech and audio coding in the perceptual domain
    Lin, L
    Ambikairajah, E
    Holmes, WH
    ADVANCED SIGNAL PROCESSING FOR COMMUNICATION SYSTEMS, 2002, 703 : 15 - 30
  • [34] WIDE-BAND SPEECH AND AUDIO CODING
    NOLL, P
    IEEE COMMUNICATIONS MAGAZINE, 1993, 31 (11) : 34 - 44
  • [35] A novel fast algorithm for speech and audio coding
    Guz, Umit
    Gurkan, Hakan
    Yarman, B. Siddik
    2007 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, 2007, : 4020 - +
  • [36] SPECTRUM ANALYSIS IN SPEECH CODING
    FLANAGAN, JL
    IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1967, AU15 (02): : 66 - &
  • [37] Bandwidth extension of telephone speech using magnitude spectrum data hiding
    Nizampatnam P.
    Tappeta K.K.
    International Journal of Speech Technology, 2017, 20 (1) : 151 - 162
  • [38] Blind Channel Magnitude Response Estimation in Speech Using Spectrum Classification
    Gaubitch, Nikolay D.
    Brookes, Mike
    Naylor, Patrick A.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10): : 2162 - 2171
  • [39] Hybrid Audio Coding for speech and audio below medium bit bate
    Makino, K
    Matsumoto, J
    IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - 2000 DIGEST OF TECHNICAL PAPERS, 2000, : 264 - 265
  • [40] Combined coding of audio and speech signals using LPC and the discrete wavelet transform
    Mason, M
    Boland, S
    Sridharan, S
    Deriche, M
    IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 747 - 750