A Perceptual Model for Sinusoidal Audio Coding Based on Spectral Integration

被引:0
|
作者
Steven van de Par
Armin Kohlrausch
Richard Heusdens
Jesper Jensen
Søren Holdt Jensen
机构
[1] Philips Research Laboratories,Digital Signal Processing Group
[2] Eindhoven University of Technology,Department of Technology Management
[3] Delft University of Technology,Department of Mediamatics
[4] Institute of Electronic Systems,Department of Communication Technology
[5] Aalborg University,undefined
关键词
audio coding; psychoacoustical modelling; auditory masking; spectral masking; sinusoidal modelling; psychoacoustical matching pursuit;
D O I
暂无
中图分类号
学科分类号
摘要
Psychoacoustical models have been used extensively within audio coding applications over the past decades. Recently, parametric coding techniques have been applied to general audio and this has created the need for a psychoacoustical model that is specifically suited for sinusoidal modelling of audio signals. In this paper, we present a new perceptual model that predicts masked thresholds for sinusoidal distortions. The model relies on signal detection theory and incorporates more recent insights about spectral and temporal integration in auditory masking. As a consequence, the model is able to predict the distortion detectability. In fact, the distortion detectability defines a (perceptually relevant) norm on the underlying signal space which is beneficial for optimisation algorithms such as rate-distortion optimisation or linear predictive coding. We evaluate the merits of the model by combining it with a sinusoidal extraction method and compare the results with those obtained with the ISO MPEG-1 Layer I-II recommended model. Listening tests show a clear preference for the new model. More specifically, the model presented here leads to a reduction of more than 20% in terms of number of sinusoids needed to represent signals at a given quality level.
引用
收藏
相关论文
共 50 条
  • [1] A perceptual model for sinusoidal audio coding based on spectral integration
    van de Par, S
    Kohlrausch, A
    Heusdens, R
    Jensen, J
    Jensen, SH
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (09) : 1292 - 1304
  • [2] Perceptual component selection in sinusoidal coding of audio
    Painter, T
    Spanias, A
    2001 IEEE FOURTH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2001, : 187 - 192
  • [3] Perceptual audio coding using sinusoidal/optimum wavelet representation
    Sathidevi, PS
    Venkataramani, Y
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2002, 21 (05) : 511 - 524
  • [4] Perceptual Audio Coding Using Sinusoidal/Optimum Wavelet Representation
    P.S. Sathidevi
    Y. Venkataramani
    Circuits, Systems and Signal Processing, 2002, 21 : 511 - 524
  • [5] Joint speech/audio coding based scalable perceptual audio coding
    Gao, Li
    Hu, Ruimin
    Yang, Yuhong
    2014 IEEE/ACIS 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2014, : 419 - 424
  • [6] Sinusoidal modelling using perceptual matching pursuits in the Bark scale for parametric audio coding
    Vera-Candeas, P.
    Ruiz-Reyes, N.
    Cuevas-Martinez, J. C.
    Rosa-Zurera, M.
    Lopez-Ferreras, F.
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2006, 153 (04): : 431 - 435
  • [7] Perceptual coding of digital audio
    Painter, T
    Spanias, A
    PROCEEDINGS OF THE IEEE, 2000, 88 (04) : 451 - 513
  • [8] On frequency quantization in sinusoidal audio coding
    Vafin, R
    Prakash, D
    Kleijn, WB
    IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (03) : 210 - 213
  • [9] MATCHING PURSUITS BASED ON PERCEPTUAL DISTORTION MINIMIZATION FOR SINUSOIDAL AUDIO MODELLING
    Ruiz Reyes, N.
    Vera Candeas, P.
    Canadas, F. J.
    Carabias, J. J.
    Cabanas, P.
    Rodriguez, F.
    SIGMAP 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATION, 2010, : 97 - 102
  • [10] Audio coding based on rate-distortion and perceptual optimization
    Erne, M
    Moschytz, G
    WAVELET APPLICATIONS VII, 2000, 4056 : 235 - 246