A perceptual subspace approach for modeling of speech and audio signals with damped sinusoids

被引:27
|
作者
Jensen, J [1 ]
Heusdens, R
Jensen, SH
机构
[1] Delft Univ Technol, Dept Mediamat, NL-2628 CD Delft, Netherlands
[2] Aalborg Univ, Dept Commun Technol, DK-9220 Aalborg, Denmark
来源
关键词
complex exponentials; perceptually relevant sinusoids; psycho-acoustical distortion measure; sinusoidal modeling; speech and audio processing; subspace-based signal analysis;
D O I
10.1109/TSA.2003.819948
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The problem of modeling a signal segment as a sum of exponentially damped sinusoidal components arises in many different application areas, including speech and audio processing. Often, model parameters are estimated using subspace based techniques which arrange the input signal in a structured matrix and exploit the so-called shift-invariance property related to certain vector spaces of the input matrix. A problem with this class of estimation algorithms, when used for speech and audio processing, is that the perceptual importance of the sinusoidal components is not taken into account. In this work we propose a solution to this problem. In particular, we show how to combine well-known subspace based estimation techniques with a recently developed perceptual distortion measure, in order to obtain,an algorithm for extracting perceptually relevant model components. In analysis-synthesis experiments with wideband audio signals, objective and subjective evaluations show that the proposed Algorithm improves perceived signal quality considerable over traditional subspace based analysis methods.
引用
收藏
页码:121 / 132
页数:12
相关论文
共 50 条
  • [41] Estimation of sinusoids in audio signals using an analysis-by-synthesis neural network
    García, G
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 3369 - 3372
  • [42] 2-D Frequency Estimation of Multiple Damped Sinusoids Using Subspace and Projection Separation Approaches
    Huang, Longting
    Wu, Yuntao
    So, Hing Cheung
    Zhang, Yanduo
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2011, E94A (09) : 1842 - 1846
  • [43] Towards a new perceptual coding paradigm for audio signals
    Der, R
    Kabal, P
    Chan, WY
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 457 - 460
  • [44] Subspace-based parameter estimation of exponentially damped sinusoids using prior knowledge of frequency and phase
    Chen, H
    VanHuffel, S
    vandenBoom, A
    vandenBosch, P
    SIGNAL PROCESSING, 1997, 59 (01) : 129 - 136
  • [45] A review of algorithms for perceptual coding of digital audio signals
    Painter, T
    Spanias, A
    DSP 97: 1997 13TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2: SPECIAL SESSIONS, 1997, : 179 - 208
  • [46] Declipping of Audio Signals Using Perceptual Compressed Sensing
    Defraene, Bruno
    Mansour, Naim
    De Hertogh, Steven
    van Waterschoot, Toon
    Diehl, Moritz
    Moonen, Marc
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (12): : 2627 - 2637
  • [47] ESTIMATING THE PARAMETERS OF EXPONENTIALLY DAMPED UNDAMPED SINUSOIDS IN NOISE - A NONITERATIVE APPROACH
    KUNDU, D
    MITRA, A
    SIGNAL PROCESSING, 1995, 46 (03) : 363 - 368
  • [48] A progressive approach for perceptual audio coding
    Shen, Y
    Ai, HM
    Kuo, CCJ
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 815 - 818
  • [49] Parametric subspace modeling of speech transitions
    Reinhard, K
    Niranjan, M
    SPEECH COMMUNICATION, 1999, 27 (01) : 19 - 42
  • [50] Robust noise reduction for speech and audio signals
    Godsill, SJ
    Rayner, PJW
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 625 - 628