An adaptive tiling of the time-frequency plane with application to multiresolution-based perceptive audio coding

Cited by: 2
Authors
Prelcic, NG [1]
Pena, AS [1]
Affiliations
[1] Univ Vigo, ETSI Telecomunicac, Vigo 36200, Spain
Keywords
tree structured filter banks; best basis selection; psychoacoustic model; perceptual audio coding
DOI
10.1016/S0165-1684(00)00209-7
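Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract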
The research efforts recently devoted to the theoretical treatment of time-varying filter bank structures have created an adequate environment for the development of adaptive analysis tools, useful in many signal processing problems. In this paper we present a complete adaptive analysis system suitable for the compression of audio signals; we would like to remark that our goal is not to develop a complete audio coder, just the appropriate analysis/synthesis scheme. For basis selection, we propose a search algorithm and a new cost function based on perceptual considerations. In addition, a procedure that determines the optimum length of the analysis segment is described. Examples are given of the performance of these algorithms on some representative audio signals. When the search method is compared with previous basis selection algorithms, we find and show that, for the application of audio compression, the perceptually driven basis selection algorithm leads to a much higher compression efficiency and better quality of the reconstructed signal. (C) 2001 Elsevier Science B.V. All rights reserved.
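The abstract does not give the search algorithm or the perceptual cost function, so neither is reproduced here. For orientation only, the following is a minimal sketch of the generic idea of best basis selection on a tree-structured filter bank: a bottom-up search over a Haar wavelet packet tree that keeps a split only when the children are cheaper than the parent. The Haar filters, the entropy-style cost, and all function names (`haar_split`, `entropy_cost`, `best_basis`) are assumptions made for illustration, not the authors' method.

```python
import math


def haar_split(x):
    """Split an even-length sequence into Haar low-pass and high-pass halves."""
    s = 1.0 / math.sqrt(2.0)
    low = [(x[i] + x[i + 1]) * s for i in range(0, len(x), 2)]
    high = [(x[i] - x[i + 1]) * s for i in range(0, len(x), 2)]
    return low, high


def entropy_cost(coeffs):
    """Additive entropy-style cost: -sum(c^2 * log(c^2)).

    Stand-in only; the paper proposes a perceptually motivated cost instead.
    """
    return -sum(c * c * math.log(c * c) for c in coeffs if c != 0.0)


def best_basis(x, max_depth, cost=entropy_cost):
    """Return (total_cost, leaves) for the cheapest pruned packet tree.

    leaves is a list of (path, coeffs); path is a string of 'L'/'H' steps
    from the root, so its length is the depth of that subband.
    """
    def search(node, path, depth):
        own = cost(node)
        # Stop at the depth limit or when the node cannot be halved.
        if depth == max_depth or len(node) < 2 or len(node) % 2:
            return own, [(path, node)]
        low, high = haar_split(node)
        c_low, leaves_low = search(low, path + "L", depth + 1)
        c_high, leaves_high = search(high, path + "H", depth + 1)
        # Keep the split only if the children are strictly cheaper.
        if c_low + c_high < own:
            return c_low + c_high, leaves_low + leaves_high
        return own, [(path, node)]

    return search(list(x), "", 0)


if __name__ == "__main__":
    total, leaves = best_basis([1.0] * 8, max_depth=3)
    for path, coeffs in leaves:
        print(path or "(root)", coeffs)
```

Because the cost is additive over subbands, each parent-versus-children comparison is local, which is what makes the pruning a single pass over the tree. Any other additive cost can be passed through the `cost` argument.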
Pages: 301-319
Page count: 19
Related papers
50 in total
  • [1] An adaptive tree search algorithm with application to multiresolution based perceptive audio coding
    Prelcic, NG
    Pena, AS
    [J]. PROCEEDINGS OF THE IEEE-SP INTERNATIONAL SYMPOSIUM ON TIME-FREQUENCY AND TIME-SCALE ANALYSIS, 1996, : 117 - 120
  • [2] Perceptual coding of audio signals using adaptive time-frequency transform
    Umapathy K.
    Krishnan S.
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2007 (1)
  • [3] New algorithm for searching minimum bit rate wavelet representations with application to multiresolution-based perceptual audio coding
    Ruiz, N
    Rosa, M
    López, F
    Martínez, D
    Mata, R
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 286 - 289
  • [4] Algorithm for achieving adaptive tiling of time axis for audio coding purposes
    Ruiz, N
    Rosa, M
    López, F
    Vera, P
    [J]. ELECTRONICS LETTERS, 2002, 38 (09) : 434 - 435
  • [5] Audio coding using dynamic time-frequency decompositions
    Purat, M
    [J]. FREQUENZ, 1996, 50 (9-10) : 205 - 210
  • [6] Time-frequency audio feature extraction based on tensor representation of sparse coding
    Zhang, Xue-Yuan
    He, Qian-Hua
    [J]. ELECTRONICS LETTERS, 2015, 51 (02) : 131 - U20
  • [7] An Efficient Time-Frequency Representation for Parametric-Based Audio Object Coding
    Beack, Seungkwon
    Lee, Taejin
    Kim, Minje
    Kang, Kyeongok
    [J]. ETRI JOURNAL, 2011, 33 (06) : 945 - 948
  • [8] Time-Frequency Filtering Based on Model Fitting in the Time-Frequency Plane
    Colominas, Marcelo A.
    Meignen, Sylvain
    Duong-Hung Pham
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (05) : 660 - 664
  • [9] Block-recursive, multirate filterbanks with arbitrary time-frequency plane tiling
    Laine, UK
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 1461 - 1464
  • [10] Adaptive minimum entropy decomposition on the time-frequency plane
    Shan, Zeyong
    Aviyente, Selin
    [J]. 2005 IEEE/SP 13th Workshop on Statistical Signal Processing (SSP), Vols 1 and 2, 2005, : 801 - 804