Performance evaluation of an audio perceptual subband coder with dynamic bit allocation

被引:0
|
作者
Caini, C
Coralli, AV
机构
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the paper the performance of a perceptual audio coder for high quality, low bit rate coding of single notes of orchestra instruments such as pianos, bass drums, snares, triangles, etc, is investigated The coding system consists of a tree-structured perfect reconstruction filter bank with a dynamic bit allocation based on a backward estimation of the perceptible signal energy. The algorithm is applied to CD-quality elementary instruments sounds. The quality assessment is done on the basis of both objective and subjective measures. The subjective tests were carried out by expert musicians who judged the level of similarity between the original CD sound and the compressed one. Moreover, to investigate the possibility of a real implementation of the algorithm, the complexity cost was taken Into account, and the evaluation was carried out for different filter lengths, using 16 bits quantized coefficient filter sets. Results show that transparent coding can be achieved for such signals at 1.6 divided by 1.7 bit per sample depending on the instruments considered.
引用
收藏
页码:567 / 570
页数:4
相关论文
共 50 条
  • [41] A low bit-rate audio coder based on modified sinusoidal model
    Song, SP
    Yin, JX
    Yu, YC
    Raed, AM
    2002 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS AND WEST SINO EXPOSITION PROCEEDINGS, VOLS 1-4, 2002, : 648 - 652
  • [42] A design of transform coder for both speech and audio signals at 1 bit/sample
    Moriya, T
    Iwakami, N
    Jin, A
    Ikeda, K
    Miki, S
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 1371 - 1374
  • [43] Perceptually transparent audio compression based on a variable bit rate AAC coder
    Szwabe, A
    Jedrzejek, C
    PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 685 - 690
  • [44] Implementation and Evaluation of Variable Bit Rates CELP Coder
    Mahmoud, Eman Mohammed
    Elgarf, Talaat A.
    Abd Elhafez, Ahmed
    Zekry, Abd El-Halim
    2012 SEVENTH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES'2012), 2012, : 75 - 80
  • [45] A fast bit allocation algorithm for MPEG audio encoder
    Fung, KT
    Chan, YL
    Siu, YC
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 5 - 8
  • [46] A perceptual bit allocation scheme for H.264
    Yu, HT
    Pan, F
    Lin, ZP
    Sun, Y
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 313 - 316
  • [47] SUBBAND/VQ CODING OF COLOR IMAGES WITH PERCEPTUALLY OPTIMAL BIT ALLOCATION
    VANDYCK, RE
    RAJALA, SA
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1994, 4 (01) : 68 - 82
  • [48] Enhancing the performance of subband audio coders for speech signals
    Malvar, H
    ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : D98 - D101
  • [49] Perceptual Audio Object Coding Using Adaptive Subband Grouping with CNN and Residual Block
    Wu, Yulin
    Hu, Ruimin
    Wang, Xiaochen
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2543 - 2548
  • [50] A new quantization optimization algorithm for the MPEG advanced audio coder using a statistical subband model of the quantization noise
    Derrien, O
    Duhamel, P
    Charbit, M
    Richard, G
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04): : 1328 - 1339