Scalable audio coder based on quantizer units of MDCT coefficients

Cited by: 9
Authors
Jin, A [1]
Moriya, T [1]
Norimatsu, T [1]
Tsushima, M [1]
Ishikawa, T [1]
Affiliation
[1] NTT Human Interface Labs, Musashino, Tokyo 1808585, Japan
DOI
10.1109/ICASSP.1999.759816
CLC number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
A scalable codec has been constructed using transform coding and basic modules for a scalable encoder and decoder. It allows users to choose among a variety of scalable configurations in the frequency domain. The basic module is a quantizer that can quantize MDCT (modified discrete cosine transform) [1] coefficients transformed from a variety of frequency regions. This module mainly works at bit rates of more than 8 kbit/s. We can also change the target frequency regions of the basic module's input-output signals in each transform frame; i.e., we can change the scalable structure according to the nature of the input signals. In the scalable codec described here, the input-output signals are monaural and the sampling frequency is 24 kHz. The total bit rate of this scalable codec is more than 8 kbit/s. Subjective quality evaluation tests, mainly for musical sound sources, showed that its sound quality is better than that of an MPEG-2 Layer 3 codec at 8, 16, and 24 kbit/s when our scalable codec is constructed of 8-kbit/s basic modules.
Pages: 897-900
Page count: 4
Related papers
  • [1] A conditional enhancement-layer quantizer for the scalable MPEG advanced audio coder
    Aggarwal, A
    Rose, K
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1833 - 1836
  • [2] A scalable watermarking scheme for the scalable audio coder
    Li, Z
    Sun, QB
    Lian, Y
    Yu, RS
    [J]. ICC 2005: IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-5, 2005, : 1341 - 1346
  • [3] Robust Audio Watermarking Based on MDCT Coefficients
    Wang, Mu-Liang
    Lin, Hong-Xun
    Lee, Mn-Ta
    [J]. 2012 SIXTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING (ICGEC), 2012, : 372 - 375
  • [4] Audio Perceptual Hashing Based on NMF and MDCT Coefficients
    Li Jinfeng
    Wang Hongxia
    Jing Yi
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (03) : 579 - 583
  • [5] Audio Perceptual Hashing Based on NMF and MDCT Coefficients
    LI Jinfeng
    WANG Hongxia
    JING Yi
    [J]. Chinese Journal of Electronics, 2015, 24 (03) : 579 - 583
  • [6] Design and analysis of a scalable watermarking scheme for the scalable audio coder
    Li, Zhi
    Sun, Qibin
    Lian, Yong
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (08) : 3064 - 3077
  • [7] Superwideband Bandwidth Extension Using Normalized MDCT Coefficients for Scalable Speech and Audio Coding
    Lee, Young Han
    Choi, Seung Ho
    [J]. ADVANCES IN MULTIMEDIA, 2013, 2013
  • [8] A fine granular scalable to lossless audio coder
    Yu, Rongshan
    Rahardja, Susanto
    Xiao, Lin
    Ko, Chi Chung
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04) : 1352 - 1363
  • [9] A scalable low bitrate audio and speech coder
    Zhang, Yong
    Wang, Xiaochen
    Hu, Ruimin
    [J]. 2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007, : 1561 - 1565
  • [10] Lossless scalable audio coder and quality enhancement
    Moriya, T
    Jin, A
    Mori, T
    Ikeda, K
    Kaneko, T
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1829 - 1832