Scalable audio coder based on quantizer units of MDCT coefficients

被引：9

作者：

Jin, A ^{[1
]}

Moriya, T ^{[1
]}

Norimatsu, T ^{[1
]}

Tsushima, M ^{[1
]}

Ishikawa, T ^{[1
]}

机构：

[1] NTT Human Interface Labs, Musashino, Tokyo 1808585, Japan

来源：

ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI | 1999年

关键词：

D O I：

10.1109/ICASSP.1999.759816

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

A scalable codec has been constructed by using transform coding and the basic modules for scalable encoder and decoder, It allows users to choose a variety of scalable configrations in the frequency domain. The basic module is a quantizer that can quantize MDCT (Modified DCT)[1] coefficients transformed from a variety of frequency regions. This module mainly works at bitrates of more than 8 kbit/s. We can also change the target frequency regions of the basic module's input-output signals in each transform frame; i.e., we can change the scalable structure according to the nature of input signals. in the scalable codec described here, the input-output signals are monaural and the sampling frequency is 24 kHz. The total bit rate of this scalable codec is more than 8 kbit/s. Subjective quality evaluation tests, mainly for musical sound sources, showed that it's sound quality is better than that of an MPEG2-layer3 codec at 8, 16, and 24 kbit/s when our scalable codec is construced of 8-kbit/s basic modules.

引用

页码：897 / 900

页数：4

共 50 条

[1] A conditional enhancement-layer quantizer for the scalable MPEG advanced audio coder
Aggarwal, A
Rose, K
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1833 - 1836
[2] A scalable watermarking scheme for the scalable audio coder
Li, Z
Sun, QB
Lian, Y
Yu, RS
[J]. ICC 2005: IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-5, 2005, : 1341 - 1346
[3] Robust Audio Watermarking Based on MDCT Coefficients
Wang, Mu-Liang
Lin, Hong-Xun
Lee, Mn-Ta
[J]. 2012 SIXTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING (ICGEC), 2012, : 372 - 375
[4] Audio Perceptual Hashing Based on NMF and MDCT Coefficients
Li Jinfeng
Wang Hongxia
Jing Yi
[J]. CHINESE JOURNAL OF ELECTRONICS, 2015, 24 (03) : 579 - 583
[5] Audio Perceptual Hashing Based on NMF and MDCT Coefficients
LI Jinfeng
WANG Hongxia
JING Yi
[J]. Chinese Journal of Electronics, 2015, 24 (03) : 579 - 583
[6] Design and analysis of a scalable watermarking scheme for the scalable audio coder
Li, Zhi
Sun, Qibin
Lian, Yong
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (08) : 3064 - 3077
[7] Superwideband Bandwidth Extension Using Normalized MDCT Coefficients for Scalable Speech and Audio Coding
Lee, Young Han
Choi, Seung Ho
[J]. ADVANCES IN MULTIMEDIA, 2013, 2013
[8] A fine granular scalable to lossless audio coder
Yu, Rongshan
Rahardja, Susanto
Xiao, Lin
Ko, Chi Chung
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04): : 1352 - 1363
[9] A scalable low bitrate audio and speech coder
Zhang, Yong
Wang, Xiaochen
Hu, Ruimin
[J]. 2007 INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES, VOLS 1-3, 2007, : 1561 - 1565
[10] Lossless scalable audio coder and quality enhancement
Moriya, T
Jin, A
Mori, T
Ikeda, K
Kaneko, T
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1829 - 1832

← 1 2 3 4 5 →