Low power MPEG/Audio encoders using simplified psychoacoustic model and fast bit allocation

被引：9

作者：

Oh, HO

Kim, JS

Song, CJ

Park, YC

Youn, DH

机构：

[1] Yonsei Univ, Dept Elect & Elect Engn, ASSP Lab, Sudaemoon Ku, Seoul 120749, South Korea

[2] Yonsei Univ, Ctr Signal Proc Res, ASSP Lab, Sudaemoon Ku, Seoul 120749, South Korea

来源：

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS | 2001年 / 47卷 / 03期

关键词：

D O I：

10.1109/30.964154

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we propose novel techniques for the implementation of MPEG/Audio (Layer II and Layer III) encoder. The proposed techniques concern implementing the encoder with a minimum complexity. As an effort to minimize the complexity, the ISO psycho-acoustic model (PAM) is simplified that often demands significant computational power of the implementation system. The simplification follows the statistical behavior of the PAM. A fast bit allocation algorithm is also developed, in which the quantizer step size is updated dynamically and adaptively according to input signal statistics. The performance of the developed techniques is verified via subjective tests as well as statistical analyses. Real-time implementations are tried for MEPG/Audio Layer II and Layer III encoders employing the proposed algorithms. The implemented systems show that the developed encoders can be as simple as decoders, but still produce bitstreams of high audio quality.

引用

页码：613 / 621

页数：9

共 50 条

[31] A low-complexity joint-coding method for mpeg-4 audio lossless coding encoders
Cho, Choong Sang
Kim, Je Woo
Choi, Byeong Ho
Kim, Dong Sun
ICIC Express Letters, 2012, 6 (07): : 1713 - 1719
[32] A New Method for Using a Psychoacoustic Model with Patchwork Audio Watermarking in DFT Domain
Tavakoli, Ehsan
Tabandeh, Mahmoud
IECON 2008: 34TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-5, PROCEEDINGS, 2008, : 1758 - 1763
[33] MPEG-1 psychoacoustic model emulation using multiscale convolutional neural networks
Kemper, Guillermo
Sanchez, Alonso
Serpa, Sergio
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (03) : 6963 - 6974
[34] MPEG-1 psychoacoustic model emulation using multiscale convolutional neural networks
Guillermo Kemper
Alonso Sanchez
Sergio Serpa
Multimedia Tools and Applications, 2024, 83 : 6963 - 6974
[35] Improved audio coding using a psychoacoustic model based on a cochlear filter bank
Baumgarte, F
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (07): : 495 - 503
[36] Fast implementation of MPEG audio coder using recursive formula with fast discrete cosine transforms
Chan, DY
Yang, JF
Fang, CC
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (02): : 144 - 148
[37] A fast frame type selection technique for very low bit rate coding using MPEG-1
Lee, J
REAL-TIME IMAGING, 1999, 5 (02) : 83 - 94
[38] An bit allocation method based rate-distortion control algorithm for MPEG-4 advanced audio coding
Wu, Sheng
Qiu, Xiaojun
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 237 - 241
[39] FAST LOW BIT RATE LATTICE ENTROPY CODING FOR SPEECH AND AUDIO CODING
Vasilache, Adriana
19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 719 - 723
[40] Interactive MPEG-4 low bit-rate speech/audio transmission over Internet
Liu, F
Kim, J
Kuo, CCJ
MULTIMEDIA SYSTEMS AND APPLICATIONS II, 1999, 3845 : 212 - 221

← 1 2 3 4 5 →