Low power MPEG/Audio encoders using simplified psychoacoustic model and fast bit allocation

Cited by: 9
Authors
Oh, HO
Kim, JS
Song, CJ
Park, YC
Youn, DH
Affiliations
[1] Yonsei Univ, Dept Elect & Elect Engn, ASSP Lab, Sudaemoon Ku, Seoul 120749, South Korea
[2] Yonsei Univ, Ctr Signal Proc Res, ASSP Lab, Sudaemoon Ku, Seoul 120749, South Korea
DOI
10.1109/30.964154
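Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification code
0808; 0809
Abstract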
In this paper, we propose novel techniques for implementing MPEG/Audio (Layer II and Layer III) encoders with minimum complexity. To reduce complexity, the ISO psychoacoustic model (PAM), which often demands significant computational power from the implementation system, is simplified. The simplification follows the statistical behavior of the PAM. A fast bit allocation algorithm is also developed, in which the quantizer step size is updated dynamically and adaptively according to input signal statistics. The performance of the developed techniques is verified through subjective tests as well as statistical analyses. Real-time implementations of MPEG/Audio Layer II and Layer III encoders employing the proposed algorithms are also attempted. The implemented systems show that the developed encoders can be as simple as decoders while still producing bitstreams of high audio quality.
Pages: 613-621
Page count: 9