Wide-band perceptual audio coding based on frequency-domain linear prediction

被引:0
|
作者
Motlicek, Petr [1 ]
Uallal, Vijay [2 ]
Hermansky, Hynek [1 ,3 ]
机构
[1] IDIAP Res Inst, Martigny, Switzerland
[2] Int Comp Sci Inst, Berkeley, CA USA
[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
audio signal processing; data compression; linear predictive coding; Hilbert transform;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we propose an extension of the very low bit-rate speech coding technique, exploiting predictability of the temporal evolution of spectral envelopes, for wide-band audio coding applications. Temporal envelopes in critically band-sized sub-bands are estimated using frequency domain linear prediction applied on relatively long time segments. The sub-band residual signals, which play an important role in acquiring high quality reconstruction, are processed using a heterodyning-based signal analysis technique. For reconstruction, their optimal parameters are estimated using a closed-loop analysis-by-synthesis technique driven by a perceptual model emulating simultaneous masking properties of the human auditory system. We discuss the advantages of the approach and show some properties on challenging audio recordings. The proposed technique is capable of encoding high quality, variable rate audio signals on bit-rates below 1bit/sample.
引用
收藏
页码:265 / +
页数:2
相关论文
共 50 条
  • [1] Wide-Band Audio Coding Based on Frequency-Domain Linear Prediction
    Petr Motlicek
    Sriram Ganapathy
    Hynek Hermansky
    Harinath Garudadri
    EURASIP Journal on Audio, Speech, and Music Processing, 2010
  • [2] Wide-Band Audio Coding Based on Frequency-Domain Linear Prediction
    Motlicek, Petr
    Ganapathy, Sriram
    Hermansky, Hynek
    Garudadri, Harinath
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2010,
  • [3] WIDE-BAND ACOUSTOOPTIC MODULATOR FOR FREQUENCY-DOMAIN FLUOROMETRY
    PISTON, DW
    GRATTON, E
    BIOPHYSICAL JOURNAL, 1986, 49 (02) : A467 - A467
  • [4] WIDE-BAND SPEECH AND AUDIO CODING
    NOLL, P
    IEEE COMMUNICATIONS MAGAZINE, 1993, 31 (11) : 34 - 44
  • [5] CODING OF SPEECH AND WIDE-BAND AUDIO
    JAYANT, NS
    LAWRENCE, VB
    PREZAS, DP
    AT&T TECHNICAL JOURNAL, 1990, 69 (05): : 25 - 41
  • [6] Wide-Band Speech Coding Based on Bandwidth Extension and Sparse Linear Prediction
    Alipoor, Ghasem
    Savoji, Mohamad Hasan
    2012 35TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2012, : 454 - 459
  • [7] Frequency-domain lifetime prediction methods for the PoP under wide-band random vibration loading
    Xia, Jiang
    Shen, ZhongHong
    Peng, Qi
    Xu, HuaWei
    Huan, Linyi
    Liu, BinHui
    Yue, YaJiao
    Li, GuoYuan
    2018 19TH INTERNATIONAL CONFERENCE ON ELECTRONIC PACKAGING TECHNOLOGY (ICEPT), 2018, : 648 - 651
  • [8] Computationally efficient amplitude modulated sinusoidal audio coding using frequency-domain linear prediction
    Christensen, Mads Grasboll
    Jensen, Soren Holdt
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 4919 - 4922
  • [9] Progress in LPC-based frequency-domain audio coding
    Moriya, Takehiro
    Sugiura, Ryosuke
    Kamamoto, Yutaka
    Kameoka, Hirokazu
    Harada, Noboru
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2016, 5
  • [10] Nonuniform frequency sampling with active learning: Application to wide-band frequency-domain modeling and design
    Zhao, ZQ
    Ahn, CH
    Carin, L
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2005, 53 (09) : 3049 - 3057