Wide-band perceptual audio coding based on frequency-domain linear prediction

被引:0
|
作者
Motlicek, Petr [1 ]
Uallal, Vijay [2 ]
Hermansky, Hynek [1 ,3 ]
机构
[1] IDIAP Res Inst, Martigny, Switzerland
[2] Int Comp Sci Inst, Berkeley, CA USA
[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
关键词
audio signal processing; data compression; linear predictive coding; Hilbert transform;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we propose an extension of the very low bit-rate speech coding technique, exploiting predictability of the temporal evolution of spectral envelopes, for wide-band audio coding applications. Temporal envelopes in critically band-sized sub-bands are estimated using frequency domain linear prediction applied on relatively long time segments. The sub-band residual signals, which play an important role in acquiring high quality reconstruction, are processed using a heterodyning-based signal analysis technique. For reconstruction, their optimal parameters are estimated using a closed-loop analysis-by-synthesis technique driven by a perceptual model emulating simultaneous masking properties of the human auditory system. We discuss the advantages of the approach and show some properties on challenging audio recordings. The proposed technique is capable of encoding high quality, variable rate audio signals on bit-rates below 1bit/sample.
引用
收藏
页码:265 / +
页数:2
相关论文
共 50 条
  • [31] Single-Mode-Based Unified Speech and Audio Coding by Extending the Linear Prediction Domain Coding Mode
    Beack, Seungkwon
    Seong, Jongmo
    Lee, Misuk
    Lee, Taejin
    ETRI JOURNAL, 2017, 39 (03) : 310 - 318
  • [32] INTEGRATED LINEAR WIDE-BAND AMPLIFIERS
    WEINERTH, H
    ELEKTROTECHNISCHE ZEITSCHRIFT B-AUSGABE, 1970, 22 (7-8): : 142 - &
  • [33] Optimization of wide-band linear arrays
    Cardone, G
    Cincotti, G
    Gori, P
    Pappalardo, M
    IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2001, 48 (04) : 943 - 952
  • [34] WIDE-BAND OPTICAL FREQUENCY TRANSLATION
    KERR, JR
    PROCEEDINGS OF THE INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS, 1965, 53 (05): : 496 - &
  • [35] WIDE-BAND FREQUENCY-CONVERTER
    MELIKHOV, SV
    TITOV, AA
    INSTRUMENTS AND EXPERIMENTAL TECHNIQUES, 1989, 32 (05) : 1174 - 1175
  • [36] HIERARCHICAL MULTI-CHANNEL AUDIO CODING BASED ON TIME-DOMAIN LINEAR PREDICTION
    Schaefer, Magnus
    Vary, Peter
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2148 - 2152
  • [37] FREQUENCY-DOMAIN TECHNIQUES FOR SPEECH CODING
    CROCHIERE, RE
    TRIBOLET, JM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 : S139 - S139
  • [38] Joint speech/audio coding based scalable perceptual audio coding
    Gao, Li
    Hu, Ruimin
    Yang, Yuhong
    2014 IEEE/ACIS 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2014, : 419 - 424
  • [39] A RADIOMETER EMPLOYING FREQUENCY-DOMAIN CODING
    AITKEN, GJM
    MILLS, RJ
    RADIO SCIENCE, 1970, 5 (03) : 535 - &
  • [40] FREQUENCY-DOMAIN TECHNIQUES FOR SPEECH CODING
    CROCHIERE, RE
    TRIBOLET, JM
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 66 (06): : 1642 - 1646