Wide-band perceptual audio coding based on frequency-domain linear prediction

Citations: 0
Authors
Motlicek, Petr [1 ]
Ullal, Vijay [2 ]
Hermansky, Hynek [1 ,3 ]
Affiliations
[1] IDIAP Res Inst, Martigny, Switzerland
[2] Int Comp Sci Inst, Berkeley, CA USA
[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
Keywords
audio signal processing; data compression; linear predictive coding; Hilbert transform
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
In this paper we propose an extension of a very low bit-rate speech coding technique, which exploits the predictability of the temporal evolution of spectral envelopes, to wide-band audio coding applications. Temporal envelopes in critical-band-sized sub-bands are estimated using frequency-domain linear prediction applied to relatively long time segments. The sub-band residual signals, which play an important role in achieving high-quality reconstruction, are processed using a heterodyning-based signal analysis technique. For reconstruction, their optimal parameters are estimated using a closed-loop analysis-by-synthesis technique driven by a perceptual model emulating the simultaneous masking properties of the human auditory system. We discuss the advantages of the approach and show some of its properties on challenging audio recordings. The proposed technique is capable of encoding high-quality, variable-rate audio signals at bit-rates below 1 bit/sample.
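As an illustration of the envelope-estimation step named in the abstract, the following is a minimal sketch of frequency-domain linear prediction, not the authors' implementation. It assumes a DCT-II of one full-band segment and the autocorrelation method of linear prediction; the function names (`dct_ii`, `lpc_autocorr`, `fdlp_envelope`), the model order, and the small regularization term are all hypothetical choices made for the example. The sub-band decomposition, residual coding, and perceptual model described above are omitted.

```python
import numpy as np

def dct_ii(x):
    """Unnormalized DCT-II computed with an explicit cosine basis (fine for short segments)."""
    n = len(x)
    k = np.arange(n)[:, None]          # frequency index
    t = np.arange(n)[None, :]          # time index
    basis = np.cos(np.pi * k * (2 * t + 1) / (2 * n))
    return basis @ x

def lpc_autocorr(c, order):
    """Autocorrelation-method linear prediction on the sequence c.

    Returns the polynomial a = [1, a1, ..., ap] and the prediction error energy.
    """
    r = np.array([np.dot(c[:len(c) - lag], c[lag:]) for lag in range(order + 1)])
    # Toeplitz normal equations; the tiny diagonal load is an assumption
    # added here only for numerical stability.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R = R + 1e-9 * r[0] * np.eye(order)
    w = np.linalg.solve(R, r[1:order + 1])
    err = r[0] - np.dot(w, r[1:order + 1])
    return np.concatenate(([1.0], -w)), err

def fdlp_envelope(x, order=20):
    """All-pole estimate of the squared temporal envelope of x (up to a scale factor)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    c = dct_ii(x)                      # prediction runs along frequency, not time
    a, err = lpc_autocorr(c, order)
    # Angles 0..pi of the all-pole model map onto time samples 0..n-1.
    theta = np.pi * (np.arange(n) + 0.5) / n
    A = np.exp(-1j * np.outer(theta, np.arange(order + 1))) @ a
    return err / np.abs(A) ** 2
```

Calling `fdlp_envelope(segment, order=20)` on a segment containing a short burst should yield a smooth curve that peaks near the burst. A higher `order` follows finer temporal detail at the cost of more parameters to transmit.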
Pages: 265 / +
Page count: 2
Related papers
50 in total
  • [21] Representations of the Complex-Valued Frequency-Domain LPC for Audio Coding
    Jo, Byeongho
    Beack, Seungkwon
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 361 - 365
  • [22] Frequency-domain instrumental variable based method for wide band system identification
    Gilson, Marion
    Welsh, James S.
    Garnier, Hugues
    2013 AMERICAN CONTROL CONFERENCE (ACC), 2013, : 1663 - 1668
  • [23] Frequency Band Selection Exited Linear Prediction Wideband Speech/Audio Coding Using SBR
    Jang, Sunghoon
    Lee, Insung
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2013, 32 (06): : 556 - 562
  • [24] Hardware-Efficient and Wide-Band Frequency-Domain Energy Detector for Cognitive-Radio Wireless Network
    Murty, Mahesh S.
    Shrestha, Rahul
    2018 31ST INTERNATIONAL CONFERENCE ON VLSI DESIGN AND 2018 17TH INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS (VLSID & ES), 2018, : 277 - 282
  • [25] Shape Control of Discrete Generalized Gaussian Distributions for Frequency-Domain Audio Coding
    Sugiura, Ryosuke
    Kamamoto, Yutaka
    Moriya, Takehiro
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2234 - 2248
  • [26] FREQUENCY-DOMAIN CODING OF SPEECH
    TRIBOLET, JM
    CROCHIERE, RE
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (05): : 512 - 530
  • [27] Wide-band dereverberation method based on multichannel linear prediction using prewhitening filter
    Okamoto, Takuma
    Iwaya, Yukio
    Suzuki, Yoiti
    APPLIED ACOUSTICS, 2012, 73 (01) : 50 - 55
  • [28] Wideband speech and audio coding in the perceptual domain
    Lin, L
    Ambikairajah, E
    Holmes, WH
    ADVANCED SIGNAL PROCESSING FOR COMMUNICATION SYSTEMS, 2002, 703 : 15 - 30
  • [29] A frequency domain approach for wide-band non-Gaussian process
    Kim, H. -J.
    Jang, B. -S.
    TRENDS IN THE ANALYSIS AND DESIGN OF MARINE STRUCTURES, 2019, 2 : 79 - 86
  • [30] Filtering, smoothing and prediction for wide-band noise driven linear systems
    Bashirov, AE
    Etikan, H
    Semi, N
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 1997, 334B (04): : 667 - 683