Exploring Audio Compression as Image Completion in Time-Frequency Domain

被引:0
|
作者
Scodeller, Giovanni [1 ]
Pistellato, Mara [1 ]
Bergamasco, Filippo [1 ]
机构
[1] Univ CaFoscari Venezia, DAIS, 155 Via Torino, Venice, Italy
关键词
Audio compression; CNN; Sparse convolutions; Spectrogram; genetic algorithm;
D O I
10.1007/978-3-031-43153-1_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Audio compression is usually achieved with algorithms that exploit spectral properties of the given signal such as frequency or temporal masking. In this paper we propose to tackle such a problem from a different point of view, considering the time-frequency domain of an audio signal as an intensity map to be reconstructed via a data-driven approach. The compression stage removes some selected input values from the time-frequency representation of the original signal. Then, decompression works by reconstructing the missing samples as an image completion task. Our method is divided into two main parts: first, we analyse the feasibility of a data-driven audio reconstruction with missing samples in its time-frequency representation. To do so, we exploit an existing CNN model designed for depth completion, involving a sequence of sparse convolutions to deal with absent values. Second, we propose a method to select the values to be removed at compression stage, maximizing the perceived audio quality of the decompressed signal. In the experimental section we validate the proposed technique on some standard audio datasets and provide an extensive study on the quality of the reconstructed signal under different conditions.
引用
收藏
页码:443 / 455
页数:13
相关论文
共 50 条
  • [1] Time-frequency algorithm of audio signal compression
    Rabinovich, E. V.
    Shekhirev, A. V.
    APEIE-2006 8TH INTERNATIONAL CONFERENCE ON ACTUAL PROBLEMS OF ELECTRONIC INSTRUMENT ENGINEERING PROCEEDINGS, VOL 1, 2006, : 147 - +
  • [2] EVALUATION OF AUDIO COMPANDORS IN THE TIME-FREQUENCY DOMAIN
    SKRITEK, P
    HLAWATSCH, F
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1986, 34 (05): : 386 - 386
  • [3] Time-frequency domain fast audio transcoding
    Ju, Fu-Shing
    Fang, Ce-Min
    ISM 2006: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2006, : 750 - 753
  • [4] Audio watermarking using time-frequency compression expansion
    Wei, FS
    Mun, HS
    Mei, NL
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 3, PROCEEDINGS, 2004, : 201 - 204
  • [5] Audio Fingerprint Extraction Based on Time-Frequency Domain
    Liu, Zhengzheng
    Li, Cong
    Cao, Sanxing
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1975 - 1979
  • [6] Janssen 2.0: Audio Inpainting in the Time-frequency Domain
    Dept. of Telecommunications, Brno University of Technology, Czech Republic
    arXiv,
  • [7] Music Files Compression Based on Time-Frequency Representation of Audio Signal
    Shekhirev, Andrew V.
    Rabinovich, Evgeniy V.
    IFOST 2008: PROCEEDING OF THE THIRD INTERNATIONAL FORUM ON STRATEGIC TECHNOLOGIES, 2008, : 340 - 342
  • [8] A hybrid time-frequency domain approach to audio time-scale modification
    Dorran, D
    Lawlor, R
    Coyle, E
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2006, 54 (1-2): : 21 - 31
  • [9] A hybrid time-frequency domain approach to audio time-scale modification
    Dorran, David
    Lawlor, Robert
    Coyle, Eugene
    AES: Journal of the Audio Engineering Society, 1600, 54 (1-2): : 21 - 31
  • [10] A Robust Image Watermarking in the Joint Time-Frequency Domain
    Ozturk, Mahmut
    Akan, Aydin
    Cekic, Yalcin
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,