Exploring Audio Compression as Image Completion in Time-Frequency Domain

Cited by: 0

Authors:

Scodeller, Giovanni [1]

Pistellato, Mara [1]

Bergamasco, Filippo [1]

Affiliations:

[1] Univ Ca' Foscari Venezia, DAIS, 155 Via Torino, Venice, Italy

Source:

IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT II | 2023 / Vol. 14234

Keywords:

Audio compression; CNN; Sparse convolutions; Spectrogram; Genetic algorithm

DOI:

10.1007/978-3-031-43153-1_37

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Audio compression is usually achieved with algorithms that exploit spectral properties of the given signal such as frequency or temporal masking. In this paper we propose to tackle such a problem from a different point of view, considering the time-frequency domain of an audio signal as an intensity map to be reconstructed via a data-driven approach. The compression stage removes some selected input values from the time-frequency representation of the original signal. Then, decompression works by reconstructing the missing samples as an image completion task. Our method is divided into two main parts: first, we analyse the feasibility of a data-driven audio reconstruction with missing samples in its time-frequency representation. To do so, we exploit an existing CNN model designed for depth completion, involving a sequence of sparse convolutions to deal with absent values. Second, we propose a method to select the values to be removed at compression stage, maximizing the perceived audio quality of the decompressed signal. In the experimental section we validate the proposed technique on some standard audio datasets and provide an extensive study on the quality of the reconstructed signal under different conditions.
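
The compress-then-complete idea in the abstract can be sketched in a few lines. The following is a minimal illustration, not the authors' method: it drops bins at random instead of using the paper's quality-driven selection, and it fills the gaps with a plain normalized average of valid neighbours rather than a trained CNN with sparse convolutions. All function names and parameters (`stft_magnitude`, `drop_values`, `normalized_fill`, `keep_ratio`, `radius`) are hypothetical and chosen for this example only.

```python
import numpy as np

def stft_magnitude(signal, frame_len=256, hop=128):
    """Magnitude spectrogram from a Hann-windowed short-time Fourier transform."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1)).T  # shape: (freq_bins, time_frames)

def drop_values(spec, keep_ratio, rng):
    """Stand-in for the compression stage: keep a random subset of bins.

    Assumption: random selection. The paper proposes a dedicated selection method.
    Returns the sparse spectrogram and a mask that is 1 where a bin was kept.
    """
    mask = (rng.random(spec.shape) < keep_ratio).astype(spec.dtype)
    return spec * mask, mask

def box_sum(x, radius):
    """Sum over a (2*radius+1) x (2*radius+1) neighbourhood with zero padding."""
    padded = np.pad(x, radius)
    out = np.zeros_like(x)
    size = 2 * radius + 1
    for dy in range(size):
        for dx in range(size):
            out += padded[dy : dy + x.shape[0], dx : dx + x.shape[1]]
    return out

def normalized_fill(sparse, mask, radius=2, eps=1e-8):
    """Stand-in for the decompression stage: fill each missing bin with the
    mean of the valid bins around it, leaving kept bins untouched."""
    estimate = box_sum(sparse * mask, radius) / (box_sum(mask, radius) + eps)
    return np.where(mask > 0, sparse, estimate)

# Toy input: one second of a 440 Hz tone sampled at 16 kHz.
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
signal = np.sin(2 * np.pi * 440.0 * t)

spec = stft_magnitude(signal)
sparse, mask = drop_values(spec, keep_ratio=0.5, rng=rng)
restored = normalized_fill(sparse, mask)

# Mean squared error against the full spectrogram, before and after filling.
err_sparse = np.mean((spec - sparse) ** 2)
err_restored = np.mean((spec - restored) ** 2)
```

Dividing the filtered values by the filtered mask is what keeps absent bins from dragging the estimate toward zero. This shows the general principle of mask-aware filtering on a hand-written averaging filter; it makes no claim about the architecture used in the paper.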


Pages: 443-455

Number of pages: 13

50 items in total

[1] Time-frequency algorithm of audio signal compression
Rabinovich, E. V.
Shekhirev, A. V.
APEIE-2006 8TH INTERNATIONAL CONFERENCE ON ACTUAL PROBLEMS OF ELECTRONIC INSTRUMENT ENGINEERING PROCEEDINGS, VOL 1, 2006: 147+
[2] EVALUATION OF AUDIO COMPANDORS IN THE TIME-FREQUENCY DOMAIN
SKRITEK, P
HLAWATSCH, F
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1986, 34 (05): 386
[3] Time-frequency domain fast audio transcoding
Ju, Fu-Shing
Fang, Ce-Min
ISM 2006: EIGHTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2006: 750-753
[4] Audio watermarking using time-frequency compression expansion
Wei, FS
Mun, HS
Mei, NL
2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 3, PROCEEDINGS, 2004: 201-204
[5] Audio Fingerprint Extraction Based on Time-Frequency Domain
Liu, Zhengzheng
Li, Cong
Cao, Sanxing
2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016: 1975-1979
[6] Janssen 2.0: Audio Inpainting in the Time-frequency Domain
Dept. of Telecommunications, Brno University of Technology, Czech Republic
arXiv,
[7] Music Files Compression Based on Time-Frequency Representation of Audio Signal
Shekhirev, Andrew V.
Rabinovich, Evgeniy V.
IFOST 2008: PROCEEDING OF THE THIRD INTERNATIONAL FORUM ON STRATEGIC TECHNOLOGIES, 2008: 340-342
[8] A hybrid time-frequency domain approach to audio time-scale modification
Dorran, D
Lawlor, R
Coyle, E
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2006, 54 (1-2): 21-31
[9] A hybrid time-frequency domain approach to audio time-scale modification
Dorran, David
Lawlor, Robert
Coyle, Eugene
AES: Journal of the Audio Engineering Society, 1600, 54 (1-2): 21-31
[10] A Robust Image Watermarking in the Joint Time-Frequency Domain
Ozturk, Mahmut
Akan, Aydin
Cekic, Yalcin
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010
