PHAIN: Audio Inpainting via Phase-Aware Optimization With Instantaneous Frequency

Cited by: 0
Authors
Tanaka, Tomoro [1]
Yatabe, Kohei [2]
Oikawa, Yasuhiro [1]
Affiliations
[1] Waseda Univ, Dept Intermedia Art & Sci, Tokyo 168555, Japan
[2] Tokyo Univ Agr & Technol, Dept Elect Engn & Comp Sci, Tokyo 1848588, Japan
Keywords
Optimization; Energy loss; Spectrogram; Speech processing; Reliability; Time-frequency analysis; Minimization; Instantaneous frequency; phase derivative; sparsity; primal-dual splitting; discrete Gabor transform; PACKET LOSS CONCEALMENT; DISCRETE-TIME SIGNALS; REPRESENTATIONS; INTERPOLATION; SPARSITY; ALGORITHMS; MODEL
DOI
10.1109/TASLP.2024.3463415
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Audio inpainting restores locally corrupted parts of digital audio signals. Sparsity-based methods achieve this by promoting sparsity in the time-frequency (T-F) domain, assuming short-time audio segments consist of a few sinusoids. However, such sparsity promotion reduces the magnitudes of the resulting waveforms; moreover, it often ignores the temporal connections of sinusoidal components. To address these problems, we propose a novel phase-aware audio inpainting method. Our method minimizes the time variations of a particular T-F representation calculated using the time derivative of the phase. This promotes sinusoidal components that coherently fit in the corrupted parts without directly suppressing the magnitudes. Both objective and subjective experiments confirmed the superiority of the proposed method compared with state-of-the-art methods.
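The idea of measuring time variation after a phase correction can be illustrated with a small NumPy sketch. This is not the authors' algorithm: the helper names (stft, phase_corrected, time_variation), the test signal, and the finite-difference estimate of the phase advance per hop are all assumptions made for illustration, and no inpainting or optimization is performed. The sketch only shows that a stationary sinusoid has a large frame-to-frame variation in a plain STFT, while the variation nearly vanishes once the phase rotation predicted by the phase derivative is removed.

```python
import numpy as np

def stft(x, win, hop):
    """Plain STFT: rows are frequency bins, columns are time frames."""
    n_frames = 1 + (len(x) - len(win)) // hop
    frames = np.stack(
        [x[i * hop : i * hop + len(win)] * win for i in range(n_frames)],
        axis=1,
    )
    return np.fft.fft(frames, axis=0)

def time_variation(X):
    """l1 norm of the difference between neighboring time frames."""
    return np.abs(np.diff(X, axis=1)).sum()

def phase_corrected(X):
    """Remove the accumulated phase advance from every bin (illustrative).

    The phase advance per hop is estimated by a finite difference of the
    phase along time, which stands in for the instantaneous frequency.
    """
    omega = np.angle(X[:, 1:] * np.conj(X[:, :-1]))
    accumulated = np.concatenate(
        [np.zeros((X.shape[0], 1)), np.cumsum(omega, axis=1)], axis=1
    )
    return X * np.exp(-1j * accumulated)

# Hypothetical test signal: one stationary complex sinusoid.
fs, f0, n_win, hop = 16000, 440.0, 256, 64
t = np.arange(4096)
x = np.exp(2j * np.pi * f0 * t / fs)

X = stft(x, np.hanning(n_win), hop)
tv_raw = time_variation(X)                         # large: phase rotates each hop
tv_corrected = time_variation(phase_corrected(X))  # near zero: only magnitude changes remain
```

Because the correction changes only the phase, the magnitudes of the coefficients are untouched, which is consistent with the abstract's point that the penalty does not directly suppress magnitudes.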
Pages: 4471-4485
Number of pages: 15