PHAIN: Audio Inpainting via Phase-Aware Optimization With Instantaneous Frequency

被引:0
|
作者
Tanaka, Tomoro [1 ]
Yatabe, Kohei [2 ]
Oikawa, Yasuhiro [1 ]
机构
[1] Waseda Univ, Dept Intermedia Art & Sci, Tokyo 168555, Japan
[2] Tokyo Univ Agr & Technol, Dept Elect Engn & Comp Sci, Tokyo 1848588, Japan
关键词
Optimization; Energy loss; Spectrogram; Speech processing; Reliability; Time-frequency analysis; Minimization; Instantaneous frequency; phase derivative; sparsity; primal-dual splitting; discrete Gabor transform; PACKET LOSS CONCEALMENT; DISCRETE-TIME SIGNALS; REPRESENTATIONS; INTERPOLATION; SPARSITY; ALGORITHMS; MODEL;
D O I
10.1109/TASLP.2024.3463415
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Audio inpainting restores locally corrupted parts of digital audio signals. Sparsity-based methods achieve this by promoting sparsity in the time-frequency (T-F) domain, assuming short-time audio segments consist of a few sinusoids. However, such sparsity promotion reduces the magnitudes of the resulting waveforms; moreover, it often ignores the temporal connections of sinusoidal components. To address these problems, we propose a novel phase-aware audio inpainting method. Our method minimizes the time variations of a particular T-F representation calculated using the time derivative of the phase. This promotes sinusoidal components that coherently fit in the corrupted parts without directly suppressing the magnitudes. Both objective and subjective experiments confirmed the superiority of the proposed method compared with state-of-the-art methods.
引用
收藏
页码:4471 / 4485
页数:15
相关论文
共 16 条
  • [1] Phase-aware Audio Inpainting Based on Instantaneous Frequency
    Tanaka, Tomoro
    Yatabe, Kohei
    Oikawa, Yasuhiro
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 254 - 258
  • [2] Phase-Aware Optimization in Approximate Computing
    Mitra, Subrata
    Gupta, Manish K.
    Misailovic, Sasa
    Bagchi, Saurabh
    CGO'17: PROCEEDINGS OF THE 2017 INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2017, : 185 - 196
  • [3] Phase-Aware Transformations in Variational Autoencoders for Audio Effects
    Cámara, Mateo
    Blanco, José Luis
    AES: Journal of the Audio Engineering Society, 2022, 70 (09): : 731 - 741
  • [4] PHASE-AWARE HARMONIC/PERCUSSIVE SOURCE SEPARATION VIA CONVEX OPTIMIZATION
    Masuyama, Yoshiki
    Yatabe, Kohei
    Oikawa, Yasuhiro
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 985 - 989
  • [5] MAXIMALLY ENERGY-CONCENTRATED DIFFERENTIAL WINDOW FOR PHASE-AWARE SIGNAL PROCESSING USING INSTANTANEOUS FREQUENCY
    Kusano, Tsubasa
    Yatabe, Kohei
    Oikawa, Yasuhiro
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 5825 - 5829
  • [6] Complex ISNMF: A Phase-Aware Model for Monaural Audio Source Separation
    Magron, Paul
    Virtanen, Tuomas
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 20 - 31
  • [7] Phase-Aware Audio Super-resolution for Music Signals Using Wasserstein Generative Adversarial Network
    Yan, Yanqiao
    Binh Thien Nguyen
    Geng, Yuting
    Iwai, Kenta
    Nishiura, Takanobu
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1673 - 1677
  • [8] LOW-RANKNESS OF COMPLEX-VALUED SPECTROGRAM AND ITS APPLICATION TO PHASE-AWARE AUDIO PROCESSING
    Masuyama, Yoshiki
    Yatabe, Kohei
    Oikawa, Yasuhiro
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 855 - 859
  • [9] COORDINATION OF POWER-FREQUENCY REGULATION VIA INSTANTANEOUS OPTIMIZATION
    SCHELLST.G
    PERFUMI, A
    ELETTROTECNICA, 1974, 61 (07): : 672 - 672
  • [10] UCRLF: unified constrained reinforcement learning framework for phase-aware architectures for autonomous vehicle signaling and trajectory optimization
    Chiranjib Sur
    Evolutionary Intelligence, 2019, 12 : 689 - 712