Improved phase vocoder time-scale modification of audio

Cited by: 138
Authors
Laroche, J [1]
Dolson, M [1]
Affiliation
[1] Joint Emu Creat Technol Ctr, Scotts Valley, CA 95067 USA
Source
Keywords
phase coherence; phase vocoder; pitch shifting; short time Fourier transform; time scaling
DOI
10.1109/89.759041
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
The phase vocoder is a well-established tool for time scaling and pitch shifting speech and audio signals via modification of their short-time Fourier transforms (STFTs). In contrast to time-domain time-scaling and pitch-shifting techniques, the phase vocoder is generally considered to yield high-quality results, especially for large modification factors and/or polyphonic signals. However, the phase vocoder is also known for introducing a characteristic perceptual artifact, often described as "phasiness," "reverberation," or "loss of presence." This paper examines the problem of phasiness in the context of time-scale modification and provides new insights into its causes. Two extensions to the standard phase vocoder algorithm are introduced, and the resulting sound quality is shown to be significantly improved. Moreover, the modified phase vocoder is shown to provide a factor-of-two decrease in computational cost.
Pages: 323-332
Page count: 10
Related papers
50 in total
  • [21] Time-scale invariant audio data embedding
    Mansour, MF
    Tewfik, AH
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (10) : 993 - 1000
  • [22] Speech Time-Scale Modification With GANs
    Cohen, Eyal
    Kreuk, Felix
    Keshet, Joseph
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1067 - 1071
  • [23] Time-scale modification of speech signals
    Ninness, Brett
    Henriksen, Soren John
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2008, 56 (04) : 1479 - 1488
  • [24] Frequency Dependent Time-Scale Modification
    Roberts, Timothy
    Paliwal, Kuldip K.
    [J]. 2018 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2018
  • [25] Time-scale modification of music signals
    Grofit, S
    Lavner, Y
    [J]. 22ND CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, PROCEEDINGS, 2002 : 254 - 256
  • [26] A Review of Time-Scale Modification of Music Signals
    Driedger, Jonathan
    Mueller, Meinard
    [J]. APPLIED SCIENCES-BASEL, 2016, 6 (02)
  • [27] TIME-SCALE MODIFICATION OF AUDIO SIGNALS USING MULTI-RELATIVE ONSET TIME ESTIMATIONS IN SINUSOIDAL TRANSFORM CODING
    Kim, Jonathan
    Clements, Mark
    [J]. 2010 CONFERENCE RECORD OF THE FORTY FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2010 : 558 - 561
  • [28] BLIND AUDIO SEPARATION AND CONTENT ANALYSIS IN THE TIME-SCALE DOMAIN
    Jbari, Atman
    Adib, Abdellah
    Aboutajdine, Driss
    [J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2007, 1 (03) : 307 - 318
  • [29] Time-Scale Atoms Chains for Transients Detection in Audio Signals
    Bruni, Vittoria
    Marconi, Silvia
    Vitulano, Domenico
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (03) : 420 - 433
  • [30] A time-scale modification dataset with subjective quality labels
    Roberts, Timothy
    Paliwal, Kuldip K.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 148 (01) : 201 - 210