Time-scale modification of audio signals using enhanced WSOLA with management of transients

被引:25
|
作者
Grofit, Shahaf [1 ]
Lavner, Yizhar [2 ,3 ]
机构
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
[2] Tel Hai Acad Coll, Dept Comp Sci, IL-12210 Upper Galilee, Israel
[3] Technion Israel Inst Technol, Fac Elect Engn, SIPL, IL-32000 Haifa, Israel
关键词
Mel frequency cepstrum; spectral variation; time-scale modification of audio and music signals; waveform similarity overlap-and-add (WSOLA);
D O I
10.1109/TASL.2007.909444
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present an algorithm for time-scale modification of music signals, based on the waveform similarity overlap-and-add technique (WSOLA). A well-known disadvantage of the standard WSOLA is the uniform time-scaling of the entire signal, including the perceptually significant transient sections (PSTs), where temporal envelope changes as well as significant spectral transitions occur. Time-scaling of PSTs can severely degrade the music quality. We address this problem by detecting the PSTs and leaving them intact, while time-scaling the remainder of the signal, which is relatively steady-state. In the proposed algorithm, the PSTs are detected using a Mel frequency cepstrum nonstationarity measure and the normalized cross-correlation, with time-varying threshold functions. Our study shows that the accurate detection of PSTs within the WSOLA framework makes it possible to achieve a higher quality of time-scaled music, as confirmed by subjective listening tests.
引用
收藏
页码:106 / 115
页数:10
相关论文
共 50 条
  • [21] A hybrid time-frequency domain approach to audio time-scale modification
    Dorran, D
    Lawlor, R
    Coyle, E
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2006, 54 (1-2): : 21 - 31
  • [22] Energy-based nonuniform time-scale compression of audio signals
    Chu, WC
    Lashkari, K
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2003, 49 (01) : 183 - 187
  • [23] A new frequency domain approach to time-scale expansion of audio signals
    Ferreira, AJS
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 3577 - 3580
  • [24] Localized audio watermarking technique robust against time-scale modification
    Li, W
    Xue, XY
    Lu, PZ
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2006, 8 (01) : 60 - 69
  • [25] An efficient audio time-scale modification algorithm for use in a subband implementation
    Dorran, D
    Lawlor, R
    [J]. DAFX-03: 6TH INTERNATIONAL CONFERENCE ON DIGITAL AUDIO EFFECTS, PROCEEDINGS, 2003, : 339 - 343
  • [26] FastMPEG: Time-scale modification of bit-compressed audio information
    Covell, M
    Slaney, M
    Rothstein, A
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 3261 - 3264
  • [27] Transients detection in the time-scale domain
    Bruni, V.
    Vitulano, D.
    [J]. IMAGE AND SIGNAL PROCESSING, 2008, 5099 : 254 - 262
  • [28] Improving Time-Scale Modification of Music Signals Using Harmonic-Percussive Separation
    Driedger, Jonathan
    Mueller, Meinard
    Ewert, Sebastian
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (01) : 105 - 109
  • [29] An odd-DFT based approach to time-scale expansion of audio signals
    Ferreira, AJS
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (04): : 441 - 453
  • [30] TIME-SCALE MODIFICATION OF SPEECH SIGNALS FOR SUPPORTING HEARING IMPAIRED SCHOOLCHILDREN
    Kupryjanow, Adam
    Czyzewski, Andrzej
    [J]. SPA 2009: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2009, : 159 - 162