Time-scale modification of audio signals using enhanced WSOLA with management of transients

Cited by: 25
Authors
Grofit, Shahaf [1]
Lavner, Yizhar [2, 3]
Affiliations
[1] Tel Aviv Univ, Sch Comp Sci, IL-69978 Tel Aviv, Israel
[2] Tel Hai Acad Coll, Dept Comp Sci, IL-12210 Upper Galilee, Israel
[3] Technion Israel Inst Technol, Fac Elect Engn, SIPL, IL-32000 Haifa, Israel
Keywords
Mel frequency cepstrum; spectral variation; time-scale modification of audio and music signals; waveform similarity overlap-and-add (WSOLA)
DOI
10.1109/TASL.2007.909444
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we present an algorithm for time-scale modification of music signals, based on the waveform similarity overlap-and-add technique (WSOLA). A well-known disadvantage of the standard WSOLA is the uniform time-scaling of the entire signal, including the perceptually significant transient sections (PSTs), where temporal envelope changes as well as significant spectral transitions occur. Time-scaling of PSTs can severely degrade the music quality. We address this problem by detecting the PSTs and leaving them intact, while time-scaling the remainder of the signal, which is relatively steady-state. In the proposed algorithm, the PSTs are detected using a Mel frequency cepstrum nonstationarity measure and the normalized cross-correlation, with time-varying threshold functions. Our study shows that the accurate detection of PSTs within the WSOLA framework makes it possible to achieve a higher quality of time-scaled music, as confirmed by subjective listening tests.
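As a rough illustration of the standard WSOLA baseline that the abstract refers to, the following Python sketch time-scales a signal uniformly, choosing each input segment by normalized cross-correlation against the natural continuation of the previously copied segment. It is a minimal sketch under stated assumptions and not the authors' algorithm: the function names `best_offset` and `wsola`, the frame length, the 50% overlap, and the search tolerance are all illustrative choices, and the paper's detection of perceptually significant transient sections (Mel frequency cepstrum nonstationarity measure with time-varying thresholds) is not reproduced here.

```python
import numpy as np

def best_offset(region, target):
    """Return the lag in `region` whose segment best matches `target`
    under normalized cross-correlation, together with that score."""
    cc = np.correlate(region, target, mode="valid")
    # Sliding energy of every candidate segment, via a cumulative sum.
    csum = np.concatenate([[0.0], np.cumsum(region ** 2)])
    energy = csum[len(target):] - csum[:-len(target)]
    ncc = cc / np.sqrt(energy * np.dot(target, target) + 1e-12)
    lag = int(np.argmax(ncc))
    return lag, float(ncc[lag])

def wsola(x, alpha, frame=1024, tol=256):
    """Uniform WSOLA time-scaling (illustrative parameters).
    alpha > 1 lengthens the signal, alpha < 1 shortens it."""
    x = np.asarray(x, dtype=float)
    hop = frame // 2                        # synthesis hop (50% overlap)
    win = np.hanning(frame + 1)[:frame]     # periodic Hann, overlaps sum to 1
    n_frames = max(1, int((len(x) * alpha - frame) // hop) + 1)
    out = np.zeros((n_frames - 1) * hop + frame)
    # Zero-pad so that every search region stays inside the array.
    xp = np.concatenate([np.zeros(tol), x, np.zeros(frame + hop + tol)])
    prev = tol                              # index in xp of the last copied segment
    for k in range(n_frames):
        ideal = int(round(k * hop / alpha))  # nominal input position (x coordinates)
        if k == 0:
            start = ideal + tol
        else:
            # Segment that would follow the previous one in the input.
            target = xp[prev + hop: prev + hop + frame]
            # Candidates within +/- tol samples of the nominal position.
            region = xp[ideal: ideal + frame + 2 * tol]
            lag, _ = best_offset(region, target)
            start = ideal + lag
        out[k * hop: k * hop + frame] += win * xp[start: start + frame]
        prev = start
    return out
```

For example, `wsola(x, 1.5)` returns a signal roughly 1.5 times as long as `x` with its pitch unchanged. In this baseline every segment is scaled by the same factor, which is the behavior the abstract identifies as a drawback; a transient-aware variant would instead copy the detected transient sections unmodified and apply the scaling only to the remaining, relatively steady-state parts.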
Pages: 106-115
Number of pages: 10