Real-time signal estimation from modified short-time Fourier transform magnitude spectra

被引:85
|
作者
Zhu, Xinglei [1 ]
Beauregard, Gerald T.
Wyse, Lonce L.
机构
[1] Inst Infocomm Res, Media Understanding Dept, Singapore 119613, Singapore
[2] Natl Univ Singapore, Fac Arts & Soc Sci, Singapore 119077, Singapore
关键词
magnitude-only reconstruction; real-time systems; signal estimation; spectrogram inversion; time-scale modification (TSM);
D O I
10.1109/TASL.2007.899236
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
An algorithm for estimating signals from short-time magnitude spectra is introduced offering a significant improvement in quality and efficiency over current methods. The key issue is how to invert a sequence of overlapping magnitude spectra (a "spectrogram") containing no phase information to generate a real-valued signal free of audible artifacts. Also important is that the algorithm performs in real-time, both structurally and computationally. In the context of spectrogram inversion, structurally real-time means that the audio signal at any given point in time only depends on transform frames at local or prior points in time. Computationally, real-time means that the algorithm is efficient enough to run in less time than the reconstructed audio takes to play on the available hardware. The spectrogram inversion algorithm is parameterized to allow tradeoffs between computational demands and the quality of the signal reconstruction. The algorithm is applied to audio time-scale and pitch modification and compared to classical algorithms for these tasks on a variety of signal types including both monophonic and polyphonic audio signals such as speech and music.
引用
收藏
页码:1645 / 1653
页数:9
相关论文
共 50 条
  • [1] SIGNAL ESTIMATION FROM MODIFIED SHORT-TIME FOURIER-TRANSFORM
    GRIFFIN, DW
    LIM, JS
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (02): : 236 - 243
  • [2] SIGNAL RECONSTRUCTION FROM SHORT-TIME FOURIER-TRANSFORM MAGNITUDE
    NAWAB, SH
    QUATIERI, TF
    LIM, JS
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1983, 31 (04): : 986 - 998
  • [3] An architectural comparison of signal reconstruction algorithms from short-time Fourier transform magnitude spectra
    Chami, Mouhcine
    Immassi, Maryem
    Di Martino, Joseph
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (03) : 433 - 441
  • [4] An Incremental Algorithm for Signal Reconstruction from Short-Time Fourier Transform Magnitude
    Bouvrie, Jake
    Ezzat, Tony
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2510 - +
  • [5] Iterative reconstruction of speech from short-time Fourier transform phase and magnitude spectra
    Alsteris, Leigh D.
    Paliwal, Kuldip K.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2007, 21 (01): : 174 - 186
  • [6] RECOVERING SIGNALS FROM THE SHORT-TIME FOURIER TRANSFORM MAGNITUDE
    Jaganathan, Kishore
    Eldar, Yonina C.
    Hassibi, Babak
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 3277 - 3281
  • [7] FREQUENCY SAMPLING OF THE SHORT-TIME FOURIER-TRANSFORM MAGNITUDE FOR SIGNAL RECONSTRUCTION
    QUATIERI, TF
    NAWAB, SH
    LIM, JS
    [J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA, 1983, 73 (11) : 1523 - 1526
  • [9] MODIFIED SHORT-TIME FOURIER-TRANSFORM
    WANG, MS
    BAO, Z
    [J]. OPTICAL ENGINEERING, 1995, 34 (05) : 1333 - 1337
  • [10] On Phase-Magnitude Relationships in the Short-Time Fourier Transform
    Auger, Francois
    Chassande-Mottin, Eric
    Flandrin, Patrick
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2012, 19 (05) : 267 - 270