AM-FM Estimation for Speech Based on a Time-Varying Sinusoidal Model

Cited by: 0
Authors
Pantazis, Yannis
Rosec, Olivier
Stylianou, Yannis
Keywords
Sinusoidal modeling; AM-FM demodulation; Speech analysis; Speech reconstruction
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we present a method based on a time-varying sinusoidal model for robust and accurate estimation of amplitude and frequency modulations (AM-FM) in speech. The suggested approach has two main steps. First, speech is represented by a sinusoidal model with time-varying amplitudes. Specifically, the model uses a first-order time polynomial with complex coefficients to capture the instantaneous amplitude and frequency (phase) components. Next, the model parameters are updated using the previously estimated instantaneous phase information. The result is an iterative scheme for AM-FM decomposition of speech. The scheme was validated on synthetic AM-FM signals and tested on the reconstruction of voiced speech signals, with the signal-to-error reconstruction ratio (SERR) as the measure. Compared to the standard sinusoidal representation, the suggested approach was found to improve the corresponding SERR by 47%, resulting in over 30 dB of SERR.
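As a rough illustration of the first step only, the sketch below fits a sinusoidal model whose complex amplitude is a first-order polynomial in time, and compares it with a constant-amplitude fit using SERR. It is not the authors' implementation. The function names, the least-squares formulation with known frequencies, the SERR definition (20·log10 of the ratio of signal to residual standard deviation), and the synthetic test signal are all assumptions made for the example. The iterative phase-based update described in the abstract is not reproduced.

```python
import numpy as np

def fit_sinusoids(x, t, freqs_hz, order=1):
    """Least-squares fit of x(t) ~ Re{ sum_k (a_k + t*b_k) * exp(j*2*pi*f_k*t) }.

    order=0 drops the b_k terms (constant-amplitude sinusoids).
    Frequencies are assumed known; this is an illustrative simplification.
    """
    t = np.asarray(t, dtype=float)
    E = np.exp(2j * np.pi * np.outer(t, freqs_hz))       # shape (N, K)
    K = E.shape[1]
    B = E if order == 0 else np.hstack([E, t[:, None] * E])
    # For real x: Re{B c} = B.real @ c.real - B.imag @ c.imag
    A = np.hstack([B.real, -B.imag])
    coef, *_ = np.linalg.lstsq(A, x, rcond=None)
    half = B.shape[1]
    c = coef[:half] + 1j * coef[half:]                   # complex coefficients
    a = c[:K]
    b = c[K:] if order == 1 else np.zeros(K, dtype=complex)
    return a, b, A @ coef                                # a_k, b_k, reconstruction

def serr_db(x, x_hat):
    """Assumed SERR definition: 20*log10(std(x) / std(x - x_hat))."""
    return 20.0 * np.log10(np.std(x) / np.std(x - x_hat))

# Synthetic AM test signal (hypothetical values): amplitude grows linearly in time.
fs = 16000.0
t = np.arange(-160, 161) / fs                            # 20 ms frame centred at t = 0
x = (1.0 + 20.0 * t) * np.cos(2 * np.pi * 200.0 * t + 0.3)

_, _, x_const = fit_sinusoids(x, t, [200.0], order=0)
a, b, x_lin = fit_sinusoids(x, t, [200.0], order=1)
serr_const = serr_db(x, x_const)
serr_lin = serr_db(x, x_lin)
print(f"constant amplitude: {serr_const:.1f} dB, first-order amplitude: {serr_lin:.1f} dB")
```

In this sketch the instantaneous amplitude of component k would be read off as |a_k + t·b_k|. Because the synthetic signal matches the first-order model exactly, the gap between the two SERR values is much larger here than anything to be expected on real speech.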
Pages: 112-115
Page count: 4