AM-FM Estimation for Speech Based on a Time-Varying Sinusoidal Model

被引:0
|
作者
Pantazis, Yannis
Rosec, Olivier
Stylianou, Yannis
机构
关键词
Sinusoidal modeling; AM-FM demodulation; Speech analysis; Speech reconstruction;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a method based on a time-varying sinusoidal model for a robust and accurate estimation of amplitude and frequency modulations (AM-FM) in speech. The suggested approach has two main steps. First, speech is modeled as a sinusoidal model with time-varying amplitudes. Specifically, the model makes use of a first order time polynomial with complex coefficients for capturing instantaneous amplitude and frequency (phase) components. Next, the model parameters are updated by using the previously estimated instantaneous phase information. Thus, an iterative scheme for AM-FM decomposition of speech is suggested which was validated on synthetic AM-FM signals and tested on reconstruction of voiced speech signals where the signal-to-error reconstruction ratio (SERR) was used as measure. Compared to the standard sinusoidal representation, the suggested approach found to improve the corresponding SERR by 47%, resulting in over 30 dB of SERR.
引用
收藏
页码:112 / 115
页数:4
相关论文
共 50 条
  • [31] Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model
    Nakatani, Tomohiro
    Juang, Biing-Hwang
    Yoshioka, Takuya
    Kinoshita, Keisuke
    Delcroix, Marc
    Miyoshi, Masato
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1512 - 1527
  • [32] Frequency estimation of a sinusoidal signal with time-varying amplitude and phase
    Vedyakov, Alexey A.
    Vediakova, Anastasiia O.
    Bobtsov, Alexey A.
    Pyrkin, Anton A.
    Kakanov, Mikhail A.
    IFAC PAPERSONLINE, 2018, 51 (32): : 663 - 668
  • [33] Speaker Identification based on Robust AM-FM Features
    Deshpande, Mangesh S.
    Holambe, Raghunath S.
    2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 62 - +
  • [34] Time-varying sinusoidal demodulation for non-stationary modeling of speech
    Sharma, Neeraj Kumar
    Sreenivas, Thippur V.
    SPEECH COMMUNICATION, 2018, 105 : 77 - 91
  • [35] AM-FM decomposition of speech signals: An asymptotically exact approach based on the iterated Hilbert transform
    Gianfelici, Francesco
    Biagetti, Giorgio
    Crippa, Paolo
    Turchetti, Claudio
    2005 IEEE/SP 13TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), VOLS 1 AND 2, 2005, : 301 - 305
  • [36] MRI brain image segmentation using an AM-FM model
    Pattichis, MS
    Petropoulos, H
    Brooks, WM
    CONFERENCE RECORD OF THE THIRTY-FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2000, : 906 - 910
  • [37] Perception of speech in reverberant conditions using AM-FM cochlear implant simulation
    Drgas, Szymon
    Blaszak, Magdalena A.
    HEARING RESEARCH, 2010, 269 (1-2) : 162 - 168
  • [38] INSTANTANEOUS PARAMETERS ESTIMATION ALGORITHM FOR NOISY AM-FM OSCILLATORY SIGNALS
    Azarov, Elias
    Vashkevich, Maxim
    Petrovsky, Alexander
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 989 - 993
  • [39] A model for time-varying quality of speech services
    Chen, Z
    Nakazato, H
    GLOBECOM '05: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-6: DISCOVERY PAST AND FUTURE, 2005, : 240 - 244
  • [40] Frequency Estimation of Discrete-Time Sinusoidal Signals With Time-Varying Amplitude
    Jiang, Teng
    Cui, Guozeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (05) : 2754 - 2758