AM-FM Estimation for Speech Based on a Time-Varying Sinusoidal Model

被引：0

作者：

Pantazis, Yannis

Rosec, Olivier

Stylianou, Yannis

机构：

来源：

INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5 | 2009年

关键词：

Sinusoidal modeling; AM-FM demodulation; Speech analysis; Speech reconstruction;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we present a method based on a time-varying sinusoidal model for a robust and accurate estimation of amplitude and frequency modulations (AM-FM) in speech. The suggested approach has two main steps. First, speech is modeled as a sinusoidal model with time-varying amplitudes. Specifically, the model makes use of a first order time polynomial with complex coefficients for capturing instantaneous amplitude and frequency (phase) components. Next, the model parameters are updated by using the previously estimated instantaneous phase information. Thus, an iterative scheme for AM-FM decomposition of speech is suggested which was validated on synthetic AM-FM signals and tested on reconstruction of voiced speech signals where the signal-to-error reconstruction ratio (SERR) was used as measure. Compared to the standard sinusoidal representation, the suggested approach found to improve the corresponding SERR by 47%, resulting in over 30 dB of SERR.

引用

页码：112 / 115

页数：4

共 50 条

[31] Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model
Nakatani, Tomohiro
Juang, Biing-Hwang
Yoshioka, Takuya
Kinoshita, Keisuke
Delcroix, Marc
Miyoshi, Masato
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1512 - 1527
[32] Frequency estimation of a sinusoidal signal with time-varying amplitude and phase
Vedyakov, Alexey A.
Vediakova, Anastasiia O.
Bobtsov, Alexey A.
Pyrkin, Anton A.
Kakanov, Mikhail A.
IFAC PAPERSONLINE, 2018, 51 (32): : 663 - 668
[33] Speaker Identification based on Robust AM-FM Features
Deshpande, Mangesh S.
Holambe, Raghunath S.
2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 62 - +
[34] Time-varying sinusoidal demodulation for non-stationary modeling of speech
Sharma, Neeraj Kumar
Sreenivas, Thippur V.
SPEECH COMMUNICATION, 2018, 105 : 77 - 91
[35] AM-FM decomposition of speech signals: An asymptotically exact approach based on the iterated Hilbert transform
Gianfelici, Francesco
Biagetti, Giorgio
Crippa, Paolo
Turchetti, Claudio
2005 IEEE/SP 13TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), VOLS 1 AND 2, 2005, : 301 - 305
[36] MRI brain image segmentation using an AM-FM model
Pattichis, MS
Petropoulos, H
Brooks, WM
CONFERENCE RECORD OF THE THIRTY-FOURTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2000, : 906 - 910
[37] Perception of speech in reverberant conditions using AM-FM cochlear implant simulation
Drgas, Szymon
Blaszak, Magdalena A.
HEARING RESEARCH, 2010, 269 (1-2) : 162 - 168
[38] INSTANTANEOUS PARAMETERS ESTIMATION ALGORITHM FOR NOISY AM-FM OSCILLATORY SIGNALS
Azarov, Elias
Vashkevich, Maxim
Petrovsky, Alexander
2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 989 - 993
[39] A model for time-varying quality of speech services
Chen, Z
Nakazato, H
GLOBECOM '05: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-6: DISCOVERY PAST AND FUTURE, 2005, : 240 - 244
[40] Frequency Estimation of Discrete-Time Sinusoidal Signals With Time-Varying Amplitude
Jiang, Teng
Cui, Guozeng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (05) : 2754 - 2758

← 1 2 3 4 5 →