Modeling speech signals in the time-frequency domain using GARCH

Cited by: 21

Author:

Cohen, I [1]

Affiliation:

[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel

Source:

SIGNAL PROCESSING | 2004 / Vol. 84 / No. 12

Keywords:

speech modeling; time-frequency analysis; GARCH

DOI:

10.1016/j.sigpro.2004.09.001

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology]

Subject classification codes:

0808; 0809

Abstract:

In this paper, we introduce a novel modeling approach for speech signals in the short-time Fourier transform (STFT) domain. We define the conditional variance of the STFT expansion coefficients, and model the one-frame-ahead conditional variance as a generalized autoregressive conditional heteroscedasticity (GARCH) process. The proposed approach offers a reasonable model on which to base the estimation of the variances of the STFT expansion coefficients, while taking into consideration their heavy-tailed distribution. © 2004 Elsevier B.V. All rights reserved.
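
As a rough illustration of the one-frame-ahead conditional variance mentioned in the abstract, the sketch below applies the standard textbook GARCH(1,1) recursion to the STFT coefficients of a single frequency bin. It is not taken from the paper: the function name `garch11_variance`, the parameter names `omega`, `alpha`, `beta`, and the initial variance `init_var` are all assumptions made for this example, and the paper's own parameterization may differ.

```python
def garch11_variance(coeffs, omega, alpha, beta, init_var):
    """Return one-frame-ahead conditional variances for one frequency bin.

    Standard GARCH(1,1) recursion (an assumed, textbook form):
        var[t + 1] = omega + alpha * |coeffs[t]|**2 + beta * var[t]

    coeffs   : sequence of (possibly complex) STFT coefficients, one per frame
    omega    : positive constant term (hypothetical name)
    alpha    : non-negative weight on the previous squared magnitude
    beta     : non-negative weight on the previous conditional variance
    init_var : variance assumed for the first frame (hypothetical choice)
    """
    if omega <= 0 or alpha < 0 or beta < 0:
        raise ValueError("need omega > 0, alpha >= 0, beta >= 0")

    # variances[t] is the variance predicted for frame t from frames before t.
    variances = [float(init_var)]
    for x in coeffs:
        prev = variances[-1]
        variances.append(omega + alpha * abs(x) ** 2 + beta * prev)
    return variances


if __name__ == "__main__":
    # Toy coefficients for a single bin; the values are illustrative only.
    frames = [1 + 1j, 0j, 0.5 - 0.5j]
    print(garch11_variance(frames, omega=0.1, alpha=0.2, beta=0.7, init_var=1.0))
```

The returned list has one more entry than the input, since the last entry is the variance predicted for the frame that follows the final observed coefficient. A large coefficient in one frame raises the predicted variance of the next, which is the clustering behaviour that makes such models compatible with heavy-tailed data.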

Pages: 2453-2459

Page count: 7
