Modeling speech signals in the time-frequency domain using GARCH

被引:21
|
作者
Cohen, I [1 ]
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
关键词
speech modeling; time-frequency analysis; GARCH;
D O I
10.1016/j.sigpro.2004.09.001
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we introduce a novel modeling approach for speech signals in the short-time Fourier transform (STFT) domain. We define the conditional variance of the STFT expansion coefficients, and model the one-frame-ahead conditional variance as a generalized autoregressive conditional heteroscedasticity (GARCH) process. The proposed approach offers a reasonable model on which to base the estimation of the variances of the STFT expansion coefficients, while taking into consideration their heavy-tailed distribution. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:2453 / 2459
页数:7
相关论文
共 50 条
  • [1] Watermarking of speech signals in the time-frequency domain
    Al-Khassaweneh, Mahmood
    Al-Zoubi, Hussein
    Aviyente, Selin
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY, 2009, : 317 - +
  • [2] An approach to digital watermarking of speech signals in the time-frequency domain
    Stankovic, Srdjan
    Orovic, Irena
    Zaric, Nikola
    Ioana, Cornel
    [J]. PROCEEDINGS ELMAR-2006, 2006, : 127 - 130
  • [3] A Time-Frequency Domain Formant Frequency Estimation Scheme for Noisy Speech Signals
    Fattah, S. A.
    Zhu, W-P.
    Ahmad, M. O.
    [J]. ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 1201 - 1204
  • [4] The multi-bit watermarking method for speech signals in the time-frequency domain
    Al-khassaweneh, Mahmood
    Al-zoubi, Hussien
    Aviyente, Selin
    [J]. INTEGRATED COMPUTER-AIDED ENGINEERING, 2010, 17 (01) : 59 - 67
  • [5] On timing in time-frequency analysis of speech signals
    Yegnanarayana, B
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1996, 21 : 5 - 20
  • [6] Representation of fine structure of speech signals using time-frequency zeros
    Okada, A
    Ono, N
    Ando, S
    [J]. SICE 2004 ANNUAL CONFERENCE, VOLS 1-3, 2004, : 2357 - 2360
  • [7] Segmentation on time-frequency domain for speech segregation
    Lim, Sung-Kil
    Lee, Hyon-Soo
    [J]. 2006 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1 AND 2, 2006, : 433 - +
  • [8] Neural speech enhancement in the time-frequency domain
    Volkmer, M
    [J]. 2003 IEEE XIII WORKSHOP ON NEURAL NETWORKS FOR SIGNAL PROCESSING - NNSP'03, 2003, : 617 - 626
  • [9] Speech presence detection in the time-frequency domain using minimum statistics
    Sorensen, KV
    Andersen, SV
    [J]. NORSIG 2004: PROCEEDINGS OF THE 6TH NORDIC SIGNAL PROCESSING SYMPOSIUM, 2004, 46 : 340 - 343
  • [10] Joint Time-Frequency and Time Domain Learning for Speech Enhancement
    Tang, Chuanxin
    Luo, Chong
    Zhao, Zhiyuan
    Xie, Wenxuan
    Zeng, Wenjun
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3816 - 3822