A Mixture Model Approach for Formant Tracking and the Robustness of Student's-t Distribution

被引:10
|
作者
Sundar, Harshavardhan [1 ]
Seelamantula, Chandra Sekhar [2 ]
Sreenivas, Thippur V. [1 ]
机构
[1] Indian Inst Sci, Dept Elect Commun Engn, Bangalore 560012, Karnataka, India
[2] Indian Inst Sci, Dept Elect Engn, Bangalore 560012, Karnataka, India
关键词
Formant tracking; Gaussian mixture model (GMM); multimodal density estimation; statistical mixture modeling; Student's-t mixture model (tMM); SPEECH; FREQUENCY; SEPARATION;
D O I
10.1109/TASL.2012.2209418
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We address the problem of robust formant tracking in continuous speech in the presence of additive noise. We propose a new approach based on mixture modeling of the formant contours. Our approach consists of two main steps: (i) Computation of a pyknogram based on multiband amplitude-modulation/frequency-modulation (AM/FM) decomposition of the input speech; and (ii) Statistical modeling of the pyknogram using mixture models. We experiment with both Gaussian mixture model (GMM) and Student's-t mixture model (tMM) and show that the latter is robust with respect to handling outliers in the pyknogram data, parameter selection, accuracy, and smoothness of the estimated formant contours. Experimental results on simulated data as well as noisy speech data show that the proposed tMM-based approach is also robust to additive noise. We present performance comparisons with a recently developed adaptive filterbank technique proposed in the literature and the classical Burg's spectral estimator technique, which show that the proposed technique is more robust to noise.
引用
收藏
页码:2626 / 2636
页数:11
相关论文
共 50 条
  • [1] Bounded Asymmetrical Student's-t Mixture Model
    Thanh Minh Nguyen
    Wu, Q. M. Jonathan
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (06) : 857 - 869
  • [2] MULTIVARIATE STUDENT'S-T MIXTURE MODEL FOR BOUNDED SUPPORT DATA
    Thanh Minh Nguyen
    Wu, Q. M. Jonathan
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 5548 - 5552
  • [3] Image segmentation using a hierarchical student's-t mixture model
    Kong, Lingcheng
    Zhang, Hui
    Zheng, Yuhui
    Chen, Yunjie
    Zhu, Jiezhong
    Wu, Qingming M. Jonathan
    [J]. IET IMAGE PROCESSING, 2017, 11 (11) : 1094 - 1102
  • [4] EPLL image restoration with a bounded asymmetrical Student?s-t mixture model
    Yu, Qiqiong
    Cao, Guo
    Shi, Hao
    Zhang, Youqiang
    Fu, Peng
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 88
  • [5] A Multiphase Level Set Clustering Approach Using MRF-based Student's-t Mixture Model
    Xie, Qunyi
    Pan, Xu
    Zhu, Hongqing
    [J]. 2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [6] A ROBUST NONSYMMETRIC STUDENT'S-T FINITE MIXTURE MODEL FOR MR IMAGE SEGMENTATION
    Pan, Xu
    Zhu, Hongqing
    Xie, Qunyi
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1830 - 1834
  • [7] Fast and robust image segmentation with active contours and Student's-t mixture model
    Gao, Guowei
    Wen, Chenglin
    Wang, Huibin
    [J]. PATTERN RECOGNITION, 2017, 63 : 71 - 86
  • [8] Effective multiview registration of point clouds based on Student?s-t mixture model
    Ma, Yanlin
    Zhu, Jihua
    Tian, Zhiqiang
    Li, Zhongyu
    [J]. INFORMATION SCIENCES, 2022, 608 : 137 - 152
  • [9] Unsupervised Image Categorization Based on Variational Autoencoder and Student's-T Mixture Model
    Zhang, Yu
    Fan, Wentao
    Bouguila, Nizar
    [J]. 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 2403 - 2409
  • [10] Robust stochastic configuration networks for industrial data modelling with Student's-t mixture distribution
    Yan, Aijun
    Guo, Jingcheng
    Wang, Dianhui
    [J]. INFORMATION SCIENCES, 2022, 607 : 493 - 505