Application of variational Bayesian PCA for speech feature extraction

Cited by: 0
Authors
Kwon, OW [1 ]
Lee, TW [1 ]
Chan, KL [1 ]
Affiliation
[1] Univ Calif San Diego, Inst Neural Computat, La Jolla, CA 92059 USA
Source
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS | 2002
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Subject classification code
070206 ; 082403 ;
Abstract
In a standard mel-frequency cepstral coefficient-based speech recognizer, it is common to use the same feature dimension and the same number of Gaussian mixtures for all subunits. We proposed to use a different transformation and a different number of mixtures for each subunit. We obtained the transformations from mel-frequency band energies by using the variational Bayesian principal component analysis (PCA) method. In this method, the hyperparameters of the Gaussian mixtures and the number of mixtures are learned automatically by maximizing a lower bound on the evidence, rather than the likelihood as in the conventional maximum likelihood paradigm. By analyzing the TIMIT speech data, we revealed intrinsic structures of vowels and consonants. We demonstrated the usefulness of the method for speech recognition by performing phoneme classification of the /b/, /d/ and /g/ phonemes.
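The lower bound on the evidence mentioned in the abstract can be written in its generic textbook form. This is a sketch only: the notation is assumed here and is not taken from the paper. X denotes the observed data, \theta collects the latent variables and parameters, and q(\theta) is the variational distribution.

```latex
\ln p(X) \;=\; \mathcal{L}(q) \;+\; \mathrm{KL}\!\left( q(\theta) \,\|\, p(\theta \mid X) \right) \;\ge\; \mathcal{L}(q),
\qquad
\mathcal{L}(q) \;=\; \int q(\theta) \, \ln \frac{p(X, \theta)}{q(\theta)} \, d\theta .
```

Because the KL divergence is non-negative, maximizing \mathcal{L}(q) over q tightens the bound on \ln p(X). Under this reading, the value of the bound, rather than the likelihood, is the quantity used to compare models with different numbers of mixtures.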
Pages: 825 - 828
Page count: 4
Related papers
50 in total
  • [21] LIP FEATURE EXTRACTION BASED ON PCA
    Liu, He
    Li, Qianyu
    FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2012), 2012, : 21 - 25
  • [22] Condition for Perfect Dimensionality Recovery by Variational Bayesian PCA
    Nakajima, Shinichi
    Tomioka, Ryota
    Sugiyama, Masashi
    Babacan, S. Derin
    JOURNAL OF MACHINE LEARNING RESEARCH, 2015, 16 : 3757 - 3811
  • [23] Variational Bayesian Approach to Condition-Invariant Feature Extraction for Visual Place Recognition
    Oh, Junghyun
    Eoh, Gyuho
    APPLIED SCIENCES-BASEL, 2021, 11 (19):
  • [24] Kernel-based feature extraction with a speech technology application
    Kocsor, A
    Tóth, L
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) : 2250 - 2263
  • [26] PCA Feature Frequency Extraction Algorithm Based on SVD Principle and Its Application
    Guo M.
    Li W.
    Yang Q.
    Zhao X.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2020, 48 (01) : 1 - 9
  • [27] Deep Bayesian Slow Feature Extraction With Application to Industrial Inferential Modeling
    Jiang, Chao
    Lu, Yusheng
    Zhong, Weimin
    Huang, Biao
    Tan, Dayu
    Song, Wenjiang
    Qian, Feng
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) : 40 - 51
  • [28] Variational Bayesian learning for speech modeling and enhancement
    Huang, Qinghua
    Yang, Jie
    Wei, Shoushui
    SIGNAL PROCESSING, 2007, 87 (09) : 2026 - 2035
  • [29] Variational Bayesian estimation and clustering for speech recognition
    Watanabe, S
    Minami, Y
    Nakamura, A
    Ueda, N
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (04) : 365 - 381
  • [30] Robust feature extraction using kernel PCA
    Takiguchi, Tetsuya
    Ariki, Yasuo
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 509 - 512