Application of variational Bayesian PCA for speech feature extraction

被引:0
|
作者
Kwon, OW [1 ]
Lee, TW [1 ]
Chan, KL [1 ]
机构
[1] Univ Calif San Diego, Inst Neural Computat, La Jolla, CA 92059 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In a standard mel-frequency cepstral coefficient-based speech recognizer, it is common to use the same feature dimension and the number of Gaussian mixtures for all subunits. We proposed to use different transformations and different number of mixtures for each subunit. We obtained the transformations from mel-frequency band energies by using the variational Bayesian principal component analysis (PCA) method. In the method, hyperparameters of the Gaussian mixtures and the number of mixtures are automatically learned through maximization of a lower bound of the evidence instead of the likelihood in the conventional maximum likelihood paradigm. Analyzing the TIMIT speech data, we revealed intrinsic structures of vowels and consonants. We demonstrated the usefulness of the method for speech recognition by performing phoneme classification of /b/, /d/ and /g/ phonemes.
引用
收藏
页码:825 / 828
页数:4
相关论文
共 50 条
  • [31] PCA feature extraction for protein structure prediction
    Melo, JCB
    Cavalcanti, GDC
    Guimaraes, KS
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 2952 - 2957
  • [32] ICA and PCA Integrated Feature Extraction for Classification
    Reza, Md Shamim
    Ma, Jinwen
    PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 1083 - 1088
  • [33] A variational Bayesian framework for group feature selection
    Niranjan Subrahmanya
    Yung C. Shin
    International Journal of Machine Learning and Cybernetics, 2013, 4 : 609 - 619
  • [34] Application of EEMD and Hilbert marginal spectrum in speech emotion feature extraction
    Xiang Lei
    Xiong Weihua
    Li Junfeng
    Ji Ruisong
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 3686 - 3689
  • [35] Feature extraction and denoising using kernel PCA
    Jade, AM
    Srikanth, B
    Jayaraman, VK
    Kulkarni, BD
    Jog, JP
    Priya, L
    CHEMICAL ENGINEERING SCIENCE, 2003, 58 (19) : 4441 - 4448
  • [36] Kernel PCA for feature extraction with information complexity
    Liu, ZQ
    Bozdogan, H
    STATISTICAL DATA MINING AND KNOWLEDGE DISCOVERY, 2004, : 309 - 322
  • [37] A variational Bayesian framework for group feature selection
    Subrahmanya, Niranjan
    Shin, Yung C.
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2013, 4 (06) : 609 - 619
  • [38] Variational Feature Extraction in Scientific Visualization
    Dassler, Nico
    Guenther, Tobias
    ACM TRANSACTIONS ON GRAPHICS, 2024, 43 (04):
  • [39] Visual speech feature extraction for improved speech recognition
    Zhang, X
    Mersereau, RM
    Clements, M
    Broun, CC
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1993 - 1996
  • [40] Mixture Probabilistic PCA for Process Monitoring - Collapsed Variational Bayesian Approach
    Raveendran, Rahul
    Huang, Biao
    IFAC PAPERSONLINE, 2016, 49 (07): : 1032 - 1037