Application of variational Bayesian PCA for speech feature extraction

被引:0
|
作者
Kwon, OW [1 ]
Lee, TW [1 ]
Chan, KL [1 ]
机构
[1] Univ Calif San Diego, Inst Neural Computat, La Jolla, CA 92059 USA
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In a standard mel-frequency cepstral coefficient-based speech recognizer, it is common to use the same feature dimension and the number of Gaussian mixtures for all subunits. We proposed to use different transformations and different number of mixtures for each subunit. We obtained the transformations from mel-frequency band energies by using the variational Bayesian principal component analysis (PCA) method. In the method, hyperparameters of the Gaussian mixtures and the number of mixtures are automatically learned through maximization of a lower bound of the evidence instead of the likelihood in the conventional maximum likelihood paradigm. Analyzing the TIMIT speech data, we revealed intrinsic structures of vowels and consonants. We demonstrated the usefulness of the method for speech recognition by performing phoneme classification of /b/, /d/ and /g/ phonemes.
引用
收藏
页码:825 / 828
页数:4
相关论文
共 50 条
  • [41] Statistical Monitoring of Wastewater Treatment Plants Using Variational Bayesian PCA
    Liu, Yiqi
    Pan, Yongping
    Sun, Zonghai
    Huang, Daoping
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2014, 53 (08) : 3272 - 3282
  • [42] Application of neural networks, PCA and feature extraction for prediction of nucleotide sequences by using genomic signals
    Cristea, Paul
    Mladenov, Valeri
    Tsenov, Georgi
    Tuduce, Rodica
    Petrakieva, Simona
    NEUREL 2008: NINTH SYMPOSIUM ON NEURAL NETWORK APPLICATIONS IN ELECTRICAL ENGINEERING, PROCEEDINGS, 2008, : 78 - +
  • [43] Signal feature extraction by multi-scale PCA and its application to respiratory sound classification
    Xie, Shengkun
    Jin, Feng
    Krishnan, Sridhar
    Sattar, Farook
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2012, 50 (07) : 759 - 768
  • [44] Feature Extraction for Object Recognition using PCA-KNN with Application to Medical Image Analysis
    Kamencay, Patrik
    Hudec, Robert
    Benco, Miroslav
    Zachariasova, Martina
    2013 36TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2013, : 830 - 834
  • [45] The Efficiency of ICA-based Representation Analysis: Application to Speech Feature Extraction
    Du Jun
    Zou Xin
    Hao Jie
    Liu Ju
    CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (02): : 287 - 292
  • [46] Signal feature extraction by multi-scale PCA and its application to respiratory sound classification
    Shengkun Xie
    Feng Jin
    Sridhar Krishnan
    Farook Sattar
    Medical & Biological Engineering & Computing, 2012, 50 : 759 - 768
  • [47] Mutual Information Variational Autoencoders and Its Application to Feature Extraction of Multivariate Time Series
    Li, Junying
    Ren, Weijie
    Han, Min
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (06)
  • [48] Multiple Feature Extraction for RNN-based Assamese Speech Recognition for Speech to Text Conversion Application
    Dutta, Krishna
    Sarma, Kandarpa Kumar
    PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, DEVICES AND INTELLIGENT SYSTEMS (CODLS), 2012, : 600 - 603
  • [49] Optimizing feature extraction for speech recognition
    Lee, CH
    Hyun, DH
    Choi, ES
    Go, JW
    Lee, CY
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (01): : 80 - 87
  • [50] Human Speech Perception and Feature Extraction
    Lobdell, Bryce E.
    Hasegawa-Johnson, Mark A.
    Allen, Jont B.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1797 - 1800