Experiments on Automatic Language Identification for Philippine Languages using Acoustic Gaussian Mixture Models

被引:0
|
作者
Laguna, Ann Franchesca [1 ]
Guevara, Rowena Cristina [1 ]
机构
[1] Univ Philippines Diliman, Digital Signal Proc Lab, Quezon City, Philippines
关键词
Language Identification; Acoustic; Philippine Languages; GMM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A Philippine LID system has not been previously created because of the limited amount of recorded speech data. This research initiates the LID research using the Philippine Language Database (PLD) collected by the Digital Signal Processing Laboratory of the University of the Philippines Diliman (DSP-UPD). Mel Frequency Cepstral Coefficients (MFCC), Perceptual Linear Prediction (PLP), Shifted Delta Cepstra (SDC) and Linear Predictive Cepstral Coefficients (LPCC) features are extracted from the speech segments. Gaussian Mixture Model (GMM) using Expectation Maximization (EM) and Universal Background Model (UBM) approach is used to model the acoustic characteristics of the language. Maximum a Posteriori (MAP) probability is then used to determine the language of a speech utterance based on the language GMMs. PLP using a 16 Mixture GMM-EM has been found to produce the best performance among the four feature vectors in discriminating the languages.
引用
收藏
页码:657 / 662
页数:6
相关论文
共 50 条
  • [31] Coding using Gaussian mixture and generalized Gaussian models
    Su, JK
    Mersereau, RM
    [J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, PROCEEDINGS - VOL I, 1996, : 217 - 220
  • [32] Automatic speaker recognition using a unique personal feature vector and Gaussian Mixture Models
    Kaminski, Kamil
    Majda, Ewelina
    Dobrowolski, Andrzej P.
    [J]. 2013 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2013, : 220 - 225
  • [33] Advertisement Detection in French Broadcast News Using Acoustic Repetition and Gaussian Mixture Models
    Gupta, Vishwa
    Boulianne, Gilles
    Kenny, Patrick
    Dumouchel, Pierre
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2538 - 2541
  • [34] Automatic genre classification of TV programmes using Gaussian mixture models and neural networks
    Montagnuolo, Maurizio
    Messina, Alberto
    [J]. DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 99 - +
  • [35] Automatic object trajectory-based motion recognition using Gaussian mixture models
    Bashir, F
    Khokhar, A
    Schonfeld, D
    [J]. 2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 1533 - 1536
  • [36] Modeling phase spectra using Gaussian mixture models for human face identification
    Mitra, S
    Savvides, M
    Brockwell, A
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2005, 3687 : 174 - 182
  • [37] Privacy-Preserving Speaker Verification and Identification Using Gaussian Mixture Models
    Pathak, Manas A.
    Raj, Bhiksha
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (02): : 397 - 406
  • [38] Magnetizing inrush current identification using wavelet based gaussian mixture models
    Jazebi, S.
    Vahidi, B.
    Hosseinian, S. H.
    Faiz, J.
    [J]. SIMULATION MODELLING PRACTICE AND THEORY, 2009, 17 (06) : 991 - 1010
  • [39] Speaker identification using a novel combination of sparse representation and Gaussian mixture models
    Ma Yunjie
    [J]. AUTOMATIC CONTROL AND MECHATRONIC ENGINEERING III, 2014, 615 : 265 - 269
  • [40] An Improved Speaker Identification System Using Automatic Split-Merge Incremental Learning (A-SMILE) of Gaussian Mixture Models
    Bouziane, Ayoub
    Kharroubi, Jamal
    Zarghili, Arsalane
    [J]. ARTIFICIAL INTELLIGENCE TRENDS IN INTELLIGENT SYSTEMS, CSOC2017, VOL 1, 2017, 573 : 364 - 373