Adjusting mixture weights of Gaussian mixture model via regularized probabilistic latent semantic analysis

被引:0
|
作者
Si, L
Jin, R
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS | 2005年 / 3518卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mixture models, such as Gaussian Mixture Model, have been widely used in many applications for modeling data. Gaussian mixture model (GMM) assumes that data points are generated from a set of Gaussian models with the same set of mixture weights. A natural extension of GMM is the probabilistic latent semantic analysis (PLSA) model, which assigns different mixture weights for each data point. Thus, PLSA is more flexible than the GMM method. However, as a tradeoff, PLSA usually suffers from the overfitting problem. In this paper, we propose a regularized probabilistic latent semantic analysis model (RPLSA), which can properly adjust the amount of model flexibility so that not only the training data can be fit well but also the model is robust to avoid the overfitting problem. We conduct empirical study for the application of speaker identification to show the effectiveness of the new model. The experiment results on the NIST speaker recognition dataset indicate that the RPLSA model outperforms both the GMM and PLSA models substantially. The principle of RPLSA of appropriately adjusting model flexibility can be naturally extended to other applications and other types of mixture models.
引用
收藏
页码:622 / 631
页数:10
相关论文
共 50 条
  • [21] A Gaussian Mixture Model for Statistical Timing Analysis
    Takahashi, Shingo
    Yoshida, Yuki
    Tsukiyama, Shuji
    DAC: 2009 46TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, VOLS 1 AND 2, 2009, : 110 - +
  • [22] A DUAL EM ALGORITHM FOR TV REGULARIZED GAUSSIAN MIXTURE MODEL IN IMAGE SEGMENTATION
    Yan, Shi
    Liu, Jun
    Huang, Haiyang
    Tai, Xue-Cheng
    INVERSE PROBLEMS AND IMAGING, 2019, 13 (03) : 653 - 677
  • [23] A latent-class mixture model for incomplete longitudinal Gaussian data
    Beunckens, Caroline
    Molenberghs, Geert
    Verbeke, Geert
    Mallinckrodt, Craig
    BIOMETRICS, 2008, 64 (01) : 96 - 105
  • [24] Model-based classification using latent Gaussian mixture models
    McNicholas, Paul D.
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2010, 140 (05) : 1175 - 1181
  • [25] Regularized Subspace Gaussian Mixture Models for Speech Recognition
    Lu, Liang
    Ghoshal, Arnab
    Renals, Steve
    IEEE SIGNAL PROCESSING LETTERS, 2011, 18 (07) : 419 - 422
  • [26] Latent Gaussian random field mixture models
    Bolin, David
    Wallin, Jonas
    Lindgren, Finn
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 130 : 80 - 93
  • [27] Probabilistic Forecasting of Bus Travel Time with a Bayesian Gaussian Mixture Model
    Chen, Xiaoxu
    Cheng, Zhanhong
    Jin, Jian Gang
    Trepanier, Martin
    Sun, Lijun
    TRANSPORTATION SCIENCE, 2023, 57 (06) : 1516 - 1535
  • [28] Mixture Gaussian process model with Gaussian mixture distribution for big data
    Guan, Yaonan
    He, Shaoying
    Ren, Shuangshuang
    Liu, Shuren
    Li, Dewei
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2024, 253
  • [29] A Probabilistic Quality-Relevant Monitoring Method With Gaussian Mixture Model
    Yu, Wanke
    Zhao, Chunhui
    Huang, Biao
    Yang, Hui
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, : 1 - 12
  • [30] An Analytical Method for Probabilistic Load Flow using Gaussian Mixture Model
    Zhang Jun
    Zhang Shuang
    Gao Feng
    Li Xutao
    Li Hongqiang
    Wang Zhiwen
    Shen Chen
    2016 IEEE INTERNATIONAL CONFERENCE ON POWER SYSTEM TECHNOLOGY (POWERCON), 2016,