Unsupervised Learning of Finite Mixtures using Scaled Dirichlet Distribution and its Application to Software Modules categorization

被引:0
|
作者
Oboh, Eromonsele Samuel [1 ]
Bouguila, Nizar [1 ]
机构
[1] Concordia Univ, CIISE, Montreal, PQ H3G 2W1, Canada
关键词
Data clustering; Mixture models; Scaled Dirichlet distribution; Unsupervised learning; Software modules categorization;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
We have designed and implemented an unsupervised learning algorithm for finite mixture model using the scaled Dirichlet distribution for multivariate proportional data. In this paper, the task of learning finite mixture model involves estimation of model parameters as well as inferring the hidden class information of our observed data. We made use of the expectation maximization algorithm to find the maximum likelihood estimate of our model parameters. This work, aims to address the flexibility challenge of the Dirichlet distribution by introducing a distribution that adds to it a scale parameter. This is important, because there is growing need for models that can fully describe the intrinsic nature of datasets. In addition, we applied our learning algorithm to synthetic datasets as well as to address the challenge of detecting fault prone software modules. Our proposed algorithm, makes it possible to discover these fault prone modules by harnessing their complexity-based attribute information. Finally, we compare our proposed model classification results with those from the Gaussian and Dirichlet mixture models.
引用
收藏
页码:1085 / 1090
页数:6
相关论文
共 20 条
  • [1] Software modules categorization through likelihood and bayesian analysis of finite dirichlet mixtures
    Bouguila, Nizar
    Wang, Jian Han
    Hamza, A. Ben
    [J]. JOURNAL OF APPLIED STATISTICS, 2010, 37 (02) : 235 - 252
  • [2] Unsupervised learning of a finite mixture model based on the Dirichlet distribution and its application
    Bouguila, N
    Ziou, D
    Vaillancourt, J
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (11) : 1533 - 1543
  • [3] Predicting defect-prone software modules using shifted-scaled Dirichlet distribution
    Alsuroji, Rua
    Bouguila, Nizar
    Zamzami, Nuha
    [J]. 2018 FIRST IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE FOR INDUSTRIES (AI4I 2018), 2018, : 15 - 18
  • [4] Unsupervised Learning of Finite Mixtures Using Entropy Regularization and Its Application to Image Segmentation
    Lu, Zhiwu
    Peng, Yuxin
    Xiao, Jianguo
    [J]. 2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 641 - 648
  • [5] IMAGE DATABASE CATEGORIZATION USING ROBUST UNSUPERVISED LEARNING OF FINITE GENERALIZED DIRICHLET MIXTURE MODELS
    Ben Ismail, M. Maher
    Frigui, Hichem
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [6] Data Clustering using Variational Learning of Finite Scaled Dirichlet Mixture Models
    Hieu Nguyen
    Azam, Muhammad
    Bouguila, Nizar
    [J]. 2019 IEEE 28TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2019, : 1391 - 1396
  • [7] A powreful finite mixture model based on the generalized Dirichlet distribution: Unsupervised learning and applications
    Bouguila, N
    Ziou, D
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, 2004, : 280 - 283
  • [8] Data Clustering using Online Variational Learning of Finite Scaled Dirichlet Mixture Models
    Nguyen, Hieu
    Kalra, Meeta
    Azam, Muhammad
    Bouguila, Nizar
    [J]. 2019 IEEE 20TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2019), 2019, : 267 - 274
  • [9] Using unsupervised learning of a finite Dirichlet mixture model to improve pattern recognition applications
    Bouguila, N
    Ziou, D
    [J]. PATTERN RECOGNITION LETTERS, 2005, 26 (12) : 1916 - 1925
  • [10] Data Clustering Using Variational Learning of Finite Scaled Dirichlet Mixture Models with Component Splitting
    Hieu Nguyen
    Maanicshah, Kamal
    Azam, Muhammad
    Bouguila, Nizar
    [J]. IMAGE ANALYSIS AND RECOGNITION (ICIAR 2019), PT II, 2019, 11663 : 117 - 128