Infinite Von Mises-Fisher Mixture Model and Its Application to Gene Expression Data Clustering

被引:0
|
作者
Zhu, Jiaojiao [1 ]
Fan, Wentao [1 ]
机构
[1] Huaqiao Univ, Coll Comp Sci & Technol, Xiamen, Peoples R China
基金
中国国家自然科学基金;
关键词
Dirichlet Process; von-Mises Fisher Mixture Model; Clustering; Variational Inference; Kd Tree; Gene Expression; VARIATIONAL INFERENCE;
D O I
10.1145/3461353.3461364
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In some applications of data mining, clustering analysis of directional data is often involved. In this case, conventional model-based clustering methods are not suitable for fitting the data of such type of structure. Therefore, a Dirichlet Process Mixture Model based on von Mises-Fisher distribution was proposed for the clustering analysis of directional data. The main motivation is that as a non-parametric Bayesian model, The Dirichlet process can automatically determine the complexity of the mixture model when the number of data categories is unknown. We use the accelerated Variational inference algorithm to quickly estimate the parameters involved in the model, which enables the method to be applied in applications with large data scale. The validity of the proposed model was verified by using different scale simulation data and clustering analysis of gene expression data.
引用
收藏
页码:55 / 61
页数:7
相关论文
共 50 条
  • [41] A Dirichlet process mixture model for clustering longitudinal gene expression data
    Sun, Jiehuan
    Herazo-Maya, Jose D.
    Kaminski, Naftali
    Zhao, Hongyu
    Warren, Joshua L.
    [J]. STATISTICS IN MEDICINE, 2017, 36 (22) : 3495 - 3506
  • [42] From Alternative Clustering to Robust Clustering and Its Application to Gene Expression Data
    Peng, Peter
    Nagi, Mohamad
    Sair, Omer
    Suleiman, Iyad
    Qabaja, Ala
    ElSheikh, Abdallah M.
    Gao, Shang
    Ozyer, Tansel
    Kianmehr, Keivan
    Naji, Ghada
    Ridley, Mick
    Rokne, Jon
    Alhajj, Reda
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2011, 2011, 6936 : 421 - +
  • [43] Application of the von Mises-Fisher distribution to Random Walk on Spheres method for solving high-dimensional diffusion-advection-reaction equations
    Sabelfeld, Karl K.
    [J]. STATISTICS & PROBABILITY LETTERS, 2018, 138 : 137 - 142
  • [44] Spherical data clustering and feature selection through nonparametric Bayesian mixture models with von Mises distributions
    Fan, Wentao
    Bouguila, Nizar
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 94
  • [45] Consensus clustering of gene expression data and its application to gene function prediction
    Xiao, Guanghua
    Pan, Wei
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2007, 16 (03) : 733 - 751
  • [46] Clustering of gene expression data by mixture of PCA models
    Yoshioka, T
    Morioka, R
    Kobayashi, K
    Oba, S
    Ogawsawara, N
    Ishii, S
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 522 - 527
  • [47] Clustering Circular Data via Finite Mixtures of von Mises Distributions and an Application to Data on Wind Directions
    Jammalamadaka, S. Rao
    Vaidyanathan, V. S.
    [J]. SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2024, 86 (01): : 575 - 595
  • [48] Clustering Circular Data via Finite Mixtures of von Mises Distributions and an Application to Data on Wind Directions
    S. Rao Jammalamadaka
    V. S. Vaidyanathan
    [J]. Sankhya A, 2024, 86 : 575 - 595
  • [49] Order-preserving clustering and its application to gene expression data
    Syeda-Mahmood, T
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 637 - 640
  • [50] A recursively partitioned mixture model for clustering time-course gene expression data
    Koestler, Devin C.
    Marsit, Carmen J.
    Christensen, Brock C.
    Kelsey, Karl T.
    Houseman, E. Andres
    [J]. TRANSLATIONAL CANCER RESEARCH, 2014, 3 (03) : 217 - +