A consistent procedure for determining the number of clusters in regression clustering

被引:18
|
作者
Shao, Q [1 ]
Wu, Y [1 ]
机构
[1] York Univ, Dept Math & Stat, N York, ON M3J 1P3, Canada
关键词
clustering; multiple regression; model selection; consistency;
D O I
10.1016/j.jspi.2004.04.021
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper, an information-based criterion for determining the number of clusters in the problem of regression clustering is proposed. It is shown that, under a probabilistically structured population, the proposed criterion selects the true number of regression hyperplanes with probability one among all class-growing sequences of classifications, when the number of observations n from the population increases to infinity. Results from a simulation study are also presented. (c) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:461 / 476
页数:16
相关论文
共 50 条
  • [11] Method for Determining the Optimal Number of Clusters Based on Agglomerative Hierarchical Clustering
    Zhou, Shibing
    Xu, Zhenyuan
    Liu, Fei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (12) : 3007 - 3017
  • [12] A density-peak-based clustering algorithm of automatically determining the number of clusters
    Tong, Wuning
    Liu, Sen
    Gao, Xiao-Zhi
    NEUROCOMPUTING, 2021, 458 : 655 - 666
  • [13] Fuzzy C-means clustering algorithm for automatically determining the number of clusters
    Wang, Zhihe
    Wang, Shuyan
    Du, Hui
    Guo, Hao
    2020 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2020), 2020, : 223 - 227
  • [14] Determining number of clusters and prototype locations via multi-scale clustering
    Nakamura, E
    Kehtarnavaz, N
    PATTERN RECOGNITION LETTERS, 1998, 19 (14) : 1265 - 1283
  • [15] Analysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters
    Lukashin, AV
    Fuchs, R
    BIOINFORMATICS, 2001, 17 (05) : 405 - 414
  • [16] A Morphology Method for Determining the Number of Clusters Present in Spectral Co-clustering Documents and Words
    Liu, Na
    Lu, Mingyu
    COMPUTATIONAL GEOMETRY, GRAPHS AND APPLICATIONS, 2011, 7033 : 130 - +
  • [17] Model Selection Strategies for Determining the Optimal Number of Overlapping Clusters in Additive Overlapping Partitional Clustering
    Julian Rossbroich
    Jeffrey Durieux
    Tom F. Wilderjans
    Journal of Classification, 2022, 39 : 264 - 301
  • [18] Model Selection Strategies for Determining the Optimal Number of Overlapping Clusters in Additive Overlapping Partitional Clustering
    Rossbroich, Julian
    Durieux, Jeffrey
    Wilderjans, Tom F.
    JOURNAL OF CLASSIFICATION, 2022, 39 (02) : 264 - 301
  • [19] Correlation Clustering with a Fixed Number of Clusters
    Giotis, Ioannis
    Guruswami, Venkatesan
    PROCEEDINGS OF THE SEVENTHEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2006, : 1167 - 1176
  • [20] Correlation Clustering with a Fixed Number of Clusters
    Giotis, Ioannis
    Guruswami, Venkatesan
    Theory of Computing, 2006, 2 : 249 - 266