Nonsmooth Penalized Clustering via lp Regularized Sparse Regression

被引:18
|
作者
Niu, Lingfeng [1 ,2 ]
Zhou, Ruizhi [1 ,2 ]
Tian, Yingjie [1 ,2 ]
Qi, Zhiquan [1 ,2 ]
Zhang, Peng [3 ]
机构
[1] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China
[3] Univ Technol Sydney, Ctr QCIS, Sydney, NSW 2007, Australia
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
l(p)-norm; clustering analysis; nonconvex optimization; nonsmooth optimization; penalized regression; SELECTION; PERFORMANCE; MODEL; INTELLIGENCE; EVOLUTIONARY; METHODOLOGY; ALGORITHMS; SIGNALS; NUMBER; TESTS;
D O I
10.1109/TCYB.2016.2546965
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering has been widely used in data analysis. A majority of existing clustering approaches assume that the number of clusters is given in advance. Recently, a novel clustering framework is proposed which can automatically learn the number of clusters from training data. Based on these works, we propose a nonsmooth penalized clustering model via l(p)( 0 < p < 1) regularized sparse regression. In particular, this model is formulated as a nonsmooth nonconvex optimization, which is based on over-parameterization and utilizes an l(p)norm-based regularization to control the tradeoff between the model fit and the number of clusters. We theoretically prove that the new model can guarantee the sparseness of cluster centers. To increase its practicality for practical use, we adhere to an easy-to-compute criterion and follow a strategy to narrow down the search interval of cross validation. To address the non-smoothness and nonconvexness of the cost function, we propose a simple smoothing trust region algorithm and present its convergent and computational complexity analysis. Numerical studies on both simulated and practical data sets provide support to our theoretical results and demonstrate the advantages of our new method.
引用
收藏
页码:1423 / 1433
页数:11
相关论文
共 50 条
  • [31] Locality Constrained-lp Sparse Subspace Clustering for Image Clustering
    Cheng, Wenlong
    Chow, Tommy W. S.
    Zhao, Mingbo
    NEUROCOMPUTING, 2016, 205 : 22 - 31
  • [32] Sparse nonparametric regression with regularized tensor product kernel
    Yu, Hang
    Wang, Yuanjia
    Zeng, Donglin
    STAT, 2020, 9 (01):
  • [33] Hyperspectral image clustering via sparse dictionary-based anchored regression
    Huang, Nan
    Xiao, Liang
    IET IMAGE PROCESSING, 2019, 13 (02) : 261 - 269
  • [34] Hierarchical Regression for Single Image Super Resolution via Clustering and Sparse Representation
    Qiu, Kang
    Yi, Benshun
    Li, Weizhong
    Huang, Taiqi
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (05): : 2539 - 2554
  • [35] Penalized regression via the restricted bridge estimator
    Bahadır Yüzbaşı
    Mohammad Arashi
    Fikri Akdeniz
    Soft Computing, 2021, 25 : 8401 - 8416
  • [36] Penalized regression via the restricted bridge estimator
    Yuzbasi, Bahadir
    Arashi, Mohammad
    Akdeniz, Fikri
    SOFT COMPUTING, 2021, 25 (13) : 8401 - 8416
  • [37] Autocovariance Function Estimation via Penalized Regression
    Liao, Lina
    Park, Cheolwoo
    Hannig, Jan
    Kang, Kee-Hoon
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2016, 25 (04) : 1041 - 1056
  • [38] Model Selection Via Penalized Logistic Regression
    Ayers, Kristin L.
    Cordell, Heather J.
    GENETIC EPIDEMIOLOGY, 2009, 33 (08) : 770 - 770
  • [39] Relative Error Prediction via Penalized Regression
    Jeong, Seok-Oh
    Lee, Seo-Eun
    Shin, Key-Il
    KOREAN JOURNAL OF APPLIED STATISTICS, 2015, 28 (06) : 1103 - 1111
  • [40] Regularized boxplot via convex clustering
    Choi, Hosik
    Poythress, J. C.
    Park, Cheolwoo
    Jeon, Jong-June
    Park, Changyi
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2019, 89 (07) : 1227 - 1247