Nonsmooth Penalized Clustering via lp Regularized Sparse Regression

被引:18
|
作者
Niu, Lingfeng [1 ,2 ]
Zhou, Ruizhi [1 ,2 ]
Tian, Yingjie [1 ,2 ]
Qi, Zhiquan [1 ,2 ]
Zhang, Peng [3 ]
机构
[1] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China
[3] Univ Technol Sydney, Ctr QCIS, Sydney, NSW 2007, Australia
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
l(p)-norm; clustering analysis; nonconvex optimization; nonsmooth optimization; penalized regression; SELECTION; PERFORMANCE; MODEL; INTELLIGENCE; EVOLUTIONARY; METHODOLOGY; ALGORITHMS; SIGNALS; NUMBER; TESTS;
D O I
10.1109/TCYB.2016.2546965
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering has been widely used in data analysis. A majority of existing clustering approaches assume that the number of clusters is given in advance. Recently, a novel clustering framework is proposed which can automatically learn the number of clusters from training data. Based on these works, we propose a nonsmooth penalized clustering model via l(p)( 0 < p < 1) regularized sparse regression. In particular, this model is formulated as a nonsmooth nonconvex optimization, which is based on over-parameterization and utilizes an l(p)norm-based regularization to control the tradeoff between the model fit and the number of clusters. We theoretically prove that the new model can guarantee the sparseness of cluster centers. To increase its practicality for practical use, we adhere to an easy-to-compute criterion and follow a strategy to narrow down the search interval of cross validation. To address the non-smoothness and nonconvexness of the cost function, we propose a simple smoothing trust region algorithm and present its convergent and computational complexity analysis. Numerical studies on both simulated and practical data sets provide support to our theoretical results and demonstrate the advantages of our new method.
引用
收藏
页码:1423 / 1433
页数:11
相关论文
共 50 条
  • [21] Sparse Poisson regression with penalized weighted score function
    Jia, Jinzhu
    Xie, Fang
    Xu, Lihu
    ELECTRONIC JOURNAL OF STATISTICS, 2019, 13 (02): : 2898 - 2920
  • [22] Graph Regularized Structured Sparse Subspace Clustering
    You, Cong-Zhe
    Wu, Xiao-Jun
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: IMAGE AND VIDEO DATA ENGINEERING, ISCIDE 2015, PT I, 2015, 9242 : 128 - 136
  • [23] Sparse regression with exact clustering
    She, Yiyuan
    ELECTRONIC JOURNAL OF STATISTICS, 2010, 4 : 1055 - 1096
  • [24] Penalized empirical likelihood for the sparse Cox regression model
    Wang, Dongliang
    Wu, Tong Tong
    Zhao, Yichuan
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2019, 201 : 71 - 85
  • [25] Gene and pathway identification with Lp penalized Bayesian logistic regression
    Liu, Zhenqiu
    Gartenhaus, Ronald B.
    Tan, Ming
    Jiang, Feng
    Jiao, Xiaoli
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [26] Sparse Pose Regression via Componentwise Clustering Feature Point Representation
    Liu, Honglin
    Kong, Dehui
    Wang, Shaofan
    Yin, Baocai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (07) : 1233 - 1244
  • [27] On the superiority of PGMs to PDCAs in nonsmooth nonconvex sparse regression
    Nakayama, Shummin
    Gotoh, Jun-ya
    OPTIMIZATION LETTERS, 2021, 15 (08) : 2831 - 2860
  • [28] On the superiority of PGMs to PDCAs in nonsmooth nonconvex sparse regression
    Shummin Nakayama
    Jun-ya Gotoh
    Optimization Letters, 2021, 15 : 2831 - 2860
  • [29] ADMM for High-Dimensional Sparse Penalized Quantile Regression
    Gu, Yuwen
    Fan, Jun
    Kong, Lingchen
    Ma, Shiqian
    Zou, Hui
    TECHNOMETRICS, 2018, 60 (03) : 319 - 331
  • [30] Penalized principal logistic regression for sparse sufficient dimension reduction
    Shin, Seung Jun
    Artemiou, Andreas
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 111 : 48 - 58