Nonsmooth Penalized Clustering via lp Regularized Sparse Regression

被引:18
|
作者
Niu, Lingfeng [1 ,2 ]
Zhou, Ruizhi [1 ,2 ]
Tian, Yingjie [1 ,2 ]
Qi, Zhiquan [1 ,2 ]
Zhang, Peng [3 ]
机构
[1] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China
[3] Univ Technol Sydney, Ctr QCIS, Sydney, NSW 2007, Australia
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
l(p)-norm; clustering analysis; nonconvex optimization; nonsmooth optimization; penalized regression; SELECTION; PERFORMANCE; MODEL; INTELLIGENCE; EVOLUTIONARY; METHODOLOGY; ALGORITHMS; SIGNALS; NUMBER; TESTS;
D O I
10.1109/TCYB.2016.2546965
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering has been widely used in data analysis. A majority of existing clustering approaches assume that the number of clusters is given in advance. Recently, a novel clustering framework is proposed which can automatically learn the number of clusters from training data. Based on these works, we propose a nonsmooth penalized clustering model via l(p)( 0 < p < 1) regularized sparse regression. In particular, this model is formulated as a nonsmooth nonconvex optimization, which is based on over-parameterization and utilizes an l(p)norm-based regularization to control the tradeoff between the model fit and the number of clusters. We theoretically prove that the new model can guarantee the sparseness of cluster centers. To increase its practicality for practical use, we adhere to an easy-to-compute criterion and follow a strategy to narrow down the search interval of cross validation. To address the non-smoothness and nonconvexness of the cost function, we propose a simple smoothing trust region algorithm and present its convergent and computational complexity analysis. Numerical studies on both simulated and practical data sets provide support to our theoretical results and demonstrate the advantages of our new method.
引用
收藏
页码:1423 / 1433
页数:11
相关论文
共 50 条
  • [11] ROBUST HEAD POSE ESTIMATION VIA CONVEX REGULARIZED SPARSE REGRESSION
    Ji, Hao
    Liu, Risheng
    Su, Fei
    Su, Zhixun
    Tian, Yan
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [12] Spectrally Sparse Nonparametric Regression via Elastic Net Regularized Smoothers
    Helwig, Nathaniel E.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2021, 30 (01) : 182 - 191
  • [13] Feature selection for probabilistic load forecasting via sparse penalized quantile regression
    Wang, Yi
    Gan, Dahua
    Zhang, Ning
    Xie, Le
    Kang, Chongqing
    JOURNAL OF MODERN POWER SYSTEMS AND CLEAN ENERGY, 2019, 7 (05) : 1200 - 1209
  • [14] Relaxed sparse eigenvalue conditions for sparse estimation via non-convex regularized regression
    Pan, Zheng
    Zhang, Changshui
    PATTERN RECOGNITION, 2015, 48 (01) : 231 - 243
  • [15] Penalized robust estimators in sparse logistic regression
    Bianco, Ana M.
    Boente, Graciela
    Chebi, Gonzalo
    TEST, 2022, 31 (03) : 563 - 594
  • [16] Penalized robust estimators in sparse logistic regression
    Ana M. Bianco
    Graciela Boente
    Gonzalo Chebi
    TEST, 2022, 31 : 563 - 594
  • [17] Sparse Autoregressive Modeling via he Least Absolute LP-Norm Penalized Solution
    Bore, Joyce Chelangat
    Ayedh, Walid Mohammed Ahmed
    Li, Peiyang
    Yao, Dezhong
    Xu, Peng
    IEEE ACCESS, 2019, 7 : 40959 - 40968
  • [18] Sparse brain network using penalized linear regression
    Lee, Hyekyoung
    Lee, Dong Soo
    Kang, Hyejin
    Kim, Boong-Nyun
    Chung, Moo K.
    MEDICAL IMAGING 2011: BIOMEDICAL APPLICATIONS IN MOLECULAR, STRUCTURAL, AND FUNCTIONAL IMAGING, 2011, 7965
  • [19] Penalized Sparse Covariance Regression with High Dimensional Covariates
    Gao, Yuan
    Zhang, Zhiyuan
    Cai, Zhanrui
    Zhu, Xuening
    Zou, Tao
    Wang, Hansheng
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2024,
  • [20] Confidence Intervals for Sparse Penalized Regression With Random Designs
    Yu, Guan
    Yin, Liang
    Lu, Shu
    Liu, Yufeng
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (530) : 794 - 809