A Semi-parametric Density Estimation with Application in Clustering

被引:0
|
作者
Salehi, Mahdi [1 ,2 ]
Bekker, Andriette [2 ]
Arashi, Mohammad [3 ]
机构
[1] Univ Neyshabur, Dept Math & Stat, Neyshabur, Iran
[2] Univ Pretoria, Dept Stat, Pretoria, South Africa
[3] Ferdowsi Univ Mashhad, Dept Stat, Mashhad, Iran
基金
新加坡国家研究基金会;
关键词
Asymmetric kernels; Boundary bias; Density-based clustering; Density-based Silhouette; Kernel density estimation; Optimum bandwidth; BETA KERNEL GRADUATION; MIXTURES;
D O I
10.1007/s00357-022-09425-9
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The idea behind density-based clustering is to associate groups to the connected components of the level sets of the density of the data to be estimated by a nonparametric method. This approach claims some advantages over both distance- and model-based clustering. Some researchers developed this technique by proposing a graph theory-based method for identifying local modes of the underlying density being estimated by the well-known kernel density estimation (KDE) with normal and t kernels. The present work proposes a semi-parametric KDE with a more flexible family of kernels including skew-normal (SN) and skew-t (ST). We show that the proposed estimator not only reduces boundary bias but it is also closer to the actual density compared to that of the usual estimator employing the Gaussian kernel. Finding optimal bandwidth for one-dimensional and multidimensional cases under the mentioned asymmetric kernels is another main result of this paper where we shrink the bandwidth more than the one obtained under the normal assumption. Finally, through a comprehensive numerical study, we will illustrate the application of the proposed semi-parametric KDE on the density-based clustering using some simulated and real data sets.
引用
收藏
页码:52 / 78
页数:27
相关论文
共 50 条
  • [1] A Semi-parametric Density Estimation with Application in Clustering
    Mahdi Salehi
    Andriette Bekker
    Mohammad Arashi
    [J]. Journal of Classification, 2023, 40 : 52 - 78
  • [2] Density estimation using non-parametric and semi-parametric mixtures
    Wang, Yong
    Chee, Chew-Seng
    [J]. STATISTICAL MODELLING, 2012, 12 (01) : 67 - 92
  • [3] Semi-parametric estimation of shifts
    Gamboa, Fabrice
    Loubes, Jean-Michel
    Maza, Elie
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2007, 1 : 616 - 640
  • [4] VALUE AT RISK ESTIMATION BY COMBINING SEMI-PARAMETRIC DENSITY ESTIMATION WITH HISTORICAL SIMULATION
    Wang, Kaiping
    [J]. ECONOMIC COMPUTATION AND ECONOMIC CYBERNETICS STUDIES AND RESEARCH, 2012, 46 (04): : 163 - 178
  • [5] Semi-parametric estimation for ARCH models
    Alzghool, Raed
    Al-Zubi, Loai M.
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2018, 57 (01) : 367 - 373
  • [6] SEMI-PARAMETRIC ESTIMATION OF INEQUALITY MEASURES
    Kpanzou, T. A.
    de Wet, T.
    Neethling, A.
    [J]. SOUTH AFRICAN STATISTICAL JOURNAL, 2013, 47 (01) : 33 - 48
  • [7] Semi-parametric density estimation for time-series with multiplicative adjustment
    Wang, Kaiping
    Lin, Lu
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2008, 37 (08) : 1274 - 1283
  • [8] Sparse Semi-Parametric Chirp Estimation
    Sward, Johan
    Brynolfsson, Johan
    Jakobsson, Andreas
    Hansson-Sandsten, Maria
    [J]. CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 1236 - 1240
  • [10] Semi-Parametric Models - An Application in Medicine
    Pereira, J. A.
    Pereira, A. L.
    Oliveira, T. A.
    [J]. INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS ICNAAM 2019, 2020, 2293