A multiple kernel-based kernel density estimator for multimodal probability density functions

被引:5
|
作者
Chen, Jia-Qi [1 ,2 ]
He, Yu -Lin [1 ,2 ]
Cheng, Ying-Chao [2 ]
Fournier-Viger, Philippe [1 ]
Huang, Joshua Zhexue [1 ,2 ]
机构
[1] Shenzhen Univ, Big Data Inst, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
关键词
Multimodal probability density function; Kernel density estimator; Heuristic k-nearest neighbor strategy; Kernel function; Kernel bandwidth; MODIFIED CROSS-VALIDATION; MAXIMUM-ENTROPY METHOD; BAYESIAN CLASSIFIERS; BANDWIDTH SELECTION; SMOOTHING PARAMETER; BOOTSTRAP CHOICE; ALGORITHMS; MOMENT;
D O I
10.1016/j.engappai.2024.107979
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of the single kernel -based kernel density estimator (SK-KDE) in fitting a unimodal probability density function (PDF) depends on the choice of kernel function and the corresponding selection of kernel bandwidth. Unlike unimodal PDFs, a multimodal PDF has several distinct features. First, it has multiple local maxima. Second, it is composed of various unimodal PDFs. Each of these unimodal PDFs corresponds to a different collection of random variables. Importantly, these variables are not independent and identically distributed. Because of the difficulty in quantifying multimodality among different modes, it is extremely difficult to select an appropriate kernel function and optimal kernel bandwidth for the multimodal PDF. Multimodal PDFs are frequently encountered in real -world applications. To address this, this paper proposes a novel multiple kernel -based kernel density estimator (MK-KDE). It constructs a flexible KDE by using the weighted average of multiple kernels with consideration of their kernel efficiencies. By integrating multiple kernels, MK-KDE leverages their complementary strengths to enhance the estimation of complex and multimodal PDFs. To achieve this, a novel efficient objective function is designed to obtain the optimized kernel weights and kernel bandwidths by minimizing both the global estimation error of MK-KDE and the local estimation errors of SK-KDEs. Moreover, a sophisticated k -nearest neighbor strategy is devised as a heuristic method to determine the unknown PDF values of given data points, thereby optimizing the aforementioned objective function. A series of extensive experiments was conducted to validate the feasibility, rationality, and effectiveness of MK-KDE for 10 multimodal PDFs. The experimental results show that (1) the kernel weights and bandwidths of MK-KDE converge as the iteration number of the optimization algorithm increases; (2) MK-KDE can fit multimodal PDFs by automatically selecting the kernel functions and bandwidths; and (3) MK-KDE obtains lower estimation errors on 10 multimodal PDFs in comparison to 10 existing PDF estimation methods, demonstrating that MK-KDE is a viable approach to estimate multimodal PDFs.
引用
收藏
页数:16
相关论文
共 50 条