A novel discretization algorithm based on multi-scale and information entropy

被引:0
|
作者
Yaling Xun
Qingxia Yin
Jifu Zhang
Haifeng Yang
Xiaohui Cui
机构
[1] Taiyuan University of Science and Technology (TYUST),
来源
Applied Intelligence | 2021年 / 51卷
关键词
Data mining; Discretization; Information entropy; Multi-scale; MDLPC criterion;
D O I
暂无
中图分类号
学科分类号
摘要
Discretization is one of the data preprocessing topics in the field of data mining, and is a critical issue to improve the efficiency and quality of data mining. Multi-scale can reveal the structure and hierarchical characteristics of data objects, the representation of the data in different granularities will be obtained if we make a reasonable hierarchical division for a research object. The multi-scale theory is introduced into the process of data discretization and a data discretization method based on multi-scale and information entropy called MSE is proposed. MSE first conducts scale partition on the domain attribute to obtain candidate cut point set with different granularity. Then, the information entropy is applied to the candidate cut point set, and the candidate cut point with the minimum information entropy is selected and detected in turn to determine the final cut point set using the MDLPC criterion. In such way, MSE avoids the problem that the candidate cut points are limited to only certain limited attribute values caused by considering only the statistical attribute values in the traditional discretization methods, and reduces the number of candidates by controlling the data division hierarchy to an optimal range. Finally, the extensive experiments show that MSE achieves high performance in terms of discretization efficiency and classification accuracy, especially when it is applied to support vector machines, random forest, and decision trees.
引用
收藏
页码:991 / 1009
页数:18
相关论文
共 50 条
  • [41] Image Deraining Algorithm Based on Multi-Scale Features
    Yang, Jingkai
    Wang, Jingyuan
    Li, Yanbo
    Yao, Bobin
    Xu, Tangwen
    Lu, Ting
    Gao, Xiaoxuan
    Chen, Junshuo
    Liu, Weiyu
    APPLIED SCIENCES-BASEL, 2024, 14 (13):
  • [42] Arc Fault Detection Algorithm Based on Variational Mode Decomposition and Improved Multi-Scale Fuzzy Entropy
    Wang, Lina
    Qiu, Hongcheng
    Yang, Pu
    Mu, Longhua
    ENERGIES, 2021, 14 (14)
  • [43] A Novel Algorithm of Edge Location Based on Omni-directional and Multi-scale MM
    Zhou, Hang
    Peng, Dan
    Wang, Xin
    Wang, Hongyi
    JOURNAL OF COMPUTERS, 2014, 9 (04) : 990 - 997
  • [44] A multi-scale research algorithm based on lift wavelet
    Chen Xim
    Dou Li-hua
    Zhang Juan
    2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 5171 - 5174
  • [45] A Multi-Scale Gradient Algorithm Based on Morphological Operators
    LU Guan-ming (Department of Information Engineering
    The Journal of China Universities of Posts and Telecommunications, 2000, (Z1) : 56 - 59
  • [46] An image matching algorithm based on multi-scale space
    Yang, Zhao-Hui
    Chen, Ying
    Shao, Yong-She
    Zhang, Shao-Ming
    Guangdianzi Jiguang/Journal of Optoelectronics Laser, 2009, 20 (06): : 826 - 829
  • [47] A fast BEMD algorithm based on multi-scale extrema
    Yang, D.
    Wang, X. T.
    Xu, G. L.
    INFORMATION TECHNOLOGY, 2015, : 29 - 32
  • [48] Watermarking algorithm based on multi-scale error diffusion
    Wan, Xiaoxia
    Wu, Hanying
    Gan, Chaohua
    Geomatics and Information Science of Wuhan University, 2007, 32 (11) : 1056 - 1059
  • [49] Novel infrared dim and small target detection algorithm based on multi-scale gradient
    Wan M.
    Zhang F.
    Hu S.
    Guangxue Xuebao/Acta Optica Sinica, 2011, 31 (10): : 1011001 - 1
  • [50] Unsupervised Dehazing Algorithm Based on Multi-Scale Features
    Sun Xiangsheng
    Wang Guozhong
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (16)