A novel discretization algorithm based on multi-scale and information entropy

被引:0
|
作者
Yaling Xun
Qingxia Yin
Jifu Zhang
Haifeng Yang
Xiaohui Cui
机构
[1] Taiyuan University of Science and Technology (TYUST),
来源
Applied Intelligence | 2021年 / 51卷
关键词
Data mining; Discretization; Information entropy; Multi-scale; MDLPC criterion;
D O I
暂无
中图分类号
学科分类号
摘要
Discretization is one of the data preprocessing topics in the field of data mining, and is a critical issue to improve the efficiency and quality of data mining. Multi-scale can reveal the structure and hierarchical characteristics of data objects, the representation of the data in different granularities will be obtained if we make a reasonable hierarchical division for a research object. The multi-scale theory is introduced into the process of data discretization and a data discretization method based on multi-scale and information entropy called MSE is proposed. MSE first conducts scale partition on the domain attribute to obtain candidate cut point set with different granularity. Then, the information entropy is applied to the candidate cut point set, and the candidate cut point with the minimum information entropy is selected and detected in turn to determine the final cut point set using the MDLPC criterion. In such way, MSE avoids the problem that the candidate cut points are limited to only certain limited attribute values caused by considering only the statistical attribute values in the traditional discretization methods, and reduces the number of candidates by controlling the data division hierarchy to an optimal range. Finally, the extensive experiments show that MSE achieves high performance in terms of discretization efficiency and classification accuracy, especially when it is applied to support vector machines, random forest, and decision trees.
引用
收藏
页码:991 / 1009
页数:18
相关论文
共 50 条
  • [21] CSFNet: A novel counting network based on context features and multi-scale information
    Xiong, Liyan
    Li, Zhida
    Huang, Xiaohui
    Wang, Heng
    Multimedia Systems, 2025, 31 (01)
  • [22] An Improved Multi-Scale Entropy Algorithm in Emotion EEG Features Extraction
    Li Xin
    Qi Xiaoying
    Sun Xiaoqi
    Xie Jiali
    Fan Mengdi
    Kang Jiannan
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2017, 7 (02) : 436 - 439
  • [23] Unsupervised Segmentation Algorithm Based on Multi-Scale Feature Fusion and Novel Discriminator
    Han, Zonghuan
    Liu, Mingguo
    Li, Shen
    Chen, Lijia
    Tian, Min
    Lan, Tianxiang
    Liang, Qian
    Computer Engineering and Applications, 2023, 59 (07) : 152 - 162
  • [24] An Intelligent Fault Diagnosis Method of Multi-Scale Deep Feature Fusion Based on Information Entropy
    Zhiwu Shang
    Wanxiang Li
    Maosheng Gao
    Xia Liu
    Yan Yu
    Chinese Journal of Mechanical Engineering, 2021, 34
  • [25] Roller bearing fault diagnosis based on LMD and multi-scale symbolic dynamic information entropy
    Minghong Han
    Yaman Wu
    Yumin Wang
    Wei Liu
    Journal of Mechanical Science and Technology, 2021, 35 : 1993 - 2005
  • [26] An Intelligent Fault Diagnosis Method of Multi-Scale Deep Feature Fusion Based on Information Entropy
    Zhiwu Shang
    Wanxiang Li
    Maosheng Gao
    Xia Liu
    Yan Yu
    Chinese Journal of Mechanical Engineering, 2021, 34 (04) : 132 - 147
  • [27] Roller bearing fault diagnosis based on LMD and multi-scale symbolic dynamic information entropy
    Han, Minghong
    Wu, Yaman
    Wang, Yumin
    Liu, Wei
    JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2021, 35 (05) : 1993 - 2005
  • [28] An Intelligent Fault Diagnosis Method of Multi-Scale Deep Feature Fusion Based on Information Entropy
    Shang, Zhiwu
    Li, Wanxiang
    Gao, Maosheng
    Liu, Xia
    Yu, Yan
    CHINESE JOURNAL OF MECHANICAL ENGINEERING, 2021, 34 (01)
  • [29] A method to enhance information entropy of diesel engine signals based on multi-scale dimension reduction
    Wu C.
    Jia J.
    Jia X.
    Zhang S.
    Zhendong yu Chongji/Journal of Vibration and Shock, 2018, 37 (03): : 180 - 185
  • [30] Optimal Scale Selection and Attribute Reduction of Multi-scale Multiset-Valued Information Systems Based on Entropy
    Wang L.
    Wu W.
    Xie Z.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2023, 36 (06): : 495 - 510