Information granule-based classifier: A development of granular imputation of missing data

被引:17
|
作者
Hu, Xingchen [1 ]
Pedrycz, Witold [2 ,3 ]
Wu, Keyu [1 ]
Shen, Yinghua [4 ]
机构
[1] Natl Univ Def Technol, Coll Syst Engn, Changsha 410073, Hunan, Peoples R China
[2] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6R 2V4, Canada
[3] Polish Acad Sci, Syst Res Inst, Warsaw, Poland
[4] Chongqing Univ, Sch Econ & Business Adm, Chongqing 400044, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Granular computing; Classification model; Data imputation; Fuzzy clustering; Principle of justifiable granularity; FUZZY C-MEANS; FEATURE-SELECTION; PERFORMANCE; DESIGN;
D O I
10.1016/j.knosys.2020.106737
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Granular Computing (GrC) is a human-centric way to discover the fundamental structure of data sets. The resulting information granules can be efficiently exploited to organize knowledge and reveal data descriptions, which can play a pivotal role in the classification problems. Furthermore, information granules are abstract collections of data entities and exhibit flexibility and tolerance when it comes to the representation of incomplete data. However, most of the existing methods focused on the data imputation and classification separately. They also require better interpretability. The crux of this study is to develop a novel information granule-based classification method for incomplete data and a way of representing missing entities and regarding them as information granules in a unified framework. The first aspect focuses on revealing the structural backbone of multiple labeled subspaces of data by fuzzy clustering of missing values. It emerges a classifier with interpretable "IF-THEN" rules by the refinement of fuzzy prototypes in a supervised mode to capture the critical relationship of the multi-class incomplete data. The second aspect concerns the construction of some information granules to impute and represent missing values according to the refined prototypes and classification findings. The experimental studies involved synthetic and publicly available datasets in quantifying the advantages of the classification and representation abilities of the proposed methods on incomplete data. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] An interpretable hypersphere information granule-based classifier for numeric data using axiomatic fuzzy set
    Wang H.-S.
    Lu W.
    Granular Computing, 2024, 9 (03)
  • [2] A Hypersphere Information Granule-Based Fuzzy Classifier Embedded With Fuzzy Cognitive Maps for Classification of Imbalanced Data
    Yin, Rui
    Lu, Wei
    Yang, Jianhua
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (01): : 175 - 190
  • [3] Rule-based granular classification: A hypersphere information granule-based method
    Fu, Chen
    Lu, Wei
    Pedrycz, Witold
    Yang, Jianhua
    KNOWLEDGE-BASED SYSTEMS, 2020, 194
  • [4] A design of information granule-based under-sampling method in imbalanced data classification
    Liu, Tianyu
    Zhu, Xiubin
    Pedrycz, Witold
    Li, Zhiwu
    SOFT COMPUTING, 2020, 24 (22) : 17333 - 17347
  • [5] A design of information granule-based under-sampling method in imbalanced data classification
    Tianyu Liu
    Xiubin Zhu
    Witold Pedrycz
    Zhiwu Li
    Soft Computing, 2020, 24 : 17333 - 17347
  • [6] Adaptive pairing of classifier and imputation methods based on the characteristics of missing values in data sets
    Sim, Jaemun
    Kwon, Ohbyung
    Lee, Kun Chang
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 46 : 485 - 493
  • [7] MissII: Missing Information Imputation for Traffic Data
    Hou, Mingliang
    Tang, Tao
    Xia, Feng
    Sultan, Ibrahim
    Kaur, Roopdeep
    Kong, Xiangjie
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2024, 12 (03) : 752 - 765
  • [8] Imputation of missing information in worldwide patent data
    de Rassenfosse, Gaetan
    Seliger, Florian
    DATA IN BRIEF, 2021, 34
  • [9] Polynomial Fuzzy Information Granule-Based Time Series Prediction
    Yang, Xiyang
    Zhang, Shiqing
    Zhang, Xinjun
    Yu, Fusheng
    MATHEMATICS, 2022, 10 (23)
  • [10] Learning a Credal Classifier With Optimized and Adaptive Multiestimation for Missing Data Imputation
    Zhang, Zuo-Wei
    Tian, Hong-Peng
    Yan, Ling-Zhi
    Martin, Arnaud
    Zhou, Kuang
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (07): : 4092 - 4104