Enhancing data analysis: uncertainty-resistance method for handling incomplete data

被引:0
|
作者
Javad Hamidzadeh
Mona Moradi
机构
[1] Sadjad University of Technology,Faculty of Computer Engineering and Information Technology
来源
Applied Intelligence | 2020年 / 50卷
关键词
Incomplete data; Missing values; Belief function theory; Mapped data; Classification;
D O I
暂无
中图分类号
学科分类号
摘要
In data analysis, incomplete data commonly occurs and can have significant effects on the conclusions that can be drawn from the data. Incomplete data cause another problem, so-called uncertainty which leads to producing unreliable results. Hence, developing effective techniques to impute these missing values is crucial. Missing or incomplete data and noise are two common sources of uncertainty. In this paper, an effective method for imputing missing values is introduced which is robust to uncertainties that are arising from incompleteness and noise. A kernel-based method for removing the noise is designed. Using the belief function theory, the class of incomplete data is determined. Finally, every missing dimension is imputed considering the mean value of the same dimension of the members belonging to the determined class. The performance has been evaluated on real-world data sets from UCI repository. The results of the experiments have been compared with state-of-the-art methods, which show the superiority of the proposed method regarding classification accuracy.
引用
收藏
页码:74 / 86
页数:12
相关论文
共 50 条
  • [21] Special issue: Handling incomplete and fuzzy information in data analysis and decision processes
    Dubois, Didier
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2011, 52 (09) : 1229 - 1231
  • [22] Handling missing or incomplete data in a Bayesian network meta-analysis framework
    Azzolina, Danila
    Baldi, Ileana
    Berchialla, Paola
    Minto, Clara
    Gregori, Dario
    TRIALS, 2017, 18
  • [23] A classical method for uncertainty analysis with multidimensional data
    Willink, R
    Hall, BA
    METROLOGIA, 2002, 39 (04) : 361 - 369
  • [24] Practical issues in handling data input and uncertainty in a budget impact analysis
    M. J. C. Nuijten
    T. Mittendorf
    U. Persson
    The European Journal of Health Economics, 2011, 12 : 231 - 241
  • [25] Practical issues in handling data input and uncertainty in a budget impact analysis
    Nuijten, M. J. C.
    Mittendorf, T.
    Persson, U.
    EUROPEAN JOURNAL OF HEALTH ECONOMICS, 2011, 12 (03): : 231 - 241
  • [26] Solving the problem of discriminant analysis under the condition of structural uncertainty on the basis of group method of data handling
    Sarychev, A. P.
    JOURNAL OF AUTOMATION AND INFORMATION SCIENCES, 2008, 40 (06) : 27 - 40
  • [27] Handling incomplete data classification using imputed feature selected bagging (IFBag) method
    Khan, Ahmad Jaffar
    Raza, Basit
    Shahid, Ahmad Raza
    Kumar, Yogan Jaya
    Faheem, Muhammad
    Alquhayz, Hani
    INTELLIGENT DATA ANALYSIS, 2021, 25 (04) : 825 - 846
  • [28] Handling the incomplete data problem using Bayesian networks
    Wang, S.C.
    Lin, S.M.
    Lu, Y.C.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2000, 40 (09): : 65 - 68
  • [29] MacroPARAFAC for handling rowwise and cellwise outliers in incomplete data
    Hubert, Mia
    Hirari, Mehdi
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2024, 251
  • [30] Handling Incomplete Data Using Evolution of Imputation Methods
    Zawistowski, Pawel
    Grzenda, Maciej
    ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, 2009, 5495 : 22 - +