Enhancing data analysis: uncertainty-resistance method for handling incomplete data

Cited by: 0
Authors
Javad Hamidzadeh
Mona Moradi
Affiliation
[1] Sadjad University of Technology, Faculty of Computer Engineering and Information Technology
Source
Applied Intelligence | 2020 / Vol. 50
Keywords
Incomplete data; Missing values; Belief function theory; Mapped data; Classification
DOI
Not available
Abstract
In data analysis, incomplete data are common and can significantly affect the conclusions drawn from the data. Incomplete data also introduce uncertainty, which leads to unreliable results. Hence, developing effective techniques to impute missing values is crucial. Missing or incomplete data and noise are two common sources of uncertainty. This paper introduces an effective method for imputing missing values that is robust to the uncertainty arising from incompleteness and noise. First, a kernel-based method is designed to remove noise. Next, the class of each incomplete data item is determined using belief function theory. Finally, every missing dimension is imputed with the mean value of the same dimension over the members of the determined class. Performance is evaluated on real-world data sets from the UCI repository. Comparison with state-of-the-art methods shows the superiority of the proposed method in terms of classification accuracy.
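The final step described in the abstract, filling each missing dimension with the mean of that dimension over the members of the determined class, can be illustrated with a minimal sketch. This is not the authors' implementation: the kernel-based noise removal is omitted, and the belief-function class decision is replaced by a plain nearest-class-mean rule on the observed dimensions. The function names (class_means, assign_class, impute) and the toy data are hypothetical, chosen only for illustration.

```python
import math


def class_means(rows, labels):
    """Per-class mean of each dimension, skipping missing (NaN) entries."""
    sums, counts = {}, {}
    for row, label in zip(rows, labels):
        s = sums.setdefault(label, [0.0] * len(row))
        c = counts.setdefault(label, [0] * len(row))
        for j, value in enumerate(row):
            if not math.isnan(value):
                s[j] += value
                c[j] += 1
    return {
        label: [total / n if n else math.nan
                for total, n in zip(sums[label], counts[label])]
        for label in sums
    }


def assign_class(row, means):
    """ASSUMPTION: stand-in for the belief-function step.

    Picks the class whose mean is closest to the row, measured only on
    dimensions that are observed in the row and defined in the class mean.
    """
    def distance(mean):
        diffs = [(v - m) ** 2 for v, m in zip(row, mean)
                 if not math.isnan(v) and not math.isnan(m)]
        return sum(diffs) / len(diffs) if diffs else math.inf

    return min(means, key=lambda label: distance(means[label]))


def impute(row, means):
    """Fill each missing dimension with the mean of the assigned class."""
    label = assign_class(row, means)
    filled = [means[label][j] if math.isnan(v) else v
              for j, v in enumerate(row)]
    return label, filled


# Toy data, invented for illustration only.
train_rows = [[1.0, 2.0], [1.2, 2.4], [5.0, 8.0], [5.4, 8.4]]
train_labels = ["a", "a", "b", "b"]
means = class_means(train_rows, train_labels)
label, filled = impute([5.1, math.nan], means)
print(label, filled)
```

In this toy run the incomplete sample is closest to class "b" on its observed dimension, so its missing second dimension is filled with the class "b" mean of that dimension. A belief-function classifier, as named in the abstract, would replace assign_class; the imputation rule itself would stay the same.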
Pages: 74 - 86
Number of pages: 12
Related papers
50 in total
  • [1] Enhancing data analysis: uncertainty-resistance method for handling incomplete data
    Hamidzadeh, Javad
    Moradi, Mona
    APPLIED INTELLIGENCE, 2020, 50 (01) : 74 - 86
  • [3] HANDLING OF INCOMPLETE DATA IN CLASSIFICATION
    CHAN, L
    BIOMETRICS, 1972, 28 (04) : 1162 - 1162
  • [4] Handling incomplete smoking history data in survival analysis
    Furukawa, Kyoji
    Preston, Dale L.
    Misumi, Munechika
    Cullings, Harry M.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (02) : 707 - 723
  • [5] Analyzing nuclear reactor simulation data and uncertainty with the group method of data handling
    Radaideh, Majdi, I
    Kozlowski, Tomasz
    NUCLEAR ENGINEERING AND TECHNOLOGY, 2020, 52 (02) : 287 - 295
  • [6] Uncertainty Handling in Geospatial Data
    Doucette, Peter J.
    Motsko, Dennis J.
    Sorenson, Matthew
    White, Devin A.
    GEOSPATIAL INFOFUSION II, 2012, 8396
  • [7] Data Preprocessing Method For The Analysis Of Incomplete Data On Students In Poverty
    Huang, Haiyan
    Wei, Bizhong
    Dai, Jian
    Ke, Wenlong
    2020 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2020), 2020, : 248 - 252
  • [8] Uncertainty handling in remote sensing data analysis for defence application
    Mandal, DP
    Murthy, CA
    Pal, SK
    DEFENCE SCIENCE JOURNAL, 1995, 45 (04) : 303 - 306
  • [9] ESTIMATION WITH INCOMPLETE DATA - IMPROVED COMPUTATIONAL METHOD AND THE ANALYSIS OF NESTED DATA
    HOCKING, RR
    MARX, DL
    COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1979, 8 (12) : 1155 - 1181
  • [10] Handling incomplete categorical data for supervised learning
    Chien, Been-Chian
    Lu, Cheng-Feng
    Hsu, Steen J.
    ADVANCES IN APPLIED ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4031 : 1318 - 1328