Partial instance reduction for noise elimination

被引:13
|
作者
Jamjoom, Mona [1 ]
El Hindi, Khalil [1 ]
机构
[1] King Saud Univ, Dept Comp Sci, Collage Comp & Informat Sci, Riyadh, Saudi Arabia
关键词
Noise filtering; Instance-Based Learning; Instance reduction; Overfitting; Outlier elimination; NEAREST-NEIGHBOR; CLASSIFICATION;
D O I
10.1016/j.patrec.2016.01.021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-world data are usually noisy, causing many machine-learning algorithms to overfit their data. Various Instance Reduction (IR) techniques have been proposed to filter out noisy instances and clean the data. This paper presents Partial Instance Reduction (PIR) or partial outlier elimination techniques. Unlike IR techniques, which eliminate all suspicious instances, PIR techniques partially eliminate a suspicious instance by eliminating some of its attribute values. If this fails to change the status of an instance from outlier to normal, the entire instance is eliminated. The main advantage of partial elimination is that it allows us to retain significant parts of instances, which is particularly useful when the training data is scarce. This paper compares PIR and IR techniques using 50 benchmark data sets, both with and without noise. Our empirical results show that PIR techniques significantly outperform the IR techniques on many benchmark datasets. Whereas IR techniques eliminate a large number of instances that are not outliers, PIR techniques manage to save parts of these instances that are useful for classification. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:30 / 37
页数:8
相关论文
共 50 条
  • [1] NOISE REDUCTION BY PARTIAL ENCLOSURES
    TYZZER, FG
    BISHOP, DE
    HARDY, HC
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1956, 28 (04): : 758 - 758
  • [2] Simulation of Partial Discharges and Implementation of Noise Elimination Techniques
    Rajendran, Arunjothi
    Meena, K. P.
    Burjupati, Nageshwar Rao
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON CONDITION ASSESSMENT TECHNIQUES IN ELECTRICAL SYSTEMS (CATCON), 2017, : 412 - 417
  • [3] Noise Reduction in Regression Tasks with Distance, Instance, Attribute and Density Weighting
    Kordos, Miroslaw
    Rusiecki, Andrzej
    Blachnik, Marcin
    [J]. 2015 IEEE 2ND INTERNATIONAL CONFERENCE ON CYBERNETICS (CYBCONF), 2015, : 73 - 78
  • [4] Noise reduction for instance-based learning with a local maximal margin approach
    Nicola Segata
    Enrico Blanzieri
    Sarah Jane Delany
    Pádraig Cunningham
    [J]. Journal of Intelligent Information Systems, 2010, 35 : 301 - 331
  • [5] Noise reduction for instance-based learning with a local maximal margin approach
    Segata, Nicola
    Blanzieri, Enrico
    Delany, Sarah Jane
    Cunningham, Padraig
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2010, 35 (02) : 301 - 331
  • [6] Multiple instance classification: Bag noise filtering for negative instance noise cleaning
    Luengo, Julian
    Sanchez-Tarrago, Danel
    Prati, Ronaldo C.
    Herrera, Francisco
    [J]. INFORMATION SCIENCES, 2021, 579 : 388 - 400
  • [7] Partial discharge pattern recognition considering noise elimination and voltage phase lag
    Yoon, Sungho
    An, Beom
    Ko, Hyeonsang
    Kim, Jeongtae
    Jung, Yeonha
    Jang, Taein
    [J]. Transactions of the Korean Institute of Electrical Engineers, 2020, 69 (07): : 1024 - 1032
  • [8] PARTIAL ELIMINATION
    JENNINGS, A
    MALIK, GM
    [J]. JOURNAL OF THE INSTITUTE OF MATHEMATICS AND ITS APPLICATIONS, 1977, 20 (03): : 307 - 316
  • [9] Noise Reduction in the Vehicle Transmissions by Partial Bracing by means of Synchronizations
    Heumesser, Benjamin
    Gretzinger, Yvonne
    Bertsche, Bernd
    [J]. KUPPLUNGEN UND KUPPLUNGSSYSTEME IN ANTRIEBEN 2017/ SCHWINGUNGSREDUZIERUNG IN MOBILEN SYSTEMEN 2017, 2017, 2309 : 315 - 326
  • [10] MaMiCo: Parallel Noise Reduction for Multi-instance Molecular-Continuum Flow Simulation
    Jarmatz, Piet
    Neumann, Philipp
    [J]. COMPUTATIONAL SCIENCE - ICCS 2019, PT IV, 2019, 11539 : 451 - 464