kNN Classification with an Outlier Informative Distance Measure

Cited by: 6
Authors
Bhattacharya, Gautam [1]
Ghosh, Koushik [2]
Chowdhury, Ananda S. [3]
Affiliations
[1] Univ Burdwan, Univ Inst Technol, Dept Phys, Bardhaman, India
[2] Univ Burdwan, Univ Inst Technol, Dept Math, Bardhaman, India
[3] Jadavpur Univ, Dept Elect & Telecommun Engn, Kolkata, India
Keywords
Outliers; Distance measure; kNN classification accuracy
DOI
10.1007/978-3-319-69900-4_3
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Classification accuracy of the kNN algorithm is found to be adversely affected by the presence of outliers in the experimental datasets. An outlier score based on rank difference can be assigned to the points in these datasets by taking into consideration the distance and density of their local neighborhood points. In the present work, we introduce a generalized outlier informative distance measure where a factor based on the above score is used to modulate any potential distance function. Properties of the new outlier informative distance measure are presented. Experiments on several numeric datasets in the UCI machine learning repository clearly reveal the effectiveness of the proposed formulation.
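The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' formulation: the record does not give their score or modulation formula, so everything below is an assumption. The hypothetical `rank_difference_scores` rates a point by how much farther down its neighbours rank it than it ranks them, and the hypothetical `knn_predict` multiplies a Euclidean base distance by `1 + alpha * score`, with `alpha` an assumed tuning parameter.

```python
import numpy as np

def rank_difference_scores(X, k):
    """Assumed outlier score: mean positive rank difference over k neighbours."""
    n = len(X)
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    np.fill_diagonal(D, -1.0)  # force every point to rank 0 in its own list
    order = np.argsort(D, axis=1, kind="stable")
    ranks = np.empty_like(order)
    # ranks[j, i] = position of point i in point j's sorted neighbour list
    ranks[np.arange(n)[:, None], order] = np.arange(n)[None, :]
    scores = np.zeros(n)
    for i in range(n):
        nbrs = order[i, 1:k + 1]
        # How much worse the neighbours rank i than i ranks them
        diff = ranks[nbrs, i] - ranks[i, nbrs]
        scores[i] = np.clip(diff, 0, None).mean()
    return scores

def knn_predict(X_train, y_train, scores, query, k, alpha=1.0):
    """kNN vote using a base distance inflated by each training point's score."""
    base = np.linalg.norm(X_train - query, axis=1)  # Euclidean base distance
    modulated = base * (1.0 + alpha * scores)       # assumed modulation factor
    nearest = np.argsort(modulated, kind="stable")[:k]
    return np.bincount(y_train[nearest]).argmax()

# Two tight clusters plus one class-1 point stranded next to the class-0 cluster
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1],
              [10, 10], [10, 11], [11, 10], [11, 11],
              [3, 3]], dtype=float)
y = np.array([0, 0, 0, 0, 1, 1, 1, 1, 1])
query = np.array([2.2, 2.2])

scores = rank_difference_scores(X, k=3)
plain = knn_predict(X, y, np.zeros(len(X)), query, k=1)  # zero scores: ordinary kNN
robust = knn_predict(X, y, scores, query, k=1)
```

With zero scores the function reduces to ordinary kNN, so the two predictions can be compared directly. In this toy data the stranded point receives the largest score, and inflating its distance keeps it from deciding the vote for the nearby query.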
Pages: 21-27
Number of pages: 7
Related papers
50 in total
  • [1] Reachable Distance Function for KNN Classification
    Zhang, Shichao
    Li, Jiaye
    Li, Yangding
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (07) : 7382 - 7396
  • [2] Impact of the Sakoe-Chiba Band on the DTW Time-Series Distance Measure for kNN Classification
    Geler, Zoltan
    Kurbalija, Vladimir
    Radovanovic, Milos
    Ivanovic, Mirjana
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2014, 2014, 8793 : 105 - 114
  • [3] Fast kNN classification algorithm based on partial distance search
    Hwang, WJ
    Wen, KW
    ELECTRONICS LETTERS, 1998, 34 (21) : 2062 - 2063
  • [4] Fast kNN classification algorithm based on partial distance search
    Chung Yuan Christian Univ, Chungli, Taiwan
    Electron Lett, 21 (2062-2063):
  • [5] A Hierarchical Tree Distance Measure for Classification
    Caspersen, Kent Munthe
    Madsen, Martin Bjeldbak
    Eriksen, Andreas Berre
    Thiesson, Bo
    ICPRAM: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2017, : 502 - 509
  • [7] Locally centred Mahalanobis distance: A new distance measure with salient features towards outlier detection
    Todeschini, Roberto
    Ballabio, Davide
    Consonni, Viviana
    Sahigara, Faizan
    Filzmoser, Peter
    ANALYTICA CHIMICA ACTA, 2013, 787 : 1 - 9
  • [8] Outlier detection and data filling based on KNN and LOF for power transformer operation data classification
    Zou, Dexu
    Xiang, Yongjian
    Zhou, Tao
    Peng, Qingjun
    Dai, Weiju
    Hong, Zhihu
    Shi, Yong
    Wang, Shan
    Yin, Jianhua
    Quan, Hao
    ENERGY REPORTS, 2023, 9 : 698 - 711
  • [9] THE OPTIMAL DISTANCE MEASURE FOR NEAREST NEIGHBOR CLASSIFICATION
    SHORT, RD
    FUKUNAGA, K
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1981, 27 (05) : 622 - 627
  • [10] A TEXTURE-BASED DISTANCE MEASURE FOR CLASSIFICATION
    SHEN, HC
    BIE, CYC
    CHIU, DKY
    PATTERN RECOGNITION, 1993, 26 (09) : 1429 - 1437