kNN Classification with an Outlier Informative Distance Measure

被引:6
|
作者
Bhattacharya, Gautam [1 ]
Ghosh, Koushik [2 ]
Chowdhury, Ananda S. [3 ]
机构
[1] Univ Burdwan, Univ Inst Technol, Dept Phys, Bardhaman, India
[2] Univ Burdwan, Univ Inst Technol, Dept Math, Bardhaman, India
[3] Jadavpur Univ, Dept Elect & Telecommun Engn, Kolkata, India
关键词
Outliers; Distance measure; kNN classification accuracy;
D O I
10.1007/978-3-319-69900-4_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification accuracy of the kNN algorithm is found to be adversely affected by the presence of outliers in the experimental datasets. An outlier score based on rank difference can be assigned to the points in these datasets by taking into consideration the distance and density of their local neighborhood points. In the present work, we introduce a generalized outlier informative distance measure where a factor based on the above score is used to modulate any potential distance function. Properties of the new outlier informative distance measure are presented. Experiments on several numeric datasets in the UCI machine learning repository clearly reveal the effectiveness of the proposed formulation.
引用
收藏
页码:21 / 27
页数:7
相关论文
共 50 条
  • [21] The spatial outlier mining algorithm based on the KNN graph
    Cao, Lijun
    Liu, Xiyin
    Wang, Zhi Ping
    Zhang, Zhongping
    Journal of Software, 2013, 8 (12) : 3158 - 3165
  • [22] KNN Based Outlier Detection Algorithm in Large Dataset
    Yang, Peng
    Huang, Biao
    2008 INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND TRAINING AND 2008 INTERNATIONAL WORKSHOP ON GEOSCIENCE AND REMOTE SENSING, VOL 1, PROCEEDINGS, 2009, : 611 - 613
  • [23] Evaluation of different text representation techniques and distance metrics using KNN for documents classification
    Alexander Calvo-Valverde, Luis
    Andres Mena-Arias, Jose
    TECNOLOGIA EN MARCHA, 2020, 33 (01): : 64 - 79
  • [24] kNN Classification: a review
    Syriopoulos, Panos K.
    Kalampalikis, Nektarios G.
    Kotsiantis, Sotiris B.
    Vrahatis, Michael N.
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2023,
  • [25] On the reliability of kNN classification
    Tsypin, Maxim
    Roder, Heinrich
    WCECS 2007: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, 2007, : 732 - 734
  • [26] Challenges in KNN Classification
    Zhang, Shichao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (10) : 4663 - 4675
  • [27] Outlier Detection Based on Reversed K-Nearest Neighborhood MST of Relative Distance Measure
    Yang X.-L.
    Feng S.
    Yuan Z.
    Feng, Shan (fengshanrq@sohu.com), 2020, Chinese Institute of Electronics (48): : 937 - 945
  • [28] Classification via k-Means Clustering and Distance-Based Outlier Detection
    Songma, Surasit
    Chimphlee, Witcha
    Maichalernnukul, Kiattisak
    Sanguansat, Parinya
    2012 TENTH INTERNATIONAL CONFERENCE ON ICT AND KNOWLEDGE ENGINEERING, 2012, : 125 - 128
  • [29] Interval Entropy and Informative Distance
    Misagh, Fakhroddin
    Yari, Gholamhossein
    ENTROPY, 2012, 14 (03) : 480 - 490
  • [30] BHATTACHARYYA DISTANCE BASED EMOTIONAL DISSIMILARITY MEASURE FOR EMOTION CLASSIFICATION
    Tin Lay Nwe
    Nguyen Trung Hieu
    Limbu, Dilip Kumar
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7512 - 7516