NearCount: Selecting critical instances based on the cited counts of nearest neighbors

被引:12
|
作者
Zhu, Zonghai [1 ,2 ]
Wang, Zhe [1 ,2 ]
Li, Dongdong [2 ]
Du, Wenli [1 ]
机构
[1] East China Univ Sci & Technol, Minist Educ, Key Lab Adv Control & Optimizat Chem Proc, Shanghai 200237, Peoples R China
[2] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai 200237, Peoples R China
基金
美国国家科学基金会;
关键词
Critical instance; Nearest neighbor; Cited counts; Imbalanced problem; Instance selection; DATA REDUCTION; CLASSIFICATION; SMOTE;
D O I
10.1016/j.knosys.2019.105196
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional instance selection algorithms are not good at addressing imbalanced problems. Moreover, most of them are sensitive to noise instances and suffer from complex selection rules. To solve these problems, in this paper, we propose a concise learning framework named NearCount to determine the importance of the instance without editing noise. In NearCount, the importance of an instance corresponds to the cited counts. The count is determined by the number of times that one instance is selected as a nearest neighbor of instances in different classes. For the instances with nonzero cited counts, the importance of the instance is inversely proportional to the cited count. To handle classification problems with different data distributions, two detailed NearCount-based algorithms - NearCount-IM and NearCount-IS - are introduced. For imbalanced problems, NearCount-IM selects the important majority instances with an equal number of minority instances, thus balancing the data distribution. For balanced scenarios, NearCount-IS selects the instances whose cited counts are greater than zero and equal or less than the number of nearest neighbors as critical instances in every class. The proposed NearCount-IM and NearCount-IS algorithms are evaluated by comparing them with classical instance selection algorithms on the benchmark data sets. Experiments validate the effectiveness of the proposed algorithms. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Predicting the Trajectory of a Flying Body Based on Weighted Nearest Neighbors
    Mironov, Konstantin
    Pongratz, Martin
    Dietrich, Dietmar
    INTERNATIONAL WORK-CONFERENCE ON TIME SERIES (ITISE 2014), 2014, : 699 - 710
  • [22] A Novel GEP-Based Cluster Algorithm for Nearest Neighbors
    Cai, Hongguo
    Yuan, Chang-an
    EMERGING RESEARCH IN ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, 2012, 315 : 243 - +
  • [23] Thread-based implementations of the false nearest neighbors method
    Marin Carrion, I.
    Arias Antunez, E.
    Artigao Castillo, M. M.
    Aguila Guerrero, J. J.
    Miralles Canals, J. J.
    PARALLEL COMPUTING, 2009, 35 (10-11) : 523 - 534
  • [24] All-nearest-neighbors finding based on the Hilbert curve
    Chen, Hue-Ling
    Chang, Ye-In
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (06) : 7462 - 7475
  • [25] Some Multivariate Tests of Independence Based on Ranks of Nearest Neighbors
    Sarkar, Soham
    Ghosh, Anil K.
    TECHNOMETRICS, 2018, 60 (01) : 101 - 111
  • [26] Chameleon algorithm based on mutual k-nearest neighbors
    Yuru Zhang
    Shifei Ding
    Lijuan Wang
    Yanru Wang
    Ling Ding
    Applied Intelligence, 2021, 51 : 2031 - 2044
  • [27] Ensemble k-nearest neighbors based on centroid displacement
    Wang, Alex X.
    Chukova, Stefanka S.
    Nguyen, Binh P.
    INFORMATION SCIENCES, 2023, 629 : 313 - 323
  • [28] A Novel Image Clustering Algorithm Based on Supported Nearest Neighbors
    Li, Lin
    Zhang, Feng
    Zhang, Jiashuai
    Hua, Qiang
    Dong, Chun-Ru
    Lim, Chee-Peng
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2023,
  • [29] An Improved Clustering Algorithm Based on Density Peak and Nearest Neighbors
    Zhao, Chao
    Yang, Junchuang
    Wen, Kexin
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [30] Nearest neighbors based density peaks approach to intrusion detection
    Li, Lixiang
    Zhang, Hao
    Peng, Haipeng
    Yang, Yixian
    CHAOS SOLITONS & FRACTALS, 2018, 110 : 33 - 40