K-Nearest Neighbor Algorithm Optimization in Text Categorization

Cited by: 12
Author
Chen, Shufeng [1]
Affiliation
[1] Univ Sci & Technol China, Res Inst Elect Sci & Technol, Chengdu 611730, Sichuan, Peoples R China
DOI
10.1088/1755-1315/108/5/052074
Chinese Library Classification
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
The K-Nearest Neighbor (KNN) classification algorithm is one of the simplest methods in data mining and has been widely used in classification, regression, and pattern recognition. The traditional KNN method has two main shortcomings: it requires a large number of computations over the samples, and it depends strongly on the size of the sample library. This paper proposes a method for optimizing the set of representative samples based on the CURE algorithm. Building on this, it presents a fast algorithm, QKNN (Quick k-nearest neighbor), for finding the k nearest neighbor samples, which greatly reduces the number of similarity calculations. The experimental results show that the algorithm effectively reduces the number of samples and speeds up the search for the k nearest neighbors, improving overall performance.
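
For context, the cost that the abstract attributes to traditional KNN can be illustrated with a minimal brute-force sketch. This is not the paper's QKNN or its CURE-based sample selection; it is only the baseline, assuming bag-of-words term counts and cosine similarity. The names `vectorize`, `cosine`, and `knn_classify`, and the toy samples, are hypothetical and chosen for illustration.

```python
from collections import Counter
import heapq
import math


def vectorize(text):
    """Turn a text into a bag-of-words term-count vector (assumed representation)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_classify(query, samples, k=3):
    """Brute-force KNN: one similarity computation per stored sample."""
    q = vectorize(query)
    # This full scan is the per-query cost that grows with the sample library.
    scored = [(cosine(q, vectorize(text)), label) for text, label in samples]
    top = heapq.nlargest(k, scored, key=lambda pair: pair[0])
    votes = Counter(label for _, label in top)
    return votes.most_common(1)[0][0]


# Toy labelled sample library (illustrative only).
samples = [
    ("the team won the football match", "sports"),
    ("the striker scored a late goal", "sports"),
    ("the coach praised the team", "sports"),
    ("the bank raised interest rates", "finance"),
    ("stock markets fell sharply today", "finance"),
    ("investors bought bank shares", "finance"),
]

print(knn_classify("the team scored a goal", samples, k=3))
```

Every query compares against every stored sample, so shrinking the sample library to representative samples or pruning the neighbor search, as the abstract describes, directly reduces the number of similarity calculations.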
Pages: 5