An Improved KNN Algorithm for Text Classification

被引:2
|
作者
Li, Huijuan [1 ]
Jiang, He [1 ]
Wang, Dongyuan [1 ]
Han, Bing [1 ]
机构
[1] Qilu Univ Technol, ShanDong Acad Sci, Jinan, Peoples R China
关键词
KNN; text classification; similarity; coupling;
D O I
10.1109/IMCCC.2018.00225
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Among the many text classification algorithms based on vector space model, the effect of KNN(K-Nearest Neighbor) classifier is outstanding. For KNN classification algorithm, calculating the similarity between documents will directly affect the selections of K neighbors, which greatly affects the classification effect. However, the traditional KNN text classification is too rough to calculate text similarity, ignoring the relations within the document and the relationships between the documents. Therefore, this paper proposes an improved KNN algorithm, which calculates similarity by considering the interaction and coupling relationship between the document internal and the document. Theoretical analysis and experiments show that the improved algorithm can overcome the shortcomings of the previous algorithms and improve the accuracy of the KNN text classification.
引用
收藏
页码:1081 / 1085
页数:5
相关论文
共 50 条
  • [1] An Improved KNN Algorithm in Text Classification
    Wang, Xiaoni
    Zhang, Zhenjiang
    Cao, Wei
    [J]. PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND COMPUTER APPLICATIONS (ICSA 2013), 2013, 92 : 263 - 268
  • [2] An Improved KNN Text Classification Algorithm based on Simhash
    Liu, Jie
    Jin, Ting
    Pan, Kejia
    Yang, Yi
    Wu, Yan
    Wang, Xin
    [J]. 2017 IEEE 16TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2017, : 92 - 95
  • [3] Improved KNN Text Classification Algorithm with MapReduce Implementation
    Zhao, Yan
    Qian, Yun
    Li, Cuixia
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 1417 - 1422
  • [4] An Improved KNN Text Classification Algorithm Based on Clustering
    Zhou Yong
    Li Youwen
    Xia Shixiong
    [J]. JOURNAL OF COMPUTERS, 2009, 4 (03) : 230 - 237
  • [5] AN IMPROVED KNN TEXT CLASSIFICATION ALGORITHM BASED ON DENSITY
    Shi, Kansheng
    Li, Lemin
    Liu, Haitao
    He, Jie
    Zhang, Naitong
    Song, Wentao
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS, 2011, : 113 - 117
  • [6] An improved kNN text classification method
    Wang, Fengfei
    Liu, Zhen
    Wang, Chundong
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2019, 20 (03) : 397 - 403
  • [7] An improved web text classification algorithm based on SVM-KNN
    Cao, Jianfang
    Chen, Junjie
    [J]. ADVANCES IN MECHATRONICS AND CONTROL ENGINEERING, PTS 1-3, 2013, 278-280 : 1305 - 1308
  • [8] A Clustering-Based KNN Improved Algorithm CLKNN for Text Classification
    Zhou, Lijuan
    Wang, Linshuang
    Ge, Xuebin
    Shi, Qian
    [J]. 2010 2ND INTERNATIONAL ASIA CONFERENCE ON INFORMATICS IN CONTROL, AUTOMATION AND ROBOTICS (CAR 2010), VOL 3, 2010, : 212 - 215
  • [9] An Improved Weighted KNN Algorithm About Text Classification Based on Spark Framework
    Yang, Tianming
    Du, Shaobo
    [J]. 2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 655 - 661
  • [10] A non-VSM kNN algorithm for text classification
    Deng, ZH
    Tang, SW
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 339 - 346