An Improved KNN Algorithm for Text Classification

被引:2
|
作者
Li, Huijuan [1 ]
Jiang, He [1 ]
Wang, Dongyuan [1 ]
Han, Bing [1 ]
机构
[1] Qilu Univ Technol, ShanDong Acad Sci, Jinan, Peoples R China
关键词
KNN; text classification; similarity; coupling;
D O I
10.1109/IMCCC.2018.00225
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Among the many text classification algorithms based on vector space model, the effect of KNN(K-Nearest Neighbor) classifier is outstanding. For KNN classification algorithm, calculating the similarity between documents will directly affect the selections of K neighbors, which greatly affects the classification effect. However, the traditional KNN text classification is too rough to calculate text similarity, ignoring the relations within the document and the relationships between the documents. Therefore, this paper proposes an improved KNN algorithm, which calculates similarity by considering the interaction and coupling relationship between the document internal and the document. Theoretical analysis and experiments show that the improved algorithm can overcome the shortcomings of the previous algorithms and improve the accuracy of the KNN text classification.
引用
收藏
页码:1081 / 1085
页数:5
相关论文
共 50 条
  • [41] Improved Feature Weight Algorithm and Its Application to Text Classification
    Shang, Songtao
    Shi, Minyong
    Shang, Wenqian
    Hong, Zhiguo
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2016, 2016
  • [42] An improved KNN algorithm for boolean sequence
    Wang, Zhen-Hua
    Hou, Zhong-Sheng
    Gao, Ying
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2009, 22 (02): : 330 - 336
  • [43] Text Classification Research Based on Improved SoftMax Regression Algorithm
    She, Xiangyang
    Zhu, Yinglong
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2018, : 273 - 276
  • [44] Application of an Improved Convolutional Neural Network Algorithm in Text Classification
    Peng, Jing
    Huo, Shuquan
    [J]. JOURNAL OF WEB ENGINEERING, 2024, 23 (03): : 315 - 340
  • [45] Research on Feature Selection and kNN Classification Method in Chinese Text Classification
    Xiao Chao
    Wu Ping
    [J]. PROCEEDINGS OF THE 2015 4TH NATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING ( NCEECE 2015), 2016, 47 : 956 - 962
  • [46] Research on Text Classification Based on SVM-KNN
    Lin, Yun
    Wang, Jie
    [J]. 2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 842 - 844
  • [47] Active Transductive KNN for Sparsely Labeled Text Classification
    Xiao, Wang-xin
    Zhang, Xue
    [J]. 6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 2178 - 2182
  • [48] KNN Text Categorization Algorithm Based on Semantic Centre
    Zhang Xiao-fei
    Huang He-yan
    Zhang Ke-liang
    [J]. 2009 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE, VOL 1, PROCEEDINGS, 2009, : 249 - +
  • [49] Text clustering based on kernel KNN clustering algorithm
    Xiong, Hao
    Sun, Sheng
    Feng, Yunfang
    [J]. International Journal of Applied Mathematics and Statistics, 2013, 46 (16): : 69 - 75
  • [50] Text and Image Based Spam Email Classification using KNN, Naive Bayes and Reverse DBSCAN Algorithm
    Harisinghaney, Anirudh
    Dixit, Aman
    Gupta, Saurabh
    Arora, Anuja
    [J]. PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON RELIABILTY, OPTIMIZATION, & INFORMATION TECHNOLOGY (ICROIT 2014), 2014, : 153 - 155