A fast KNN algorithm for text categorization

被引:0
|
作者
Wang, Yu [1 ]
Wang, Zheng-Ou [2 ]
机构
[1] Hebei Univ, Sch Math & Comp Sci, Baoding 071002, Peoples R China
[2] Tianjin Univ, Inst Syst Engn, Tianjin 300072, Peoples R China
关键词
KNN; text categorization; similarity; SSR-tree;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The KNN algorithm applied to text categorization is a simple, valid and non-parameter method. The traditional KNN has a fatal defect that the time of similarity computing is huge. The practicality will be lost when the KNN algorithm Is applied to text categorization with the high dimension and huge samples. In this paper, a method called TFKNN(Tree-Fast-K-Nearest-Neighbor) is presented, which can search the exact k nearest neighbors quickly. In the method, a SSR tree for searching K nearest neighbors is created, in which all child nodes of each non-leaf node are ranked according to the distances between their central points and the central point of their parent. Then the searching scope is reduced based on the tree. Subsequently, the time of similarity computing is decreased largely.
引用
收藏
页码:3436 / +
页数:2
相关论文
共 50 条
  • [1] Using KNN Algorithm for Text Categorization
    Wajeed, M. A.
    Adilakshmi, T.
    [J]. COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 796 - +
  • [2] A KNN BASED ALGORITHM FOR TEXT CATEGORIZATION
    Bucar, Joze
    Povh, Janez
    [J]. SOR'13 PROCEEDINGS: THE 12TH INTERNATIONAL SYMPOSIUM ON OPERATIONAL RESEARCH IN SLOVENIA, 2013, : 367 - 372
  • [3] A simple KNN algorithm for text categorization
    Soucy, P
    Mineau, GW
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 647 - 648
  • [4] KNN Text Categorization Algorithm Based on Semantic Centre
    Zhang Xiao-fei
    Huang He-yan
    Zhang Ke-liang
    [J]. 2009 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE, VOL 1, PROCEEDINGS, 2009, : 249 - +
  • [5] The Research of kNN Text Categorization Algorithm Based On Eager Learning
    Dong, Tao
    Cheng, Weinan
    Shang, Wenqian
    [J]. 2012 INTERNATIONAL CONFERENCE ON INDUSTRIAL CONTROL AND ELECTRONICS ENGINEERING (ICICEE), 2012, : 1120 - 1123
  • [6] Graph based KNN for Text Categorization
    Jo, Taeho
    [J]. 2018 20TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2018, : 260 - 265
  • [7] String Vector based KNN for Text Categorization
    Jo, Taeho
    [J]. 2017 19TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATIONS TECHNOLOGY (ICACT) - OPENING NEW ERA OF SMART SOCIETY, 2017, : 458 - 463
  • [8] Using kNN model for automatic text categorization
    Gongde Guo
    Hui Wang
    David Bell
    Yaxin Bi
    Kieran Greer
    [J]. Soft Computing, 2006, 10 : 423 - 430
  • [9] Using kNN model for automatic text categorization
    Guo, GD
    Wang, H
    Bell, D
    Bi, YX
    Greer, K
    [J]. SOFT COMPUTING, 2006, 10 (05) : 423 - 430
  • [10] The Analysis and Optimization of KNN Algorithm Space-Time Efficiency for Chinese Text Categorization
    Cai, Ying
    Wang, Xiaofei
    [J]. ADVANCES IN COMPUTER SCIENCE, ENVIRONMENT, ECOINFORMATICS, AND EDUCATION, PT I, 2011, 214 : 542 - 550