Noise reduction to text categorization based on density for KNN

被引:7
|
作者
Li, RL [1 ]
Hu, YF [1 ]
机构
[1] Fudan Univ, Comp Technol & Informat Dept, Shanghai 200433, Peoples R China
关键词
text classification; k-Nearest Neighbor;
D O I
10.1109/ICMLC.2003.1260115
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of World Wide Web,. text classification has become the key technology in organizing and processing large amount of document data. As a simple and effective classification approach, KNN method is widely used in text categorization. But KNN classifier not only has the large computational demands, but also may result in the decrease of precision of classification because of uneven density of training data. In this paper, we present a density-based method for reducing the noises of training data, which solves these problems. Our experiment results also illustrate it.
引用
收藏
页码:3119 / 3124
页数:6
相关论文
共 50 条
  • [1] Graph based KNN for Text Categorization
    Jo, Taeho
    [J]. 2018 20TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT), 2018, : 260 - 265
  • [2] A KNN BASED ALGORITHM FOR TEXT CATEGORIZATION
    Bucar, Joze
    Povh, Janez
    [J]. SOR'13 PROCEEDINGS: THE 12TH INTERNATIONAL SYMPOSIUM ON OPERATIONAL RESEARCH IN SLOVENIA, 2013, : 367 - 372
  • [3] String Vector based KNN for Text Categorization
    Jo, Taeho
    [J]. 2017 19TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATIONS TECHNOLOGY (ICACT) - OPENING NEW ERA OF SMART SOCIETY, 2017, : 458 - 463
  • [4] KNN Text Categorization Algorithm Based on Semantic Centre
    Zhang Xiao-fei
    Huang He-yan
    Zhang Ke-liang
    [J]. 2009 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE, VOL 1, PROCEEDINGS, 2009, : 249 - +
  • [5] Research on text categorization model based on LDA - KNN
    Chen, Weihua
    Zhang, Xian
    [J]. 2017 IEEE 2ND ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2017, : 2719 - 2726
  • [6] Uncertainty-based noise reduction and term selection in text categorization
    Peters, C
    Koster, CHA
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2002, 2291 : 248 - 267
  • [7] Using KNN Algorithm for Text Categorization
    Wajeed, M. A.
    Adilakshmi, T.
    [J]. COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 796 - +
  • [8] A simple KNN algorithm for text categorization
    Soucy, P
    Mineau, GW
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 647 - 648
  • [9] A fast KNN algorithm for text categorization
    Wang, Yu
    Wang, Zheng-Ou
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3436 - +
  • [10] KNN with TF-IDF Based Framework for Text Categorization
    Trstenjak, Bruno
    Mikac, Sasa
    Donko, Dzenana
    [J]. 24TH DAAAM INTERNATIONAL SYMPOSIUM ON INTELLIGENT MANUFACTURING AND AUTOMATION, 2013, 2014, 69 : 1356 - 1364