Ensemble learning approach in Improved K Nearest Neighbor algorithm for Text Categorization

Cited by: 0
Authors
Iswarya, P. [1 ]
Radha, V. [1 ]
Affiliations
[1] Avinashilingam Inst Home Sci & Higher Educ Women, Dept Comp Sci, Coimbatore, Tamil Nadu, India
Keywords
Categorization; Clustering; Ensemble; K nearest neighbor; Support vector machine;
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
Due to the tremendous growth of digital content on the World Wide Web (WWW), text categorization has become an important tool for managing and organizing text data. This paper proposes an Ensemble Learning approach in Improved K Nearest Neighbor algorithm for Text Categorization (EINNTC), which combines single-pass clustering, ensemble learning, and the KNN algorithm. EINNTC addresses shortcomings of the traditional KNN classifier by reducing the cost of text similarity computation, avoiding the impact of noisy training samples, and speeding up the search for the K nearest neighbors. Experiments on the standard Reuters benchmark dataset show that the proposed method outperforms the SVM and KNN classifiers.
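The abstract names the components but not their details, so the following is only a minimal sketch of the general idea of pairing single-pass clustering with KNN, not the authors' EINNTC method. It assumes sparse term-weight dictionaries as document vectors, cosine similarity, per-class clustering, and similarity-weighted voting over cluster centroids; the ensemble step is omitted. All function names, the threshold, and the toy documents are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse term-weight dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def single_pass_cluster(docs, threshold):
    """Scan (vector, label) pairs once; merge each document into the most
    similar cluster of the same label, or start a new cluster if no
    same-label cluster reaches the similarity threshold (an assumed rule)."""
    clusters = []  # each cluster: {"label": str, "sum": dict, "n": int}
    for vec, label in docs:
        best, best_sim = None, threshold
        for c in clusters:
            if c["label"] != label:
                continue
            # Cosine is scale-invariant, so the summed vector acts as the centroid.
            sim = cosine(vec, c["sum"])
            if sim >= best_sim:
                best, best_sim = c, sim
        if best is None:
            clusters.append({"label": label, "sum": dict(vec), "n": 1})
        else:
            for term, weight in vec.items():
                best["sum"][term] = best["sum"].get(term, 0.0) + weight
            best["n"] += 1
    return clusters


def knn_predict(clusters, vec, k):
    """Vote among the k most similar cluster centroids, weighted by similarity."""
    scored = [(cosine(vec, c["sum"]), c["label"]) for c in clusters]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    votes = {}
    for sim, label in scored[:k]:
        votes[label] = votes.get(label, 0.0) + sim
    return max(votes, key=votes.get)


# Toy training data (hypothetical, not drawn from Reuters).
train = [
    ({"oil": 1.0, "price": 1.0}, "energy"),
    ({"oil": 1.0, "barrel": 1.0}, "energy"),
    ({"wheat": 1.0, "crop": 1.0}, "grain"),
    ({"wheat": 1.0, "harvest": 1.0}, "grain"),
]
clusters = single_pass_cluster(train, threshold=0.3)
prediction = knn_predict(clusters, {"wheat": 1.0, "price": 1.0}, k=2)
```

Because a query is compared against cluster centroids rather than every training document, the number of similarity computations drops from the training-set size to the number of clusters, which is the kind of saving the abstract describes.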
Pages: 5
Related papers
50 in total
  • [1] An improved K-nearest-neighbor algorithm for text categorization
    Jiang, Shengyi
    Pang, Guansong
    Wu, Meiling
    Kuang, Limin
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) : 1503 - 1509
  • [2] Text Categorization with K-Nearest Neighbor Approach
    Manne, Suneetha
    Kotha, Sita Kumari
    Fatima, S. Sameen
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 413 - +
  • [3] K-Nearest Neighbor Algorithm Optimization in Text Categorization
    Chen, Shufeng
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND MATERIAL APPLICATION (ESMA2017), VOLS 1-4, 2018, 108
  • [4] Binary k-nearest neighbor for text categorization
    Tan, SB
    [J]. ONLINE INFORMATION REVIEW, 2005, 29 (04) : 391 - 399
  • [5] IMPROVING K-NEAREST NEIGHBOR EFFICIENCY FOR TEXT CATEGORIZATION
    Barigou, F.
    [J]. NEURAL NETWORK WORLD, 2016, 26 (01) : 45 - 65
  • [6] Automatic text categorization based on K-nearest neighbor
    Sun, J.
    Wang, W.
    Zhong, Y.-X.
    [J]. Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 2001, 24 (01) : 42 - 46
  • [7] Text categorization based on k-nearest neighbor approach for Web site classification
    Kwon, OW
    Lee, JH
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2003, 39 (01) : 25 - 44
  • [8] Improving K Nearest Neighbor into String Vector Version for Text Categorization
    Jo, Taeho
    [J]. 2019 21ST INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ICT FOR 4TH INDUSTRIAL REVOLUTION, 2019, : 1091 - 1097
  • [9] A new nearest neighbor rule for text categorization
    Gil-Garcia, Reynaldo
    Pons-Porrata, Aurora
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2006, 4225 : 814 - 823
  • [10] Research on the Improvement of K-Nearest Neighbor Classifier for Imbalanced Text Categorization
    Yang Yanmei
    Xu Linying
    [J]. 2018 EIGHTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC 2018), 2018, : 968 - 972