A New KNN Categorization Algorithm for Harmful Information Filtering

被引:1
|
作者
Du, Juan [1 ]
Yi, Zhi An [1 ]
机构
[1] Northeast Petr Univ, Software Coll, Da Qing, Peoples R China
关键词
component; Small sample pattern recognition; Virtual sample; Harmful information filtering; Network information security;
D O I
10.1109/ISCID.2012.128
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The prediction result of classifier is biased towards the class with more samples, when the harmful text information is filtered. This is because that the samples that including the harmful information were difficult to gain. Construct virtual samples is an effective means to solve the problem of pattern recognition in the small sample, using the up-sampling method to construct virtual samples in the data layer, the traditional KNN algorithm has been improved: a small sample set is divided into clusters by using the K-means clustering, the virtual samples are generated and verified the validity in the cluster. The experimental results show that this method can construct the virtual samples which are similar to the real sample characteristics, and improved the classification effect of KNN algorithm.
引用
收藏
页码:489 / 492
页数:4
相关论文
共 50 条
  • [1] A Categorization Algorithm for Harmful Text Information Filtering
    Du, Juan
    Yi, Zhi An
    [J]. 2012 FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY (MINES 2012), 2012, : 31 - 34
  • [2] A KNN BASED ALGORITHM FOR TEXT CATEGORIZATION
    Bucar, Joze
    Povh, Janez
    [J]. SOR'13 PROCEEDINGS: THE 12TH INTERNATIONAL SYMPOSIUM ON OPERATIONAL RESEARCH IN SLOVENIA, 2013, : 367 - 372
  • [3] Using KNN Algorithm for Text Categorization
    Wajeed, M. A.
    Adilakshmi, T.
    [J]. COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 796 - +
  • [4] A simple KNN algorithm for text categorization
    Soucy, P
    Mineau, GW
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 647 - 648
  • [5] A fast KNN algorithm for text categorization
    Wang, Yu
    Wang, Zheng-Ou
    [J]. PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3436 - +
  • [6] KNN Text Categorization Algorithm Based on Semantic Centre
    Zhang Xiao-fei
    Huang He-yan
    Zhang Ke-liang
    [J]. 2009 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND COMPUTER SCIENCE, VOL 1, PROCEEDINGS, 2009, : 249 - +
  • [7] The Research of kNN Text Categorization Algorithm Based On Eager Learning
    Dong, Tao
    Cheng, Weinan
    Shang, Wenqian
    [J]. 2012 INTERNATIONAL CONFERENCE ON INDUSTRIAL CONTROL AND ELECTRONICS ENGINEERING (ICICEE), 2012, : 1120 - 1123
  • [8] A Harmful Information Identification and Filtering Algorithm of Minority Ethnic Language Based on Boolean Model
    Nurbol
    Xie Nannan
    Jia Xue
    Yu Xiaodi
    Salamat
    Hu Liang
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 396 - 400
  • [9] A tool for image categorization to support information filtering
    Abramowicz, W
    Miklas-Kalczynska, M
    Kalczynski, PJ
    [J]. INTELLIGENT SYSTEMS, 2002, : 215 - 220
  • [10] Difference Factor' KNN Collaborative Filtering Recommendation Algorithm
    Liang, Wenzhong
    Lu, Guangquan
    Ji, Xiaoyu
    Li, Jian
    Yuan, Dingrong
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2014, 2014, 8933 : 175 - 184