Novel text classification based on K-nearest neighbor

被引:0
|
作者
Yu, Xiao-Peng [1 ,2 ]
Yu, Xiao-Gao [3 ]
机构
[1] Wuhan Univ, Sch Informat Management, Wuhan 430072, Peoples R China
[2] Wuhan Inst Technol, Sch Econ Management, Wuhan 430073, Peoples R China
[3] Hubei Univ Econ, Wuhan 430070, Peoples R China
关键词
K-nearest neighbor; P2P; kernel density estimation; text classification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
K-nearest neighbors classifier (KNNC) is widely used because of its simplicity and efficiency. It includes k-nearest neighbors search (KNNS) and classification. Existing centralized KNNS does not scale up to large volume of data, and the classification still suffers from inductive biases that result from its assumptions, such as the presumption that training data are evenly distributed This paper proposes a method (P2PKNNC) which improves performance of kNN based text classification in the P2P communication paradigm. P2PKNNC adaptively executes k nearest neighbor(s) queries in a distributed metric structure, which is based on the generalized hyperplane partitioning. And it selects the influencing part from these neighbors and classifies the input document in term of the disturbance degree which it brings to the kernel densities of these influencing neighbors for uneven text sets. The experimental results indicate that our algorithm achieves significant classification performance improvement on imbalanced corpora.
引用
收藏
页码:3425 / +
页数:2
相关论文
共 50 条
  • [1] A Review of a Text Classification Technique: K-Nearest Neighbor
    Zhou, R. S.
    Wang, Z. J.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL APPLICATIONS (CISIA 2015), 2015, 18 : 453 - 455
  • [2] Feature Extraction based Text Classification using K-Nearest Neighbor Algorithm
    Azam, Muhammad
    Ahmed, Tanvir
    Sabah, Fahad
    Hussain, Muhammad Iftikhar
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2018, 18 (12): : 95 - 101
  • [3] Text categorization based on k-nearest neighbor approach for Web site classification
    Kwon, OW
    Lee, JH
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2003, 39 (01) : 25 - 44
  • [4] Automatic text categorization based on K-nearest neighbor
    Sun, J.
    Wang, W.
    Zhong, Y.-X.
    [J]. Beijing Youdian Xueyuan Xuebao/Journal of Beijing University of Posts And Telecommunications, 2001, 24 (01): : 42 - 46
  • [5] Improved k-nearest neighbor classification
    Wu, YQ
    Ianakiev, K
    Govindaraju, V
    [J]. PATTERN RECOGNITION, 2002, 35 (10) : 2311 - 2318
  • [6] K-nearest neighbor classification based on influence function
    College of Information Engineering, Zhengzhou University, Zhengzhou
    450052, China
    [J]. Dianzi Yu Xinxi Xuebao, 7 (1626-1632):
  • [7] Analysis of the k-nearest neighbor classification
    Li, Jing
    Cheng, Ming
    [J]. INFORMATION SCIENCE AND MANAGEMENT ENGINEERING, VOLS 1-3, 2014, 46 : 1911 - 1917
  • [8] KRA: K-Nearest Neighbor Retrieval Augmented Model for Text Classification
    Li, Jie
    Tang, Chang
    Lei, Zhechao
    Zhang, Yirui
    Li, Xuan
    Yu, Yanhua
    Pi, Renjie
    Hu, Linmei
    [J]. ELECTRONICS, 2024, 13 (16)
  • [9] Combining multiple k-nearest neighbor classifiers for text classification by reducts
    Bao, YG
    Ishii, N
    [J]. DISCOVERY SCIENCE, PROCEEDINGS, 2002, 2534 : 340 - 347
  • [10] Comparative Analysis of K-Nearest Neighbor and Modified K-Nearest Neighbor Algorithm for Data Classification
    Okfalisa
    Mustakim
    Gazalba, Ikbal
    Reza, Nurul Gayatri Indah
    [J]. 2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 294 - 298