Ensemble of multiple kNN classifiers for societal risk classification

被引:0
|
作者
Jindong Chen
Xijin Tang
机构
[1] Chinese Academy of Sciences,Academy of Mathematics and Systems Science
[2] China Aerospace Academy of Systems Science and Engineering,undefined
来源
Journal of Systems Science and Systems Engineering | 2017年 / 26卷
关键词
Societal risk classification; Tianya Forum; k-Nearest Neighbor; ensemble; Paragraph Vector;
D O I
暂无
中图分类号
学科分类号
摘要
Societal risk classification is a fundamental and complex issue for societal risk perception. To conduct societal risk classification, Tianya Forum posts are selected as the data source, and four kinds of representations: string representation, term-frequency representation, TF-IDF representation and the distributed representation of BBS posts are applied. Using edit distance or cosine similarity as distance metric, four k-Nearest Neighbor (kNN) classifiers based on different representations are developed and compared. Owing to the priority of word order and semantic extraction of the neural network model Paragraph Vector, kNN based on the distributed representation generated by Paragraph Vector (kNN-PV) shows effectiveness for societal risk classification. Furthermore, to improve the performance of societal risk classification, through different weights, kNN-PV is combined with other three kNN classifiers as an ensemble model. Through brute force grid search method, the optimal weights are assigned to different kNN classifiers. Compared with kNN-PV, the experimental results reveal that Macro-F of the ensemble method is significantly improved for societal risk classification.
引用
收藏
页码:433 / 447
页数:14
相关论文
共 50 条
  • [1] ENSEMBLE OF MULTIPLE kNN CLASSIFIERS FOR SOCIETAL RISK CLASSIFICATION
    Chen, Jindong
    Tang, Xijin
    JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2017, 26 (04) : 433 - 447
  • [2] Ensemble of SVM Classifiers with Different Representations for Societal Risk Classification
    Chen, Jindong
    Tang, Xijin
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2015, 2015, 9403 : 669 - 675
  • [3] Ensemble of a subset of kNN classifiers
    Asma Gul
    Aris Perperoglou
    Zardad Khan
    Osama Mahmoud
    Miftahuddin Miftahuddin
    Werner Adler
    Berthold Lausen
    Advances in Data Analysis and Classification, 2018, 12 : 827 - 840
  • [4] Ensemble of a subset of kNN classifiers
    Gul, Asma
    Perperoglou, Aris
    Khan, Zardad
    Mahmoud, Osama
    Miftahuddin, Miftahuddin
    Adler, Werner
    Lausen, Berthold
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2018, 12 (04) : 827 - 840
  • [5] Ensemble of multiple CNN classifiers for HSI classification with Superpixel Smoothing
    Sikakollu, Prasanth
    Dash, Ratnakar
    COMPUTERS & GEOSCIENCES, 2021, 154
  • [6] Editing training data for kNN classifiers with neural network ensemble
    Jiang, Y
    Zhou, ZH
    ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 1, 2004, 3173 : 356 - 361
  • [7] Ensemble of Multiple Classifiers for Multilabel Classification of Plant Protein Subcellular Localization
    Wattanapornprom, Warin
    Thammarongtham, Chinae
    Hongsthong, Apiradee
    Lertampaiporn, Supatcha
    LIFE-BASEL, 2021, 11 (04):
  • [8] CIFAR-10: KNN-based Ensemble of Classifiers
    Abouelnaga, Yehya
    Ali, Ola S.
    Rady, Hager
    Moustafa, Mohamed
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, : 1192 - 1195
  • [9] Malware Classification Using Ensemble Classifiers
    Hijazi, Mohd Hanafi Ahmad
    Beng, Tan Choon
    Mountstephens, James
    Lim, Yuto
    Nisar, Kashif
    ADVANCED SCIENCE LETTERS, 2018, 24 (02) : 1172 - 1176
  • [10] Performance Analysis of Lung Cancer Classification using Multiple Feature Extraction with SVM and KNN Classifiers
    Ashwini, S. S.
    Kurain, M. Z.
    Nagaraja, M.
    2021 IEEE INTERNATIONAL CONFERENCE ON MOBILE NETWORKS AND WIRELESS COMMUNICATIONS (ICMNWC), 2021,