Application of Knowledge Gain on Multi-Type Feature Space in Microblog User Classification

被引:0
|
作者
Yan, Xu [1 ,2 ]
机构
[1] Beijing Language & Culture Univ, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
关键词
knowledge gain; feature selection; text classification; user classification; microblog;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Feature selection plays an important role in text categorization. Classic feature selection methods such as document frequency (DF), information gain (IG), mutual information (MI) are commonly applied in text categorization. But usually they only take plain text into account. Knowledge Gain (KG) is a new feature selection method which is proposed in my previous paper. It measures attribute's importance based on Rough Set theory. Experiment shows that it performs well in traditional text classification, and it has obvious advantage in unbalanced corpus in recall rate. Unlike traditional text classification, characteristics of microblog reflected in short text and special structure networks, including user social network and behavior network. This results in less text information and more behavior and social information of microblog users. The classic feature selection algorithms, which are proposed based on text feature, is not applicable. In this paper, we validated that KG which is proposed based on the rough set knowledge can select optimal feature consistently in multi-type feature space of microblog user classification. Experiment shows that it has better performance in multi-type feature selection than other classic feature selection methods.
引用
收藏
页码:340 / 345
页数:6
相关论文
共 50 条
  • [1] Multi-type spectral spatial feature for hyperspectral image classification
    Yuan, Yuan
    Jin, Mingxin
    [J]. NEUROCOMPUTING, 2022, 492 : 637 - 650
  • [2] Study of Feature Extract on Microblog User Occupation Classification
    Zhou, Meilin
    Xu, Yan
    Zhao, Xiaodan
    [J]. 2012 INTERNATIONAL SYMPOSIUM ON INFORMATION SCIENCE AND ENGINEERING (ISISE), 2012, : 20 - 23
  • [3] Multi-label Emotion Classification for Microblog Based on CNN Feature Space
    Sun S.
    He Y.
    [J]. 1600, Sichuan University (49): : 162 - 169
  • [4] A robust approach for multi-type classification of brain tumor using deep feature fusion
    Chen, Wenna
    Tan, Xinghua
    Zhang, Jincan
    Du, Ganqin
    Fu, Qizhi
    Jiang, Hongwei
    [J]. FRONTIERS IN NEUROSCIENCE, 2024, 18
  • [5] Multi-Type Classification Comparison of Mammogram Abnormalities
    Sowmyayani, S.
    Murugan, V
    [J]. INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2021, 21 (03)
  • [6] Sparse regression with Multi-type Regularized Feature modeling
    Devriendt, Sander
    Antonio, Katrien
    Reynkens, Tom
    Verbelen, Roel
    [J]. INSURANCE MATHEMATICS & ECONOMICS, 2021, 96 : 248 - 261
  • [7] A detection method for multi-type earth's surface anomalies based on multi-dimensional feature space
    Wei, Haishuo
    Jia, Kun
    Wang, Qiao
    Cao, Biao
    Qi, Jianbo
    Zhao, Wenzhi
    Yan, Kai
    Wang, Guoqiang
    Xue, Baolin
    Yan, Xing
    [J]. INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [8] Ensemble Learning for Multi-Type Classification in Heterogeneous Networks
    Serafino, Francesco
    Pio, Gianvito
    Ceci, Michelangelo
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (12) : 2326 - 2339
  • [9] Multi-type clustering and classification from heterogeneous networks
    Pio, Gianvito
    Serafino, Francesco
    Malerba, Donato
    Ceci, Michelangelo
    [J]. INFORMATION SCIENCES, 2018, 425 : 107 - 126
  • [10] Multi-type skin diseases classification using OP-DNN based feature extraction approach
    Arushi Jain
    Annavarapu Chandra Sekhara Rao
    Praphula Kumar Jain
    Ajith Abraham
    [J]. Multimedia Tools and Applications, 2022, 81 : 6451 - 6476