Application of Knowledge Gain on Multi-Type Feature Space in Microblog User Classification

Authors
Yan, Xu [1, 2]
Affiliations
[1] Beijing Language & Culture Univ, Beijing, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
Keywords
knowledge gain; feature selection; text classification; user classification; microblog;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Feature selection plays an important role in text categorization. Classic feature selection methods such as document frequency (DF), information gain (IG), and mutual information (MI) are commonly applied in text categorization, but they usually take only plain text into account. Knowledge Gain (KG) is a feature selection method proposed in the author's previous paper; it measures an attribute's importance based on rough set theory. Experiments show that it performs well in traditional text classification and has a clear advantage in recall on unbalanced corpora. Unlike traditional text classification, microblogs are characterized by short texts and special network structures, including the user social network and the behavior network. As a result, microblog users offer less text information and more behavioral and social information. The classic feature selection algorithms, which were designed for text features, are therefore not applicable. In this paper, we validate that KG, which is based on rough set theory, consistently selects optimal features in the multi-type feature space of microblog user classification. Experiments show that it performs better in multi-type feature selection than the other classic feature selection methods.
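For context, information gain (IG), one of the classic baselines named in the abstract, can be sketched as follows. This is a minimal illustration only, not the Knowledge Gain method, whose formula this record does not give. The toy users, the feature names (`mentions_sale`, `posts_per_day`, `followers`), and the class labels are all hypothetical, and the behavioral and social features are assumed to be discretized already.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(values, labels):
    """IG of a discrete feature: H(labels) - H(labels | feature)."""
    total = len(labels)
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


# Hypothetical toy data: one text feature, one behavior feature,
# one social feature, all discretized. Not taken from the paper.
users = [
    {"mentions_sale": 1, "posts_per_day": "high", "followers": "low", "label": "spam"},
    {"mentions_sale": 0, "posts_per_day": "high", "followers": "low", "label": "spam"},
    {"mentions_sale": 1, "posts_per_day": "low", "followers": "low", "label": "normal"},
    {"mentions_sale": 0, "posts_per_day": "low", "followers": "high", "label": "normal"},
]

labels = [u["label"] for u in users]
feature_names = ["mentions_sale", "posts_per_day", "followers"]

# Score every feature, then rank from most to least informative.
scores = {
    name: information_gain([u[name] for u in users], labels)
    for name in feature_names
}
ranking = sorted(scores, key=scores.get, reverse=True)
```

On this toy data the behavior feature separates the two classes perfectly, so it ranks first, while the text feature carries no class information at all.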
Pages: 340-345
Number of pages: 6