Wikipedia Based Short Text Classification Method

被引:8
|
作者
Li, Junze [1 ]
Cai, Yi [1 ]
Cai, Zhiwei [1 ]
Leung, Hofung [2 ]
Yang, Kai [1 ]
机构
[1] South China Univ Technol, Sch Software Engn, Guangzhou, Guangdong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Sha Tin, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Short text classification; Concept; Wikipedia;
D O I
10.1007/978-3-319-55705-2_22
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Short text is usually expressed in refined slightly, insufficient information, which makes text classification difficult. But we can try to introduce some information from the existing knowledge base to strengthen the performance of short text classification. Wikipedia [2,13,15] is now the largest human-edited knowledge base of high quality. It would benefit to short text classification if we can make full use of Wikipedia information in short text classification. This paper presents a new concept based [22] on Wikipedia short text representation method, by identifying the concept of Wikipedia mentioned in short text, and then expand the concept of wiki correlation and short text messages to the feature vector representation.
引用
收藏
页码:275 / 286
页数:12
相关论文
共 50 条
  • [1] Short Text Classification using Wikipedia Concept based Document Representation
    Wang, Xiang
    Chen, Ruhua
    Jia, Yan
    Zhou, Bin
    [J]. 2013 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA), 2013, : 471 - 474
  • [2] Short Text Classification Based on Wikipedia and Word2vec
    Liu Wensen
    Cao Zewen
    Wang Jun
    Wang Xiaoyi
    [J]. 2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1195 - 1200
  • [3] A Sample Extension Method Based on Wikipedia and Its Application in Text Classification
    Wenhao Zhu
    Yiting Liu
    Guannan Hu
    Jianyue Ni
    Zhiguo Lu
    [J]. Wireless Personal Communications, 2018, 102 : 3851 - 3867
  • [4] A Sample Extension Method Based on Wikipedia and Its Application in Text Classification
    Zhu, Wenhao
    Liu, Yiting
    Hu, Guannan
    Ni, Jianyue
    Lu, Zhiguo
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2018, 102 (04) : 3851 - 3867
  • [5] Semantic dictionary based method for short text classification
    Tang, Hao-Jin
    Yan, Dan-Feng
    Tian, Yuan
    [J]. Journal of China Universities of Posts and Telecommunications, 2013, 20 (SUPPL. 1): : 15 - 19
  • [6] Short Text Classification With A Convolutional Neural Networks Based Method
    Hu, Yibo
    Li, Yang
    Yang, Tao
    Pan, Quan
    [J]. 2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 1432 - 1435
  • [7] Wikipedia-based cross-language text classification
    Mourino Garcia, Marcos Antonio
    Perez Rodriguez, Roberto
    Anido Rifon, Luis
    [J]. INFORMATION SCIENCES, 2017, 406 : 12 - 28
  • [8] Semantic Enrichment of Text Representation with Wikipedia for Text Classification
    Yamakawa, Hiroki
    Peng, Jing
    Feldman, Anna
    [J]. IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [9] A Short Text Classification Method Based on N-Gram and CNN
    Wang, Haitao
    He, Jie
    Zhang, Xiaohong
    Liu, Shufen
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2020, 29 (02) : 248 - 254
  • [10] Method of Feature Reduction in Short Text Classification Based on Feature Clustering
    Li, Fangfang
    Yin, Yao
    Shi, Jinjing
    Mao, Xingliang
    Shi, Ronghua
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (08):