Wikipedia Based Short Text Classification Method

被引:8
|
作者
Li, Junze [1 ]
Cai, Yi [1 ]
Cai, Zhiwei [1 ]
Leung, Hofung [2 ]
Yang, Kai [1 ]
机构
[1] South China Univ Technol, Sch Software Engn, Guangzhou, Guangdong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Sha Tin, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Short text classification; Concept; Wikipedia;
D O I
10.1007/978-3-319-55705-2_22
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Short text is usually expressed in refined slightly, insufficient information, which makes text classification difficult. But we can try to introduce some information from the existing knowledge base to strengthen the performance of short text classification. Wikipedia [2,13,15] is now the largest human-edited knowledge base of high quality. It would benefit to short text classification if we can make full use of Wikipedia information in short text classification. This paper presents a new concept based [22] on Wikipedia short text representation method, by identifying the concept of Wikipedia mentioned in short text, and then expand the concept of wiki correlation and short text messages to the feature vector representation.
引用
收藏
页码:275 / 286
页数:12
相关论文
共 50 条
  • [21] Using Wikipedia knowledge to improve text classification
    Wang, Pu
    Hu, Jian
    Zeng, Hua-Jun
    Chen, Zheng
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2009, 19 (03) : 265 - 281
  • [22] Using Wikipedia knowledge to improve text classification
    Pu Wang
    Jian Hu
    Hua-Jun Zeng
    Zheng Chen
    [J]. Knowledge and Information Systems, 2009, 19 : 265 - 281
  • [23] A Short Text Classification Method Based on Convolutional Neural Network and Semantic Extension
    Wang, Haitao
    Tian, Keke
    Wu, Zhengjiang
    Wang, Lei
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2021, 14 (01) : 367 - 375
  • [24] Chinese Multilabel Short Text Classification Method Based on GAN and Pinyin Embedding
    Bai, Jinpeng
    Li, Xinfu
    [J]. IEEE ACCESS, 2024, 12 : 83323 - 83329
  • [25] Short Text Classification Based on Keywords Extension
    Gu, Yiran
    Shen, Jiajia
    [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2616 - 2621
  • [26] Boosting inductive transfer for text classification using Wikipedia
    Banerjee, Somnath
    [J]. ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, : 148 - 153
  • [27] Embedding Wikipedia Title Based on Its Wikipedia Text and Categories
    Chen, Chi-Yen
    Ma, Wei-Yun
    [J]. 2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 146 - 149
  • [28] A Text Classification Method Based on Cascade
    Li, Hui
    Zhang, Qi
    Lu, Huchuan
    Yang, Deli
    [J]. ADVANCES IN COGNITIVE NEURODYNAMICS, PROCEEDINGS, 2008, : 927 - +
  • [29] Text classification framework for short text based on TFIDF-FastText
    Shrutika Chawla
    Ravreet Kaur
    Preeti Aggarwal
    [J]. Multimedia Tools and Applications, 2023, 82 : 40167 - 40180
  • [30] Text classification framework for short text based on TFIDF-FastText
    Chawla, Shrutika
    Kaur, Ravreet
    Aggarwal, Preeti
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (26) : 40167 - 40180