Microblogging Short Text Classification based on Word2Vec

Authors
Zhang, Yonghui [1]
Liu, Jingang [1, 2]
Affiliations
[1] Capital Normal Univ, Beijing 100048, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing 100089, Peoples R China
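Keywords
Word2Vec; Features extension; Microblogging short text; SVM; Classification
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract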
To address the sparse features of microblog text, we propose a text classification method based on feature extension with Word2Vec. We train Word2Vec on the text and find words that are semantically similar to the original features of each short text. We then add these words to the original text as extended features, and finally classify the topic of the microblog text with an SVM. Experimental results show that the method achieves higher accuracy, recall, and F1 values than the traditional vector space model and the LDA topic model.
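The feature-extension step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the embedding table `TOY_VECTORS`, the helper names `most_similar` and `expand_features`, and the `top_k` and `min_sim` parameters are all assumptions made for the example. In a real pipeline the vectors would come from a Word2Vec model trained on a microblog corpus.

```python
import math

# Toy 3-dimensional embeddings, invented for illustration only.
# A real system would load vectors from a trained Word2Vec model.
TOY_VECTORS = {
    "football": [0.9, 0.1, 0.0],
    "soccer": [0.85, 0.15, 0.05],
    "goal": [0.8, 0.2, 0.1],
    "stock": [0.1, 0.9, 0.0],
    "market": [0.15, 0.85, 0.1],
    "movie": [0.0, 0.1, 0.9],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def most_similar(word, vectors, top_k=2, min_sim=0.9):
    """Return up to top_k vocabulary words whose similarity to word is >= min_sim."""
    if word not in vectors:
        return []  # out-of-vocabulary words contribute no extra features
    scored = [
        (other, cosine(vectors[word], vec))
        for other, vec in vectors.items()
        if other != word
    ]
    scored = [pair for pair in scored if pair[1] >= min_sim]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [other for other, _ in scored[:top_k]]


def expand_features(tokens, vectors, top_k=2, min_sim=0.9):
    """Append semantically similar words to a short text's token list."""
    expanded = list(tokens)
    seen = set(tokens)
    for token in tokens:
        for neighbour in most_similar(token, vectors, top_k, min_sim):
            if neighbour not in seen:
                expanded.append(neighbour)
                seen.add(neighbour)
    return expanded


if __name__ == "__main__":
    print(expand_features(["football", "tonight"], TOY_VECTORS))
```

The expanded token lists would then be turned into feature vectors (for example, term weights in a vector space model) and passed to an SVM classifier; scikit-learn's `LinearSVC` is one possible choice, though the abstract does not say which SVM implementation was used.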
Pages: 395-401
Number of pages: 7