Microblogging Short Text Classification based on Word2Vec

被引:0
|
作者
Zhang, Yonghui [1 ]
Liu, Jingang [1 ,2 ]
机构
[1] Capital Normal Univ, Beijing 100048, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing 100089, Peoples R China
关键词
Word2Vec; Features extension; Microblogging short text; SVM; Classification;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For the sparse features of the microblogging text, the author proposes a method of microblogging text classification based on the features extension by Word2Vec. We train the text by using Word2Vec tool and find the words which are similar to original features semantic as the features of short text. Then we expand the features to the original text, and finally classify the subject of microblogging text by using SVM method. Experimental results show that the method has high accuracy recall and F1 values compared with the traditional method of vector space model and LDA topic model.
引用
收藏
页码:395 / 401
页数:7
相关论文
共 50 条
  • [41] WEIGHTED WORD2VEC BASED ON THE DISTANCE OF WORDS
    Chang, Chia-Yang
    Lee, Shie-Jue
    Lai, Chih-Chin
    [J]. PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2017, : 563 - 568
  • [42] Keywords Extraction Based on Word2Vec and TextRank
    Zhang, Yong
    Chen, Fen
    Zhang, Wufeng
    Zuo, Haoyang
    Yu, Fangyuan
    [J]. 2020 3RD INTERNATIONAL CONFERENCE ON BIG DATA AND EDUCATION (ICBDE 2020), 2020, : 37 - 42
  • [43] The Spectral Underpinning of word2vec
    Jaffe, Ariel
    Kluger, Yuval
    Lindenbaum, Ofir
    Patsenker, Jonathan
    Peterfreund, Erez
    Steinerberger, Stefan
    [J]. FRONTIERS IN APPLIED MATHEMATICS AND STATISTICS, 2020, 6
  • [44] Emerging Trends Word2Vec
    Church, Kenneth Ward
    [J]. NATURAL LANGUAGE ENGINEERING, 2017, 23 (01) : 155 - 162
  • [45] KEYWORD EXTRACTION BASED ON WORD SYNONYMS USING WORD2VEC
    Ogul, Iskender Ulgen
    Ozcan, Caner
    Hakdagli, Ozlem
    [J]. 2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [46] Personal Trait Analysis Using Word2vec Based on User-generated Text
    Sun, Guanqun
    Guo, Ao
    Ma, Jianhua
    Wei, Jianguo
    [J]. 2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 1131 - 1137
  • [47] Polarity Classification for Target Phrases in Tweets: A Word2Vec Approach
    Rexha, Andi
    Kroell, Mark
    Dragoni, Mauro
    Kern, Roman
    [J]. SEMANTIC WEB, ESWC 2016, 2016, 9989 : 217 - 223
  • [48] Classification Turkish SMS with Deep Learning Tool Word2Vec
    Karasoy, Onur
    Balli, Serkan
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 294 - 297
  • [49] A Study on Sentiment Computing and Classification of Sina Weibo with Word2vec
    Bai Xue
    Chen Fu
    Zhan Shaobin
    [J]. 2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 358 - 363
  • [50] Word2vec for Arabic Word Sense Disambiguation
    Laatar, Rim
    Aloulou, Chafik
    Belghuith, Lamia Hadrich
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2018), 2018, 10859 : 308 - 311