A Study of the Chinese spam Classification with Doc2vec and CNN

被引:0
|
作者
Gong, Hechen [1 ]
You, Fucheng [1 ]
Wang, Shaomei [1 ]
机构
[1] Beijing Inst Graph Commun, Sch Informat Engn, Beijing 102600, Peoples R China
关键词
D O I
10.1088/1757-899X/563/4/042026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolution neural network is a kind of neural network, which has been proved to be very effective in image recognition and classification. In recent years, convolution neural networks have gradually shifted to the field of natural language processing and become one of the research hotspots. For the construction of word vector text using convolution neural network, only considering the relationship between word granularity level, not considering the relationship between words, nor considering the relationship between semantics, affecting the classification results. In this paper, a method based on Doc2vec and CNN is proposed to classify spam. Firstly, the spam is preprocessed, then the sentence vectors and word vectors of Chinese text are trained by Doc2vec, and finally the trained text vectors are classified by convolution neural network.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Chinese abstraction algorithm combining Doc2Vec and TextRank
    Mou, Jinjun
    Xiong, Zhibin
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 149 - 149
  • [2] Chinese Text Keyword Extraction Based on Doc2vec And TextRank
    Wang, Wei
    Li, Xiangshun
    Yu, Sheng
    [J]. PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 369 - 373
  • [3] Compressed Firmware Classification Based on Extra Trees and Doc2Vec
    Qiu, Jing
    Geng, Xiaoxu
    Sun, Guanglu
    [J]. SCIENTIFIC PROGRAMMING, 2021, 2021
  • [4] Sentiment Analysis on Chinese Hotel Reviews with Doc2Vec and Classifiers
    Shuai, Qianjun
    Huang, Yamei
    Jin, Libiao
    Pang, Long
    [J]. PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 1171 - 1174
  • [5] Micro-blog sentiment classification using Doc2vec
    Liang, Yinghong
    Liu, Haitao
    Zhang, Su
    [J]. JOURNAL OF ENGINEERING-JOE, 2020, 2020 (13): : 407 - 410
  • [6] Topic recommendation using Doc2Vec
    Karvelis, Petros
    Gavrilis, Dimitris
    Georgoulas, George
    Stylios, Chrysostomos
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [7] Classification of Customer Demands by Using Doc2Vec Feaure Extraction Method
    Arslan, Halil
    Kaynar, Oguz
    Sahin, Sumeyye
    [J]. 2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [8] Web services classification via combining Doc2Vec and LINE model
    Ye, Hongfan
    Cao, Buqing
    Geng, Jinkun
    Wen, Yiping
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2020, 23 (03) : 250 - 261
  • [9] Bangla news recommendation using doc2vec
    Nandi, Rabindra Nath
    Zaman, M. M. Arefin
    Al Muntasir, Tareq
    Sumit, Sakhawat Hosain
    Sourov, Tanvir
    Rahman, Md. Jamil-Ur
    [J]. 2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [10] Deep Learning Based Classification Using Academic Studies in Doc2Vec Model
    Safali, Yasar
    Nergiz, Gozde
    Avaroglu, Erdinc
    Dogan, Emre
    [J]. 2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,