Learning Semantic Coherence for Machine Generated Spam Text Detection

被引:6
|
作者
Bao, Mengjiao [1 ]
Li, Jianxin [1 ]
Zhang, Jian [1 ]
Peng, Hao [1 ]
Liu, Xudong [1 ]
机构
[1] Beihang Univ, SKLSDE Lab, Beijing Adv Innovat Ctr Big Data & Brain Comp, Beijing, Peoples R China
关键词
Deep Learning; Neural Network; Semantics; Text Classification;
D O I
10.1109/ijcnn.2019.8852340
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Using machine to generate text has attracted considerable attention recently. However, low quality text generated by machine will seriously impact the user experience due to the poor readability. Traditional methods for detecting machine generated text heavily depend on hand-crafted features. While most deep learning methods for general text classification tend to model the semantic representation of topics, and thus overlook the semantic coherence that is also useful for detecting machine generated text. In this paper, we propose an end-to-end neural architecture that learns semantic coherence of text sequences. We conduct experiments on both Chinese and English datasets with more than two million articles containing manually written and machine generated ones. Results show that our method is effective and achieves the state-of-the-art performance.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Spam Detection in Social Media Employing Machine Learning Tool for Text Mining
    Zaman, Zakia
    Sharmin, Sadia
    [J]. 2017 13TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY AND INTERNET-BASED SYSTEMS (SITIS), 2017, : 137 - 142
  • [2] Efficient spam filtering through intelligent text modification detection using machine learning
    Mageshkumar, N.
    Vijayaraj, A.
    Arunpriya, N.
    Sangeetha, A.
    [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 64 : 848 - 858
  • [3] Efficient spam filtering through intelligent text modification detection using machine learning
    Mageshkumar, N.
    Vijayaraj, A.
    Arunpriya, N.
    Sangeetha, A.
    [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 64 : 848 - 858
  • [4] Semantic Representation Based on Deep Learning for Spam Detection
    Saidani, Nadjate
    Adi, Kamel
    Allili, Mohand Said
    [J]. FOUNDATIONS AND PRACTICE OF SECURITY, FPS 2019, 2020, 12056 : 72 - 81
  • [5] Comparison of machine learning techniques for spam detection
    Ghosh, Argha
    Senthilrajan, A.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29227 - 29254
  • [6] Machine Learning for the Detection of Spam in Twitter Networks
    Wang, Alex Hai
    [J]. E-BUSINESS AND TELECOMMUNICATIONS, 2012, 222 : 319 - 333
  • [7] A Study of Machine Learning Classifiers for Spam Detection
    Trivedi, Shrawan Kumar
    [J]. 2016 4TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL AND BUSINESS INTELLIGENCE (ISCBI), 2016, : 176 - 180
  • [8] Comparison of Machine Learning Algorithms for Spam Detection
    Sadia, Azeema
    Bashir, Fatima
    Khan, Reema Qaiser
    Bashir, Amna
    Khalid, Ammarah
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (02) : 178 - 184
  • [9] Comparison of machine learning techniques for spam detection
    Argha Ghosh
    A. Senthilrajan
    [J]. Multimedia Tools and Applications, 2023, 82 : 29227 - 29254
  • [10] Review Spam Detection using Machine Learning
    Radovanovic, Drasko
    Krstajic, Boza
    [J]. 2018 23RD INTERNATIONAL SCIENTIFIC-PROFESSIONAL CONFERENCE ON INFORMATION TECHNOLOGY (IT), 2018,