Learning Semantic Coherence for Machine Generated Spam Text Detection

被引:6
|
作者
Bao, Mengjiao [1 ]
Li, Jianxin [1 ]
Zhang, Jian [1 ]
Peng, Hao [1 ]
Liu, Xudong [1 ]
机构
[1] Beihang Univ, SKLSDE Lab, Beijing Adv Innovat Ctr Big Data & Brain Comp, Beijing, Peoples R China
关键词
Deep Learning; Neural Network; Semantics; Text Classification;
D O I
10.1109/ijcnn.2019.8852340
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Using machine to generate text has attracted considerable attention recently. However, low quality text generated by machine will seriously impact the user experience due to the poor readability. Traditional methods for detecting machine generated text heavily depend on hand-crafted features. While most deep learning methods for general text classification tend to model the semantic representation of topics, and thus overlook the semantic coherence that is also useful for detecting machine generated text. In this paper, we propose an end-to-end neural architecture that learns semantic coherence of text sequences. We conduct experiments on both Chinese and English datasets with more than two million articles containing manually written and machine generated ones. Results show that our method is effective and achieves the state-of-the-art performance.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Detection of Machine-Generated Text: Literature Survey
    University of Arkansas at Little Rock, United States
    [J]. arXiv,
  • [22] Spam SMS filtering based on text features and supervised machine learning techniques
    Muhammad Adeel Abid
    Saleem Ullah
    Muhammad Abubakar Siddique
    Muhammad Faheem Mushtaq
    Wajdi Aljedaani
    Furqan Rustam
    [J]. Multimedia Tools and Applications, 2022, 81 : 39853 - 39871
  • [23] Spam SMS filtering based on text features and supervised machine learning techniques
    Abid, Muhammad Adeel
    Ullah, Saleem
    Siddique, Muhammad Abubakar
    Mushtaq, Muhammad Faheem
    Aljedaani, Wajdi
    Rustam, Furqan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (28) : 39853 - 39871
  • [24] Comparative Study of Machine Learning Algorithms for SMS Spam Detection
    Alzahrani, Amani
    Rawat, Danda B.
    [J]. 2019 IEEE SOUTHEASTCON, 2019,
  • [25] Detection of Spam E-mails with Machine Learning Methods
    Karamollaoglu, Hamdullah
    Dogru, Ibrahim Alper
    Dorterler, Murat
    [J]. 2018 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2018, : 55 - 59
  • [26] SMS spam detection and comparison of various machine learning algorithms
    Sethi, Paras
    Bhandari, Vaibhav
    Kohli, Bhavna
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES FOR SMART NATION (IC3TSN), 2017, : 28 - 31
  • [27] An Efficient Email Spam Detection Utilizing Machine Learning Approaches
    Kumar, G. Ravi
    Murthuja, P.
    Babu, G. Anjan
    Nagamani, K.
    [J]. INNOVATIVE DATA COMMUNICATION TECHNOLOGIES AND APPLICATION, ICIDCA 2021, 2022, 96 : 141 - 151
  • [28] MACHINE LEARNING BASED TWITTER SPAM ACCOUNT DETECTION: A REVIEW
    Gheewala, Shivangi
    Patel, Rakesh
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2018), 2018, : 79 - 84
  • [29] Using machine learning to deal with Phishing and Spam Detection: An overview
    El Kouari, Oumaima
    Benaboud, Hafssa
    Lazaar, Saiida
    [J]. 3RD INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEM & SECURITY (NISS'20), 2020,
  • [30] Survey of review spam detection using machine learning techniques
    Crawford M.
    Khoshgoftaar T.M.
    Prusa J.D.
    Richter A.N.
    Al Najada H.
    [J]. Journal of Big Data, 2 (1)