Semantic Representation Based on Deep Learning for Spam Detection

Cited by: 1
Authors
Saidani, Nadjate [1]
Adi, Kamel [1]
Allili, Mohand Said [1]
Affiliations
[1] Univ Quebec Outaouais, Dept Comp Sci & Engn, Gatineau, PQ, Canada
Keywords
Spam detection; Embedding word; Domain-specific analysis; Semantic features; Deep learning; Classification
DOI
10.1007/978-3-030-45371-8_5
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper addresses the email spam filtering problem by proposing an approach based on two levels of semantic text analysis. In the first level, a deep learning technique based on Word2Vec is used to categorize emails by specific domain (e.g., health, education, finance). This enables a separate conceptual view of spam in each domain. In the second level, we extract a set of latent topics from email contents and represent them by rules, summarizing each email into compact topics that efficiently discriminate spam from legitimate email. The experimental study shows promising results in terms of spam detection precision.
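
The two-level idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy vectors stand in for trained Word2Vec embeddings, and the names `TOY_VECTORS`, `DOMAIN_RULES`, `assign_domain`, and `is_spam` are hypothetical, as are the keyword rules standing in for the latent-topic rules the paper extracts.

```python
import math

# ASSUMPTION: invented 3-dimensional vectors standing in for trained
# Word2Vec embeddings; a real system would learn these from a corpus.
TOY_VECTORS = {
    "health": (1.0, 0.0, 0.0), "doctor": (0.9, 0.1, 0.0), "pills": (0.8, 0.2, 0.1),
    "finance": (0.0, 1.0, 0.0), "bank": (0.0, 1.0, 0.1), "loan": (0.1, 0.9, 0.0),
    "education": (0.0, 0.0, 1.0), "course": (0.1, 0.0, 0.9), "degree": (0.0, 0.1, 0.9),
}
DOMAINS = ("health", "finance", "education")

# ASSUMPTION: hypothetical per-domain rules; each rule fires when all of its
# words occur in the email. The paper derives its rules from latent topics.
DOMAIN_RULES = {
    "health": {"cheap", "pills"},
    "finance": {"guaranteed", "loan"},
}


def mean_vector(tokens):
    """Average the vectors of known tokens; None if no token is known."""
    vecs = [TOY_VECTORS[t] for t in tokens if t in TOY_VECTORS]
    if not vecs:
        return None
    return tuple(sum(component) / len(vecs) for component in zip(*vecs))


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def assign_domain(text):
    """Level 1: pick the domain whose label vector is closest to the email."""
    doc = mean_vector(text.lower().split())
    if doc is None:
        return None
    return max(DOMAINS, key=lambda d: cosine(doc, TOY_VECTORS[d]))


def is_spam(text):
    """Level 2: apply the rule of the assigned domain, if one exists."""
    domain = assign_domain(text)
    rule = DOMAIN_RULES.get(domain, set())
    return domain, bool(rule) and rule <= set(text.lower().split())
```

Routing each email to a domain first means the second-level rules only need to separate spam from legitimate mail within that domain, which is the motivation the abstract gives for the two levels.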
Pages: 72-81
Number of pages: 10