Semantic Representation Based on Deep Learning for Spam Detection

被引:1
|
作者
Saidani, Nadjate [1 ]
Adi, Kamel [1 ]
Allili, Mohand Said [1 ]
机构
[1] Univ Quebec Outaouais, Dept Comp Sci & Engn, Gatineau, PQ, Canada
关键词
Spam detection; Embedding word; Domain-specific analysis; Semantic features; Deep learning; Classification;
D O I
10.1007/978-3-030-45371-8_5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the email spam filtering problem by proposing an approach based on two levels text semantic analysis. In the first level, a deep learning technique, based on Word2Vec is used to categorize emails by specific domains (e.g., health, education, finance, etc.). This enables a separate conceptual view for spams in each domain. In the second level, we extract a set of latent topics from email contents and represent them by rules to summarize the email content into compact topics discriminating spam from legitimate emails in an efficient way. The experimental study shows promising results in term of the precision of the spam detection.
引用
收藏
页码:72 / 81
页数:10
相关论文
共 50 条
  • [31] Visual Semantic-Based Representation Learning Using Deep CNNs for Scene Recognition
    Gupta, Shikha
    Sharma, Krishan
    Dinesh, Dileep Aroor
    Thenkanidiyoor, Veena
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (02)
  • [32] A novel deep learning model-based optimization algorithm for text message spam detection
    Das, Lipsa
    Ahuja, Laxmi
    Pandey, Adesh
    [J]. JOURNAL OF SUPERCOMPUTING, 2024, 80 (12): : 17823 - 17848
  • [33] Visualization Technology and Deep-Learning for Multilingual Spam Message Detection
    Lee, Hwabin
    Jeong, Sua
    Cho, Seogyeong
    Choi, Eunjung
    [J]. ELECTRONICS, 2023, 12 (03)
  • [34] Replacing Human Input in Spam Email Detection Using Deep Learning
    Nicho, Mathew
    Majdani, Farzan
    McDermott, Christopher D.
    [J]. ARTIFICIAL INTELLIGENCE IN HCI, AI-HCI 2022, 2022, 13336 : 387 - 404
  • [35] DeepCapture: Image Spam Detection Using Deep Learning and Data Augmentation
    Kim, Bedeuro
    Abuadbba, Sharif
    Kim, Hyoungshick
    [J]. INFORMATION SECURITY AND PRIVACY, ACISP 2020, 2020, 12248 : 461 - 475
  • [36] Spam SMS Detection for Turkish Language with Deep Text Analysis and Deep Learning Methods
    Onur Karasoy
    Serkan Ballı
    [J]. Arabian Journal for Science and Engineering, 2022, 47 : 9361 - 9377
  • [37] PhishTrim: Fast and adaptive phishing detection based on deep representation learning
    Zhang, Lei
    Zhang, Peng
    [J]. 2020 IEEE 13TH INTERNATIONAL CONFERENCE ON WEB SERVICES (ICWS 2020), 2020, : 176 - 180
  • [38] ABNORMAL CROWD BEHAVIOUR DETECTION BASED ON DEEP LEARNING AND SPARSE REPRESENTATION
    Gai, Zhendi
    Liu, Dongmei
    Chang, Faliang
    Li, Nanjun
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2020, 35 (04): : 322 - 331
  • [39] Unsupervised learning trajectory anomaly detection algorithm based on deep representation
    Wang, Zhongqiu
    Yuan, Guan
    Pei, Haoran
    Zhang, Yanmei
    Liu, Xiao
    [J]. INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2020, 16 (12):
  • [40] Learning Document Semantic Representation with Hybrid Deep Belief Network
    Yan, Yan
    Yin, Xu-Cheng
    Li, Sujian
    Yang, Mingyuan
    Hao, Hong-Wei
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2015, 2015