Compression-based spam filter

Cited by: 6
Authors
Almeida, Tiago A. [1]
Yamakami, Akebo [2]
Affiliations
[1] Fed Univ Sao Carlos UFSCar, Dept Comp Sci, BR-18052780 Sorocaba, SP, Brazil
[2] Univ Campinas UNICAMP, Sch Elect & Comp Engn, BR-13083970 Campinas, SP, Brazil
Funding
São Paulo Research Foundation, Brazil
Keywords
compression-based model; spam filter; text categorization; knowledge-based system; machine learning; CLASSIFICATION;
DOI
10.1002/sec.639
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Nowadays, e-mail spam is not a novelty, but it is still an important problem with a high impact on the economy. Spam filtering poses a special problem in text categorization, in which the defining characteristic is that filters face an active adversary, which constantly attempts to evade filtering. In this paper, we present a novel approach to spam filtering based on a compression-based model. We have conducted an empirical experiment on eight public and real non-encoded datasets. The results indicate that the proposed filter is fast to construct, is incrementally updateable, and clearly outperforms established spam classifiers. Copyright (c) 2012 John Wiley & Sons, Ltd.
Pages: 327-335
Number of pages: 9