Automatic junk e-mail filtering based on latent content

Cited by: 6
Authors
Bellegarda, JR [1]
Naik, D [1]
Silverman, KEA [1]
Affiliation
[1] Apple Comp Inc, Spoken Language Grp, Cupertino, CA 95014 USA
DOI
10.1109/ASRU.2003.1318485
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The explosion in unsolicited mass electronic mail (junk e-mail) over the past decade has sparked interest in automatic filtering solutions. Traditional techniques tend to rely on header analysis, keyword/keyphrase matching and analogous rule-based predicates, and/or some probabilistic model of text generation. This paper aims instead at deciding whether or not the latent subject matter is consistent with the user's interests. The underlying framework is latent semantic analysis: each e-mail is automatically classified against two semantic anchors, one for legitimate and one for junk messages. Experiments show that this approach is competitive with the state of the art in e-mail classification, and potentially advantageous in real-world applications with high junk-to-legitimate ratios. The resulting technology was successfully released in August 2002 as part of the e-mail client bundled with the MacOS 10.2 operating system.
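The two-anchor idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy corpus, the class name `LatentAnchorFilter`, the use of raw word counts without any term weighting, the choice of class centroids as anchors, and the tie-breaking rule are all assumptions made for illustration. The sketch builds a term-by-document matrix, reduces it with a truncated SVD (the core of latent semantic analysis), and labels a new message by whichever anchor is closer in cosine similarity.

```python
import numpy as np

# Toy training data (hypothetical, for illustration only).
LEGIT = ["meeting agenda for project review",
         "project budget meeting tomorrow",
         "lunch with the project team"]
JUNK = ["win free money now",
        "free prize claim your money",
        "cheap pills free offer now"]

def count_vector(doc, vocab):
    """Raw word counts of `doc` over a fixed vocabulary (unknown words ignored)."""
    v = np.zeros(len(vocab))
    for w in doc.lower().split():
        if w in vocab:
            v[vocab[w]] += 1.0
    return v

def cosine(a, b):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

class LatentAnchorFilter:
    """Hypothetical two-anchor classifier in a rank-k latent space."""

    def __init__(self, legit_docs, junk_docs, rank=2):
        docs = list(legit_docs) + list(junk_docs)
        words = sorted({w for d in docs for w in d.lower().split()})
        self.vocab = {w: i for i, w in enumerate(words)}
        # Term-by-document matrix W (terms x docs), then W = U diag(s) Vt.
        W = np.column_stack([count_vector(d, self.vocab) for d in docs])
        U, s, Vt = np.linalg.svd(W, full_matrices=False)
        k = min(rank, len(s))
        self.U = U[:, :k]
        # Scaled document coordinates: rows of V * s, which equal d^T U.
        doc_vecs = Vt[:k].T * s[:k]
        n = len(legit_docs)
        # Assumption: each anchor is the centroid of its class's documents.
        self.legit_anchor = doc_vecs[:n].mean(axis=0)
        self.junk_anchor = doc_vecs[n:].mean(axis=0)

    def fold_in(self, doc):
        """Project an unseen message into the latent space (d^T U)."""
        return count_vector(doc, self.vocab) @ self.U

    def classify(self, doc):
        v = self.fold_in(doc)
        junk_sim = cosine(v, self.junk_anchor)
        legit_sim = cosine(v, self.legit_anchor)
        # Assumption: ties (e.g. no known words) default to "legit".
        return "junk" if junk_sim > legit_sim else "legit"

spam_filter = LatentAnchorFilter(LEGIT, JUNK, rank=2)
print(spam_filter.classify("free money offer"))
print(spam_filter.classify("project meeting agenda"))
```

Defaulting ties to "legit" reflects the usual preference for letting a junk message through over discarding a legitimate one; a practical system would presumably also weight terms and train on far more data than this toy corpus.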
Pages: 465-470
Page count: 6
Related papers
  • [1] Content Based Spam E-mail Filtering
    Liu, Pingchuan
    Moh, Teng-Sheng
    2016 INTERNATIONAL CONFERENCE ON COLLABORATION TECHNOLOGIES AND SYSTEMS (CTS), 2016, : 218 - 224
  • [2] Junk e-mail filtering model based on source address restraints
    Liang, Li
    Yan, Jianwei
    Nie, Ying
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2005, 39 (04): : 376 - 379
  • [3] Is e-mail turning into junk mail?
    Heney, PJ
    HYDRAULICS & PNEUMATICS, 1996, 49 (10) : 4 - 4
  • [4] Collaborative junk e-mail filtering based on multi-agent systems
    Jung, JJ
    Jo, GS
    WEB AND COMMUNICATION TECHNOLOGIES AND INTERNET-RELATED SOCIAL ISSUES - HSI 2003, 2003, 2713 : 218 - 227
  • [5] Putting the lid on junk e-mail
    Dunlop, A
    INTERNET WORLD, 1997, 8 (09): : 15 - 16
  • [6] Junk e-mail 'is not free speech'
    Kleiner, K
    NEW SCIENTIST, 1996, 152 (2057) : 14 - 14
  • [7] AOL: Essential for sending junk e-mail?
    Stern, RH
    IEEE MICRO, 1997, 17 (02) : 7 - 8
  • [8] A neural network classifier for junk e-mail
    Stuart, I
    Cha, SH
    Tappert, C
    DOCUMENT ANALYSIS SYSTEMS VI, PROCEEDINGS, 2004, 3163 : 442 - 450
  • [9] SURFERS STEM TIDE OF JUNK E-MAIL
    Unknown
    NEW SCIENTIST, 1995, 148 (2003) : 5 - 5
  • [10] Using E-mail Authentication and Disposable E-mail Addressing for Filtering Spam
    Luo, Jia-Ning
    Yang, Ming Hour
    2009 10TH INTERNATIONAL SYMPOSIUM ON PERVASIVE SYSTEMS, ALGORITHMS, AND NETWORKS (ISPAN 2009), 2009, : 356 - +