Automatic junk e-mail filtering based on latent content

Cited by: 6
Authors
Bellegarda, JR [1 ]
Naik, D [1 ]
Silverman, KEA [1 ]
Affiliation
[1] Apple Comp Inc, Spoken Language Grp, Cupertino, CA 95014 USA
DOI
10.1109/ASRU.2003.1318485
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The explosion in unsolicited mass electronic mail (junk e-mail) over the past decade has sparked interest in automatic filtering solutions. Traditional techniques tend to rely on header analysis, keyword/keyphrase matching and analogous rule-based predicates, and/or some probabilistic model of text generation. This paper aims instead at deciding whether the latent subject matter of a message is consistent with the user's interests. The underlying framework is latent semantic analysis: each e-mail is automatically classified against two semantic anchors, one for legitimate and one for junk messages. Experiments show that this approach is competitive with the state of the art in e-mail classification, and potentially advantageous in real-world applications with high junk-to-legitimate ratios. The resulting technology was successfully released in August 2002 as part of the e-mail client bundled with the MacOS 10.2 operating system.
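The two-anchor idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how the anchors are built or how closeness is measured, so the sketch assumes raw word counts, a truncated SVD of the term-by-document matrix, anchors taken as the centroids of the projected training messages, and cosine similarity as the closeness measure. All function names (`train`, `classify`, and so on) are hypothetical.

```python
import numpy as np

def build_vocab(docs):
    """Map each distinct lowercase token to a row index."""
    vocab = sorted({w for d in docs for w in d.lower().split()})
    return {w: i for i, w in enumerate(vocab)}

def count_vector(doc, vocab):
    """Raw term counts for one message (assumption: no weighting scheme)."""
    v = np.zeros(len(vocab))
    for w in doc.lower().split():
        if w in vocab:
            v[vocab[w]] += 1.0
    return v

def train(legit_docs, junk_docs, rank=2):
    """Build an LSA space and one anchor per class (assumed: class centroids)."""
    docs = legit_docs + junk_docs
    vocab = build_vocab(docs)
    # Term-by-document matrix: one column per training message.
    W = np.stack([count_vector(d, vocab) for d in docs], axis=1)
    U, s, _ = np.linalg.svd(W, full_matrices=False)
    k = min(rank, len(s))
    U_k = U[:, :k]                      # top-k left singular vectors
    proj = U_k.T @ W                    # training messages in the latent space
    n_legit = len(legit_docs)
    anchors = {
        "legitimate": proj[:, :n_legit].mean(axis=1),
        "junk": proj[:, n_legit:].mean(axis=1),
    }
    return vocab, U_k, anchors

def cosine(a, b):
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

def classify(message, vocab, U_k, anchors):
    """Project a new message and return the label of the closer anchor."""
    z = U_k.T @ count_vector(message, vocab)
    # Ties (e.g. a message with no known words) fall back to the first label.
    return max(anchors, key=lambda label: cosine(z, anchors[label]))

if __name__ == "__main__":
    legit = ["meeting agenda project schedule",
             "project budget meeting notes",
             "schedule review meeting project"]
    junk = ["free money offer win",
            "win free prize offer now",
            "cheap offer free money now"]
    vocab, U_k, anchors = train(legit, junk)
    print(classify("free offer win money", vocab, U_k, anchors))
    print(classify("project meeting schedule", vocab, U_k, anchors))
```

Because the decision depends only on which anchor is closer in the latent space, a message can be flagged even when it shares few exact keywords with any single training message, which is the contrast with keyword matching that the abstract draws.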
Pages: 465-470
Page count: 6