Spam filtering based on online ranking logistic regression

被引:0
|
作者
机构
[1] Sun, Guanglu
[2] Qi, Haoliang
来源
Sun, G. (guanglu_sun@163.com) | 1600年 / Tsinghua University卷 / 53期
关键词
Binary classification - Classification models - Discriminative models - Logistic Regression modeling - Machine learning methods - On-line rankings - Spam - Statistical significance;
D O I
暂无
中图分类号
学科分类号
摘要
Spam filtering is an important issue in Web information processing. Many machine learning methods are utilized to filter spam. Current researches transform the filtering problem into binary classification, in which the optimization target of the classification model is not consistent with 1-AUC, the usual evaluation measurement. The inconsistence results in the deviation of model optimization, which makes a bad influence on filtering results. In this study, spam filtering was transformed into the ranking model through the optimization oriented to 1-AUC with online ranking logistic regression model then proposed to tackle the deviation of the model's score in the online learning module. TONE (train on or near error), re-sampling and weights update methods were used to promote the learning speed in online adjustment of model's parameters. Experiments on open evaluation datasets show that the developed method is better than the traditional online logistic regression model with statistical significance.
引用
收藏
相关论文
共 50 条
  • [31] Online Efficient Secure Logistic Regression based on Function Secret Sharing
    Liu, Jing
    Cui, Jamie
    Chen, Cen
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 1597 - 1606
  • [32] Filtering spam
    Editor & Publisher, 1999, (Suppl):
  • [33] Filtering spam
    Baker, B
    INTERNET WORLD, 1998, 9 (01): : 14 - 14
  • [34] Social Context Based Naive Bayes Filtering of Spam Messages from Online Social Networks
    Kiliroor, Cinu C.
    Valliyammai, C.
    SOFT COMPUTING IN DATA ANALYTICS, SCDA 2018, 2019, 758 : 699 - 706
  • [35] A Hybrid Approach to Combining CART and Logistic Regression for Stock Ranking
    Zhu, Min
    Philpotts, David
    Sparks, Ross
    Stevenson, Maxwell J.
    JOURNAL OF PORTFOLIO MANAGEMENT, 2011, 38 (01): : 100 - +
  • [36] TARGETING OF ONLINE ADVERTISING USING LOGISTIC REGRESSION
    Soltes, Erik
    Taborecka-Petrovicova, Janka
    Sipoldova, Romana
    E & M EKONOMIE A MANAGEMENT, 2020, 23 (04): : 197 - 214
  • [37] Efficient Methods for Online Multiclass Logistic Regression
    Agarwal, Naman
    Kale, Satyen
    Zimmert, Julian
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [38] Low Time Complexity Model for Email Spam Detection using Logistic Regression
    Mrisho, Zubeda K.
    Sam, Anael Elkana
    Ndibwile, Jema David
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (12) : 112 - 118
  • [39] SPAM FILTERING SYSTEM BASED ON NB ALGORITHM
    Wen, Wanru
    INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE & TECHNOLOGY, PROCEEDINGS, 2009, : 122 - 124
  • [40] Characteristics Sharing Based Spam Filtering Method
    Long, Chengzhi
    Zhou, Jie
    CONFERENCE ON WEB BASED BUSINESS MANAGEMENT, VOLS 1-2, 2010, : 899 - 903