Confidence-Weighted Linear Classification for Text Categorization

被引:0
|
作者
Crammer, Koby [1 ]
Dredze, Mark [2 ]
Pereira, Fernando [3 ]
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
[2] Johns Hopkins Univ, Human Language Technol Ctr Excellence, Baltimore, MD 21211 USA
[3] Google Inc, Mountain View, CA 94043 USA
关键词
online learning; confidence prediction; text categorization; PERCEPTRON;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Confidence-weighted online learning is a generalization of margin-based learning of linear classifiers in which the margin constraint is replaced by a probabilistic constraint based on a distribution over classifier weights that is updated online as examples are observed. The distribution captures a notion of confidence on classifier weights, and in some cases it can also be interpreted as replacing a single learning rate by adaptive per-weight rates. Confidence-weighted learning was motivated by the statistical properties of natural-language classification tasks, where most of the informative features are relatively rare. We investigate several versions of confidence-weighted learning that use a Gaussian distribution over weight vectors, updated at each observed example to achieve high probability of correct classification for the example. Empirical evaluation on a range of text-categorization tasks show that our algorithms improve over other state-of-the-art online and batch methods, learn faster in the online setting, and lead to better classifier combination for a type of distributed training commonly used in cloud computing.
引用
收藏
页码:1891 / 1926
页数:36
相关论文
共 50 条
  • [1] Distributed Confidence-Weighted Classification on MapReduce
    Djuric, Nemanja
    Grbovic, Mihajlo
    Vucetic, Slobodan
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [2] Robust Cost-Sensitive Confidence-Weighted Classification
    Ali, Alnur
    Collins-Thompson, Kevyn
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 742 - 749
  • [3] An Improved Online Multiclass Classification Algorithm Based on Confidence-Weighted
    Hu, Ji
    Yan, Chenggang
    Zhang, Jiyong
    Peng, Dongliang
    Ren, Chengwei
    Yang, Shengying
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (06) : 840 - 849
  • [4] Soft Confidence-Weighted Learning
    Wang, Jialei
    Zhao, Peilin
    Hoi, Steven C. H.
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2016, 8 (01)
  • [5] Confidence-weighted Learning for Feature Evolution
    Liu Y.-F.
    Li W.-B.
    Gao Y.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (04): : 1315 - 1325
  • [6] Brain dynamics for confidence-weighted learning
    Meyniel, Florent
    PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (06)
  • [7] On the learning benefits of confidence-weighted testing
    Sparck E.M.
    Bjork E.L.
    Bjork R.A.
    Cognitive Research: Principles and Implications, 1 (1)
  • [8] Budgeted Algorithm for Linearized Confidence-Weighted Learning
    Lin, Yu-Shiou
    Wu, Chung-Hao
    Lu, Henry Horng-Shing
    PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON CLOUD AND BIG DATA COMPUTING (ICCBDC 2019), 2019, : 6 - 10
  • [9] Confidence-weighted bias model for online collaborative filtering
    Zhou, Xiuze
    Shu, Weibo
    Lin, Fan
    Wang, Beizhan
    APPLIED SOFT COMPUTING, 2018, 70 : 1042 - 1053
  • [10] Confidence-weighted extreme learning machine for regression problems
    Shang, Zhigen
    He, Jianqiang
    NEUROCOMPUTING, 2015, 148 : 544 - 550