Boosting RVM classifiers for large data sets

被引:0
|
作者
Silva, Catarina [1 ,2 ]
Ribeir, Bernardete [2 ]
Sung, Andrew H. [3 ]
机构
[1] Pol Inst Leira, Sch Technol & Management, Leira, Portugal
[2] Univ Coimbra, Dept Informat Engn, Ctr Informat & Syst, P-3000 Coimbra, Portugal
[3] New Mexico Inst Min & Technol, Inst Comp Addit Sys Anal, Dept Comp Sci, Socorro, NM USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relevance Vector Machines (RVM) extend Support Vector Machines (SVM) to have probabilistic interpretations, to build sparse training models with fewer basis functions (i.e., relevance vectors or prototypes), and to realize Bayesian learning by placing priors over parameters (i.e., introducing hyperparameters). However, RVM algorithms do not scale up to large data sets. To overcome this problem, in this paper we propose a RVM boosting algorithm and demonstrate its potential with a text mining application. The idea is to build weaker classifiers, and then improve overall accuracy by using a boosting technique for document classification. The algorithm proposed is able to incorporate all the training data available; when combined with sampling techniques for choosing the working set, the boosted learning machine is able to attain high accuracy. Experiments on REUTERS benchmark show that the results achieve competitive accuracy against state-of-the-art SVM; meanwhile, the sparser solution found allows real-time implementations.
引用
收藏
页码:228 / +
页数:2
相关论文
共 50 条
  • [21] Boosting support vector machines for imbalanced data sets
    Benjamin X. Wang
    Nathalie Japkowicz
    [J]. Knowledge and Information Systems, 2010, 25 : 1 - 20
  • [22] Remote Sensing Data Binary Classification Using Boosting with Simple Classifiers
    Nowakowski, Artur
    [J]. ACTA GEOPHYSICA, 2015, 63 (05): : 1447 - 1462
  • [23] Boosting for Straggling and Flipping Classifiers
    Cassuto, Yuval
    Kim, Yongjune
    [J]. 2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 2441 - 2446
  • [24] Multiclass boosting for weak classifiers
    Eibl, G
    Pfeiffer, KP
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2005, 6 : 189 - 210
  • [25] Boosting classifiers for drifting concepts
    Scholz, Martin
    Klinkenberg, Ralf
    [J]. INTELLIGENT DATA ANALYSIS, 2007, 11 (01) : 3 - 28
  • [26] Boosting recombined weak classifiers
    Rodriguez, Juan J.
    Maudes, Jesus
    [J]. PATTERN RECOGNITION LETTERS, 2008, 29 (08) : 1049 - 1059
  • [27] Boosting with diverse base classifiers
    Dasgupta, S
    Long, PM
    [J]. LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 273 - 287
  • [28] Multiclass boosting for weak classifiers
    Eibl, Günther
    Pfeiffer, Karl-Peter
    [J]. Journal of Machine Learning Research, 2005, 6
  • [29] Evaluation Protocol of Early Classifiers over Multiple Data Sets
    Dachraoui, Asma
    Bondu, Alexis
    Cornuejols, Antoine
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2014), PT II, 2014, 8835 : 548 - 555
  • [30] An empirical study of the behavior of classifiers on imbalanced and overlapped data sets
    Garcia, Vicente
    Sanchez, Jose
    Mollineda, Ramon
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2007, 4756 : 397 - +