Scaling text classification with relevance vector machines

被引:13
|
作者
Silva, Catarina [1 ]
Ribeiro, Bemardete [2 ]
机构
[1] Polytech Inst Leiria, Sch Technol & Management, P-2411901 Leiria, Portugal
[2] Univ Coimbra, Ctr Informat & Syst, Dept Informat Engn, P-3030290 Coimbra, Portugal
关键词
D O I
10.1109/ICSMC.2006.384791
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text classification (TC) is a complex ubiquitous task that handles a huge amount of data. Current research has recently proved that kernel learning based methods are quite effective in this problem. As opposed to Support Vector Machines (SVM), the Relevance Vector Machine (RVM) in particular yields a probabilistic output while preserving its accuracy. However, few research efforts have addressed the issue of scalability that arises when applying RVM to large scale problems like TC. We propose a new model which consists of a two-step RVM classifier able to (i) be competitive regarding processing time, (ii) use all available training elements and (iii) improve RVM classification performance. The paper also shows that a convenient similitude measure among documents can be defined on all the collection data, which does not only make the process swifter but also parallelizable. Using REUTERS-21578, we show that deployment of successful real-time applications is possible through reduction of the computational complexity and improvement of overall performance, obtained by the proposed model.
引用
收藏
页码:4186 / +
页数:2
相关论文
共 50 条
  • [1] Combining active learning and relevance vector machines for text classification
    Silva, C.
    Ribeiro, B.
    ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, : 130 - +
  • [2] Scaling feature selection method for enhancing the classification performance of Support Vector Machines in text mining
    Manochandar, S.
    Punniyamoorthy, M.
    COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 124 : 139 - 156
  • [3] Hyperspectral image classification with Mahalanobis Relevance Vector Machines
    Camps-Valls, Gustavo
    Rodrigo-Gonzalez, Antonio
    Munoz-Mari, Jordi
    Gomez-Chova, Luis
    Calpe-Maravilla, Javier
    IGARSS: 2007 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-12: SENSING AND UNDERSTANDING OUR PLANET, 2007, : 3802 - 3805
  • [4] Hyperspectral image classification using relevance vector machines
    Demir, Beguem
    Erturk, Sarp
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2007, 4 (04) : 586 - 590
  • [5] Exploring Relevance Vector Machines for Faster Pedestrian Classification
    Serra-Toro, Carlos
    Javier Traver, V.
    PATTERN RECOGNITION AND IMAGE ANALYSIS, IBPRIA 2013, 2013, 7887 : 509 - 516
  • [6] Relevance Vector Machines: Sparse Classification Methods for QSAR
    Burden, Frank R.
    Winkler, David A.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (08) : 1529 - 1534
  • [7] Virtual examples for text classification with support vector machines
    Sassano, M
    PROCEEDINGS OF THE 2003 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2003, : 208 - 215
  • [9] Hierarchical text classification based on support vector machines
    Jin, Ting
    Lei, Jingsheng
    Journal of Information and Computational Science, 2009, 6 (01): : 543 - 551
  • [10] Text classification of news articles with support vector machines
    Paass, G
    Kindermann, J
    Leopold, E
    TEXT MINING AND ITS APPLICATIONS, 2004, 138 : 53 - 64