A sparse version of the ridge logistic regression for large-scale text categorization

Cited by: 28
Authors
Aseervatham, Sujeevan [1]
Antoniadis, Anestis [2]
Gaussier, Eric [1]
Burlet, Michel [3]
Denneulin, Yves [4]
Affiliations
[1] Univ Grenoble 1, LIG, F-38041 Grenoble 9, France
[2] Univ Grenoble 1, LJK, F-38041 Grenoble 9, France
[3] Univ Grenoble 1, Lab Leibniz, F-38031 Grenoble 1, France
[4] ENSIMAG, LIG, F-38330 Montbonnot St Martin, France
Keywords
Logistic regression; Model selection; Text categorization; Large scale categorization; REGULARIZATION; SELECTION; MODEL;
DOI
10.1016/j.patrec.2010.09.023
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Ridge logistic regression has been used successfully in text categorization problems, and it has been shown to reach the same performance as the Support Vector Machine, with the main advantage of computing a probability value rather than a score. However, the dense solution of the ridge makes its use impractical for large-scale categorization. On the other hand, LASSO regularization is able to produce sparse solutions, but its performance is dominated by the ridge when the number of features is larger than the number of observations and/or when the features are highly correlated. In this paper, we propose a new model selection method which tries to approach the ridge solution by a sparse solution. The method first computes the ridge solution and then performs feature selection. The experimental evaluations show that our method gives a solution which is a good trade-off between the ridge and LASSO solutions. (C) 2010 Elsevier B.V. All rights reserved.
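The two-stage idea in the abstract (fit a dense ridge solution, then select features) can be illustrated with a minimal sketch. The abstract does not state the selection criterion, so everything specific here is an assumption made for illustration: keeping the `k` weights of largest magnitude, refitting on the kept features, the plain gradient-descent solver, the hyperparameter values, and the toy data generator `make_toy_data`. It is not the authors' algorithm.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_ridge_logistic(X, y, lam=0.1, lr=0.5, epochs=200, active=None):
    """Gradient descent on mean log-loss + (lam / 2) * ||w||^2.

    `active` restricts training to a subset of feature indices;
    every other weight stays exactly zero.
    """
    n, d = len(X), len(X[0])
    active = list(range(d)) if active is None else list(active)
    w = [0.0] * d
    for _ in range(epochs):
        grad = [0.0] * d
        for xi, yi in zip(X, y):
            p = sigmoid(sum(w[j] * xi[j] for j in active))
            for j in active:
                grad[j] += (p - yi) * xi[j] / n
        for j in active:
            w[j] -= lr * (grad[j] + lam * w[j])
    return w


def sparsify_ridge(X, y, k, lam=0.1):
    """Stage 1: dense ridge fit. Stage 2 (assumed rule): keep top-k |w_j|, refit."""
    w_dense = fit_ridge_logistic(X, y, lam=lam)
    ranked = sorted(range(len(w_dense)), key=lambda j: abs(w_dense[j]), reverse=True)
    keep = sorted(ranked[:k])
    w_sparse = fit_ridge_logistic(X, y, lam=lam, active=keep)
    return w_dense, w_sparse, keep


def make_toy_data(n=200, d=10, seed=0):
    # Hypothetical data: only features 0 and 1 carry signal.
    rng = random.Random(seed)
    X = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(n)]
    y = [1 if 2.0 * x[0] - 2.0 * x[1] + 0.3 * rng.gauss(0, 1) > 0 else 0 for x in X]
    return X, y


if __name__ == "__main__":
    X, y = make_toy_data()
    w_dense, w_sparse, keep = sparsify_ridge(X, y, k=2)
    print("kept features:", keep)
    print("nonzero weights:", sum(1 for v in w_sparse if v != 0.0))
```

The dense fit assigns a nonzero weight to every feature, which is the storage and prediction cost the abstract calls impractical at scale; the second stage trades a small part of that fit for a weight vector with at most `k` nonzero entries.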
Pages: 101-106
Page count: 6
Related papers
50 in total
  • [41] LARGE-SCALE VISUALIZATION OF SPARSE MATRICES
    Langr, D.
    Simecek, I.
    Tvrdik, P.
    Dytrych, T.
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2014, 15 (01): : 21 - 31
  • [42] An extended Newton-type algorithm for l2-regularized sparse logistic regression and its efficiency for classifying large-scale datasets
    Wang, Rui
    Xiu, Naihua
    Zhou, Shenglong
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2021, 397
  • [43] Large Scale Image Categorization in Sparse Nonparametric Bayesian Representation
    Xing, Sun
    Yung, Nelson H. C.
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1365 - 1370
  • [44] LARGE-SCALE DEFORMATION ASSOCIATED WITH RIDGE SUBDUCTION
    GEIST, EL
    FISHER, MA
    SCHOLL, DW
    GEOPHYSICAL JOURNAL INTERNATIONAL, 1993, 115 (02) : 344 - 366
  • [45] SIMULATION OF LARGE-SCALE INDUSTRIAL AND LOGISTIC SYSTEMS
    GEISLER, M
    MANAGEMENT SCIENCE, 1959, 5 (03) : 347 - 347
  • [46] An interior-point method for large-scale l1-regularized logistic regression
    Koh, Kwangmoo
    Kim, Seung-Jean
    Boyd, Stephen
    JOURNAL OF MACHINE LEARNING RESEARCH, 2007, 8 : 1519 - 1555
  • [47] Detecting differential item functioning using generalized logistic regression in the context of large-scale assessments
    Svetina D.
    Rutkowski L.
    Large-scale Assessments in Education, 2 (1)
  • [48] Computing Leapfrog Regularization Paths with Applications to Large-Scale K-mer Logistic Regression
    Benner, Philipp
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2021, 28 (06) : 560 - 569
  • [49] Semi-Supervised Learning in Large Scale Text Categorization
    许泽文
    李建强
    刘博
    毕敬
    李蓉
    毛睿
    Journal of Shanghai Jiaotong University(Science), 2017, 22 (03) : 291 - 302
  • [50] Semi-supervised learning in large scale text categorization
    Xu Z.
    Li J.
    Liu B.
    Bi J.
    Li R.
    Mao R.
    Journal of Shanghai Jiaotong University (Science), 2017, 22 (3) : 291 - 302