Sequential Alternating Proximal Method for Scalable Sparse Structural SVMs

被引:2
|
作者
Balamurugan, P. [1 ]
Shevade, Shirish [1 ]
Babu, T. Ravindra [2 ]
机构
[1] Indian Inst Sci, Comp Sci & Automat, Bangalore 560012, Karnataka, India
[2] Infosys Ltd, E Com Res Lab Educ & Res, Bangalore, Karnataka, India
关键词
Structural SVMs; Alternating Proximal method;
D O I
10.1109/ICDM.2012.81
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Structural Support Vector Machines (SSVMs) have recently gained wide prominence in classifying structured and complex objects like parse-trees, image segments and Part-of-Speech (POS) tags. Typical learning algorithms used in training SSVMs result in model parameters which are vectors residing in a large-dimensional feature space. Such a high-dimensional model parameter vector contains many non-zero components which often lead to slow prediction and storage issues. Hence there is a need for sparse parameter vectors which contain a very small number of non-zero components. L1-regularizer and elastic net regularizer have been traditionally used to get sparse model parameters. Though L1-regularized structural SVMs have been studied in the past, the use of elastic net regularizer for structural SVMs has not been explored yet. In this work, we formulate the elastic net SSVM and propose a sequential alternating proximal algorithm to solve the dual formulation. We compare the proposed method with existing methods for L1-regularized Structural SVMs. Experiments on large-scale benchmark datasets show that the proposed dual elastic net SSVM trained using the sequential alternating proximal algorithm scales well and results in highly sparse model parameters while achieving a comparable generalization performance. Hence the proposed sequential alternating proximal algorithm is a competitive method to achieve sparse model parameters and a comparable generalization performance when elastic net regularized Structural SVMs are used on very large datasets.
引用
收藏
页码:61 / 70
页数:10
相关论文
共 50 条
  • [1] Scalable sequential alternating proximal methods for sparse structural SVMs and CRFs
    P. Balamurugan
    Shirish Shevade
    T. Ravindra Babu
    [J]. Knowledge and Information Systems, 2014, 38 : 599 - 621
  • [2] Scalable sequential alternating proximal methods for sparse structural SVMs and CRFs
    Balamurugan, P.
    Shevade, Shirish
    Babu, T. Ravindra
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 38 (03) : 599 - 621
  • [3] Scalable Training of Sparse Linear SVMs
    Yuan, Guo-Xun
    Ma, Kwan-Liu
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 775 - 784
  • [4] Alternating proximal gradient method for sparse nonnegative Tucker decomposition
    Xu, Yangyang
    [J]. MATHEMATICAL PROGRAMMING COMPUTATION, 2015, 7 (01) : 39 - 70
  • [5] An augmented Lagrangian proximal alternating method for sparse discrete optimization problems
    Teng, Yue
    Yang, Li
    Song, Xiaoliang
    Yu, Bo
    [J]. NUMERICAL ALGORITHMS, 2020, 83 (03) : 833 - 866
  • [6] An augmented Lagrangian proximal alternating method for sparse discrete optimization problems
    Yue Teng
    Li Yang
    Xiaoliang Song
    Bo Yu
    [J]. Numerical Algorithms, 2020, 83 : 833 - 866
  • [7] Scalable Alternating Projection and Proximal Splitting for Array Pattern Synthesis
    Han, Yubing
    Wan, Chuan
    [J]. INTERNATIONAL JOURNAL OF ANTENNAS AND PROPAGATION, 2015, 2015
  • [8] Decoupling Sparse Coding with Fusion of Fisher Vectors and Scalable SVMs for Large-scale Visual Recognition
    Ji, Zhengping
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2013, : 450 - 457
  • [9] Group sparse optimization by alternating direction method
    Deng, Wei
    Yin, Wotao
    Zhang, Yin
    [J]. WAVELETS AND SPARSITY XV, 2013, 8858
  • [10] An Alternating Minimization Method for Sparse Channel Estimation
    Niazadeh, Rad
    Babaie-Zadeh, Massoud
    Jutten, Christian
    [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, 2010, 6365 : 319 - +