Large-Scale Learning with Structural Kernels for Class-Imbalanced Datasets

被引:0
|
作者
Severyn, Aliaksei [1 ]
Moschitti, Alessandro [1 ]
机构
[1] Univ Trento, Dept Comp Sci & Engn, I-38123 Povo, TN, Italy
来源
ETERNAL SYSTEMS | 2012年 / 255卷
关键词
Machine Learning; Kernel Methods; Structural Kernels; Support Vector Machine; Natural Language Processing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Much of the success in machine learning can be attributed to the ability of learning methods to adequately represent, extract, and exploit inherent structure present in the data under interest. Kernel methods represent a rich family of techniques that harvest on this principle. Domain-specific kernels are able to exploit rich structural information present in the input data to deliver state of the art results in many application areas, e.g. natural language processing (NLP), bio-informatics, computer vision and many others. The use of kernels to capture relationships in the input data has made Support Vector Machine (SVM) algorithm the state of the art tool in many application areas. Nevertheless, kernel learning remains a computationally expensive process. The contribution of this paper is to make learning with structural kernels, e.g. tree kernels, more applicable to real-world large-scale tasks. More specifically, we propose two important enhancements of the approximate cutting plane algorithm to train Support Vector Machines with structural kernels: (i) a new sampling strategy to handle class-imbalanced problem; and (ii) a parallel implementation, which makes the training scale almost linearly with the number of CPUs. We also show that theoretical convergence bounds are preserved for the improved algorithm. The experimental evaluations demonstrate the soundness of our approach and the possibility to carry out large-scale learning with structural kernels.
引用
下载
收藏
页码:34 / 41
页数:8
相关论文
共 50 条
  • [1] SGBGAN: minority class image generation for class-imbalanced datasets
    Wan, Qian
    Guo, Wenhui
    Wang, Yanjiang
    MACHINE VISION AND APPLICATIONS, 2024, 35 (02)
  • [2] Large-Scale Support Vector Learning with Structural Kernels
    Severyn, Aliaksei
    Moschitti, Alessandro
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT III, 2010, 6323 : 229 - 244
  • [3] Class-imbalanced semi-supervised learning for large-scale point cloud semantic segmentation via decoupling optimization
    Li, Mengtian
    Lin, Shaohui
    Wang, Zihan
    Shen, Yunhang
    Zhang, Baochang
    Ma, Lizhuang
    PATTERN RECOGNITION, 2024, 156
  • [4] SGBGAN: minority class image generation for class-imbalanced datasets
    Qian Wan
    Wenhui Guo
    Yanjiang Wang
    Machine Vision and Applications, 2024, 35
  • [6] Prediction of Adult Chronic Kidney Disease with Class-Imbalanced Datasets
    Zhu Cuiliang
    Yuan Jiucun
    Yang Chenwei
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3231 - 3238
  • [7] A Cost-Sensitive Ensemble Method for Class-Imbalanced Datasets
    Zhang, Yong
    Wang, Dapeng
    ABSTRACT AND APPLIED ANALYSIS, 2013,
  • [8] Margin calibration in SVM class-imbalanced learning
    Yang, Chan-Yun
    Yang, Jr-Syu
    Wang, Jian-Jun
    NEUROCOMPUTING, 2009, 73 (1-3) : 397 - 411
  • [9] Prototypical Classifier for Robust Class-Imbalanced Learning
    Wei, Tong
    Shi, Jiang-Xin
    Li, Yu-Feng
    Zhang, Min-Ling
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT II, 2022, 13281 : 44 - 57
  • [10] Deep learning approach for defective spot welds classification using small and class-imbalanced datasets
    Dai, Wei
    Li, Dayong
    Tang, Ding
    Wang, Huamiao
    Peng, Yinghong
    NEUROCOMPUTING, 2022, 477 : 46 - 60