Reweighting Forest for Extreme Multi-label Classification

被引:1
|
作者
Lin, Zhun-Zheng [1 ]
Dai, Bi-Ru [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Dept Comp Sci & Informat Engn, 43,Sect 4,Keelung Rd, Taipei 106, Taiwan
关键词
Multi-label classification; Random forest; Extreme classification;
D O I
10.1007/978-3-319-64283-3_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, data volume is getting larger along with the fast development of Internet technologies. Some datasets contain a huge number of labels, dimensions and data points. As a result, some of them cannot be loaded by typical classifiers, and some of them require very long and unacceptable time for execution. Extreme multi-label classification is designed for these challenges. Extreme multi-label classification differs from traditional multi-label classification in a number of ways including the need for lower execution time, training at an extreme scale with millions of data points, features and labels, etc. In order to enhance the practicality, in this paper, we focus on designing an extreme multi-label classification approach which can be performed on a single person-al computer. We devise a two-phase framework for dealing with the above issues. In the reweighting phase, the prediction precision is improved by paying more attention on hard-to-classify instances and increasing the diversity of the model. In the pretesting phase, trees with lower quality will be removed from the prediction model for reducing the model size and increasing the prediction precision. Experiments on real world datasets will verify that the pro-posed method is able to generate better prediction results and the model size is successfully shrunk down.
引用
收藏
页码:286 / 299
页数:14
相关论文
共 50 条
  • [1] Extreme Learning Machine for Multi-Label Classification
    Sun, Xia
    Xu, Jingting
    Jiang, Changmeng
    Feng, Jun
    Chen, Su-Shing
    He, Feijuan
    [J]. ENTROPY, 2016, 18 (06)
  • [2] Multi-Label Classification with Extreme Learning Machine
    Kongsorot, Yanika
    Horata, Punyaphol
    [J]. 2014 6TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST), 2014, : 81 - 86
  • [3] Extreme Multi-label Classification for Information Retrieval
    Dembczynski, Krzysztof
    Babbar, Rohit
    [J]. ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 : 839 - 840
  • [4] Evaluating Extreme Hierarchical Multi-label Classification
    Amigo, Enrique
    Delgado, Agustin D.
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 5809 - 5819
  • [5] ML-FOREST: A Multi-Label Tree Ensemble Method for Multi-Label Classification
    Wu, Qingyao
    Tan, Mingkui
    Song, Hengjie
    Chen, Jian
    Ng, Michael K.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (10) : 2665 - 2680
  • [6] Deep Learning for Extreme Multi-label Text Classification
    Liu, Jingzhou
    Chang, Wei-Cheng
    Wu, Yuexin
    Yang, Yiming
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 115 - 124
  • [7] Sparse Local Embeddings for Extreme Multi-label Classification
    Bhatia, Kush
    Jain, Himanshu
    Kar, Purushottam
    Varma, Manik
    Jain, Prateek
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [8] Correlation Networks for Extreme Multi-label Text Classification
    Xun, Guangxu
    Jha, Kishlay
    Sun, Jianhui
    Zhang, Aidong
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1074 - 1082
  • [9] Data scarcity, robustness and extreme multi-label classification
    Rohit Babbar
    Bernhard Schölkopf
    [J]. Machine Learning, 2019, 108 : 1329 - 1351
  • [10] Data scarcity, robustness and extreme multi-label classification
    Babbar, Rohit
    Schoelkopf, Bernhard
    [J]. MACHINE LEARNING, 2019, 108 (8-9) : 1329 - 1351