Hierarchical Classification for Large-Scale Learning

Authors
Wang, Boshi [1]
Barbu, Adrian [1]
Affiliations
[1] Florida State Univ, Dept Stat, Tallahassee, FL 32306 USA
Keywords
large-scale learning; hierarchical classification; incremental class learning
DOI
10.3390/electronics12224646
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Deep neural networks (DNNs) have drawn much attention due to their success in various vision tasks. Current DNNs are used on data with a relatively small number of classes (e.g., 1000 or fewer) and employ a fully connected layer for classification, which allocates one neuron for each class; the per-example classification cost therefore scales as O(K) with the number of classes K. This approach is computationally intensive for many real-life applications where the number of classes is very large (e.g., tens of thousands of classes). To address this problem, our paper introduces a hierarchical approach for classification with a large number of classes that scales as O(√K) and could be extended to O(log K) with a deeper hierarchy. The method, called Hierarchical PPCA, uses a self-supervised pretrained feature extractor to obtain meaningful features and trains Probabilistic PCA models on the extracted features for each class separately, making it easy to add classes without retraining the whole model. The Mahalanobis distance is used to obtain the classification result. To speed up classification, the proposed Hierarchical PPCA framework clusters the image class models, represented as Gaussians, into a smaller number of super-classes using a modified k-means clustering algorithm. The speed increase comes from assigning a sample to a small number of the most likely super-classes and restricting the image classification to the image classes that belong to these super-classes. Because the model is trained on each class separately, it is applicable to training on very large datasets such as the whole ImageNet with more than 10,000 classes. Experiments on three standard datasets (ImageNet-100, ImageNet-1k, and ImageNet-10k) indicate that the hierarchical classifier can achieve superior accuracy with up to a 16-fold speed increase compared to a standard fully connected classifier.
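The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the features are synthetic rather than produced by a pretrained extractor, plain k-means on the class means stands in for the paper's modified k-means on Gaussians, super-classes are ranked by Euclidean distance to their centers, and all function names (fit_ppca, mahalanobis_sq, kmeans, classify) and parameters (q, top_t) are hypothetical.

```python
import numpy as np

def fit_ppca(X, q):
    """Fit a PPCA Gaussian to the rows of X; return (mean, inverse covariance)."""
    mu = X.mean(axis=0)
    evals, evecs = np.linalg.eigh(np.cov(X, rowvar=False))  # ascending order
    evals, evecs = evals[::-1], evecs[:, ::-1]               # make descending
    sigma2 = max(evals[q:].mean(), 1e-6)     # noise variance from discarded directions
    lam = np.maximum(evals[:q], sigma2)      # variances along the q principal directions
    V = evecs[:, :q]
    # C = V diag(lam - sigma2) V^T + sigma2 * I, inverted in closed form.
    prec = np.eye(X.shape[1]) / sigma2 + V @ np.diag(1.0 / lam - 1.0 / sigma2) @ V.T
    return mu, prec

def mahalanobis_sq(x, mu, prec):
    """Squared Mahalanobis distance of x to a class model."""
    diff = x - mu
    return float(diff @ prec @ diff)

def kmeans(points, k, iters=20, seed=0):
    """Plain k-means (an assumption; the paper uses a modified variant)."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(iters):
        d2 = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = points[labels == j].mean(axis=0)
    d2 = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return centers, d2.argmin(axis=1)

def classify(x, models, centers, labels, top_t=1):
    """Score only the classes that belong to the top_t nearest super-classes."""
    near = set(np.argsort(((centers - x) ** 2).sum(axis=1))[:top_t].tolist())
    candidates = [c for c in range(len(models)) if int(labels[c]) in near]
    return min(candidates, key=lambda c: mahalanobis_sq(x, *models[c]))

# Toy demo: 6 well-separated classes in 8 dimensions, grouped into 3 super-classes.
rng = np.random.default_rng(0)
K, d = 6, 8
true_means = 100.0 * np.eye(d)[:K]
models = [fit_ppca(true_means[c] + rng.normal(size=(200, d)), q=3) for c in range(K)]
class_means = np.stack([mu for mu, _ in models])
centers, labels = kmeans(class_means, k=3)
preds = [classify(true_means[c] + 0.1 * rng.normal(size=d), models, centers, labels)
         for c in range(K)]
print(preds)
```

In this sketch, adding a class only requires one more fit_ppca call and a super-class assignment for the new class mean; the existing class models are left untouched, which mirrors the incremental property claimed in the abstract.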
Pages: 19
Related papers
50 in total
  • [1] Cost-sensitive Learning for Large-scale Hierarchical Classification
    Chen, Jianfu
    Warren, David
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1351 - 1360
  • [2] Hierarchical learning of large-margin metrics for large-scale image classification
    Lei, Hao
    Mei, Kuizhi
    Xin, Jingmin
    Dong, Peixiang
    Fan, Jianping
    [J]. NEUROCOMPUTING, 2016, 208 : 46 - 58
  • [3] Joint Hierarchical Category Structure Learning and Large-Scale Image Classification
    Qu, Yanyun
    Lin, Li
    Shen, Fumin
    Lu, Chang
    Wu, Yang
    Xie, Yuan
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (09) : 4331 - 4346
  • [4] Adaptive Classifier Selection in Large-Scale Hierarchical Classification
    Partalas, Ioannis
    Babbar, Rohit
    Gaussier, Eric
    Amblard, Cecile
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 612 - 619
  • [5] Embedding Feature Selection for Large-scale Hierarchical Classification
    Naik, Azad
    Rangwala, Huzefa
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1212 - 1221
  • [6] Exploring and exploiting hierarchical structures for large-scale classification
    Zheng, Junyan
    Wang, Yu
    Pei, Shenglei
    Hu, Qinghua
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (06) : 2427 - 2437
  • [7] Integrated Framework for Improving Large-scale Hierarchical Classification
    Naik, Azad
    Rangwala, Huzefa
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 281 - 288
  • [8] Hierarchical Semantic Risk Minimization for Large-Scale Classification
    Wang, Yu
    Wang, Zhou
    Hu, Qinghua
    Zhou, Yucan
    Su, Honglei
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 9546 - 9558
  • [9] Hierarchical Learning for Large-Scale Image Classification via CNN and Maximum Confidence Path
    Lu, Chang
    Qu, Yanyun
    Shi, Cuiting
    Fan, Jianping
    Wu, Yang
    Wang, Hanzi
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2015, PT II, 2015, 9315 : 236 - 245
  • [10] Hierarchical learning of multi-task sparse metrics for large-scale image classification
    Zheng, Yu
    Fan, Jianping
    Zhang, Ji
    Gao, Xinbo
    [J]. PATTERN RECOGNITION, 2017, 67 : 97 - 109