Hierarchical classification of data with long-tailed distributions via global and local granulation

被引:7
|
作者
Zhao, Hong [1 ,2 ]
Guo, Shunxin [1 ]
Lin, Yaojin [1 ,2 ]
机构
[1] Minnan Normal Univ, Sch Comp Sci, Zhangzhou 363000, Fujian, Peoples R China
[2] Fujian Prov Univ, Key Lab Data Sci & Intelligence Applicat, Zhangzhou 363000, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Hierarchical classification; Long-tailed distribution; Global and local granulation; Granular computing; FEATURE-SELECTION; SET;
D O I
10.1016/j.ins.2021.09.059
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automated learning from datasets with a long-tailed distribution has gradually become a research hotspot due to the increasing complexity of large-scale real-world datasets. Existing solutions to long-tailed data classification usually involve re-balancing strategies for global optimization, which can achieve satisfactory results. However, re-balancing strategies tend to alter the original data. In this paper, we propose a knowledge granulation method based on global and local granulation to assist the hierarchical classification of long-tailed data without altering the original data. Firstly, a global classifier is constructed based on the WordNet knowledge organization's hierarchical structure, which is used to granulate the global data from coarse to fine. Secondly, a local hierarchical classifier adapted to tail data is constructed for tail classes that contain few samples. The hierarchical structure of this local classifier is obtained by granulating the data via spectral clustering rather than by using the semantic hierarchy of classes. Finally, the global classifier is used to preliminarily classify samples, then uncertain samples are further classified by the tail local classifier. Experimental results show that the proposed method outperforms several state-of-the-art models designed for the hierarchical classification of long-tailed data. (c) 2021 Elsevier Inc. All rights reserved.
引用
收藏
页码:536 / 552
页数:17
相关论文
共 50 条
  • [41] Nonlocal Hybrid Network for Long-tailed Image Classification
    Liang, Rongjiao
    Zhang, Shichao
    Zhang, Wenzhen
    Zhang, Guixian
    Tang, Jinyun
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (04)
  • [42] Invariant Feature Learning for Generalized Long-Tailed Classification
    Tang, Kaihua
    Tao, Mingyuan
    Qi, Jiaxin
    Liu, Zhenguang
    Zhang, Hanwang
    [J]. COMPUTER VISION, ECCV 2022, PT XXIV, 2022, 13684 : 709 - 726
  • [43] Long-tailed graph neural networks via graph structure learning for node classification
    Lin, Junchao
    Wan, Yuan
    Xu, Jingwen
    Qi, Xingchen
    [J]. APPLIED INTELLIGENCE, 2023, 53 (17) : 20206 - 20222
  • [44] ROBUST CONFIDENCE INTERVAL FOR LOCATION FOR SYMMETRIC, LONG-TAILED DISTRIBUTIONS
    GROSS, AM
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1973, 70 (07) : 1995 - 1997
  • [45] CONFIDENCE-INTERVAL ROBUSTNESS WITH LONG-TAILED SYMMETRIC DISTRIBUTIONS
    GROSS, AM
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1976, 71 (354) : 409 - 416
  • [46] Long-Tailed Recognition by Hierarchical Rebalancing Dual-Classifier
    Zhang, Junsong
    Gao, Linsheng
    Li, Hao
    Zhou, Hao
    [J]. IEEE ACCESS, 2023, 11 : 54839 - 54848
  • [47] Adaptive Hierarchical Representation Learning for Long-Tailed Object Detection
    Li, Banghuai
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 2303 - 2312
  • [48] Hierarchical block aggregation network for long-tailed visual recognition
    Pang, Shanmin
    Wang, Weiye
    Zhang, Renzhong
    Hao, Wenyu
    [J]. NEUROCOMPUTING, 2023, 549
  • [49] Small-world networks: Links with long-tailed distributions
    Jespersen, S
    Blumen, A
    [J]. PHYSICAL REVIEW E, 2000, 62 (05): : 6270 - 6274
  • [50] Flexible parametric models for long-tailed patent count distributions
    Guo, JQ
    Trivedi, PK
    [J]. OXFORD BULLETIN OF ECONOMICS AND STATISTICS, 2002, 64 (01) : 63 - 82