Multi-Attribute Learning With Highly Imbalanced Data

被引:2
|
作者
Beltran, L. Viviana Beltran [1 ]
Coustaty, Mickael [1 ]
Journet, Nicholas [2 ]
Caicedo, Juan C. [3 ]
Doucet, Antoine [1 ]
机构
[1] Univ La Rochelle, F-17000 La Rochelle, France
[2] Univ Bordeaux, F-33000 Bordeaux, France
[3] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
关键词
D O I
10.1109/ICPR48806.2021.9412634
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data is one of the most important keys for success when studying a simple or a complex phenomenon. With the use of deep learning exploding and its democratization, non-computer science experts may struggle to use highly complex deep learning architectures, even when straightforward models offer them suitable performances. In this article, we study the specific and common problem of data imbalance in real databases as most of the bad performance problems are due to the data itself. We cover two keys aspects. First, we propose ways to deal with the situation when a data set contains different levels of imbalance. Classical imbalanced learning strategies cannot be directly applied when using multi-attribute deep learning models, i.e., multi-task or multi-label architectures. Therefore, one of our contributions is a proposed adaptation to face each one of the problems derived from imbalance. Second, we demonstrate that with little to no imbalance, straightforward deep learning models work well. However, for non-experts, these models can be seen as black boxes, where all efforts are invested in pre-processing the data. To simplify the problem, we perform the classification task without features that are costly to extract, such as part localization which is widely used in the state of the art of attribute classification. We make use of three widely known attribute databases, CUB-200-2011 - CUB as our main use case due to its deeply imbalanced nature, along with two balanced databases: celebA and AwA2. All of them contain multi-attribute annotations. The results of very fine-grained attribute learning demonstrate that in the presence of imbalance, our proposed strategies make it possible to have competitive results against the state of the art while taking advantage of multi-attribute deep learning models. We also noticed an increase in performance while using a specialized loss function (Focal Loss). For CUB, we have competitive results, and for CelebA and AwA2 our strategies over-perform the state of the art.
引用
收藏
页码:9219 / 9226
页数:8
相关论文
共 50 条
  • [1] Similarity measure for multi-attribute data
    Li, CJ
    Prabhakaran, B
    Zheng, SQ
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1149 - 1152
  • [2] Multi-Attribute Compressive Data Gathering
    Chen, Guangshuo
    Liu, Xiao-Yang
    Kong, Linghe
    Lu, Jia-Liang
    Wu, Min-You
    [J]. 2014 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2014, : 2178 - 2183
  • [3] Crowdsourced Selection on Multi-Attribute Data
    Weng, Xueping
    Li, Guoliang
    Hu, Huiqi
    Feng, Jianhua
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 307 - 316
  • [4] Multi-attribute Learning for Pedestrian Attribute Recognition in Surveillance Scenarios
    Li, Dangwei
    Chen, Xiaotang
    Huang, Kaiqi
    [J]. PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 111 - 115
  • [5] Sparse-Group Lasso for Graph Learning From Multi-Attribute Data
    Tugnai, Jitendra K.
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 : 1771 - 1786
  • [6] Graph estimation from multi-attribute data
    Kolar, Mladen
    Liu, Han
    Xing, Eric P.
    [J]. Journal of Machine Learning Research, 2014, 15 : 1713 - 1750
  • [7] Query and Animate Multi-attribute Trajectory Data
    Xu, Jianqiu
    Gueting, Ralf Hartmut
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2551 - 2554
  • [8] Graph Estimation From Multi-Attribute Data
    Kolar, Mladen
    Liu, Han
    Xing, Eric P.
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 1713 - 1750
  • [9] A multi-attribute evaluating approach based on analysis and learning of attribute coordinate
    Feng, JL
    Lu, NZ
    [J]. APPLIED COMPUTATIONAL INTELLIGENCE, 2004, : 148 - 151
  • [10] Genetic learning of multi-attribute interactions in speaker verification
    Pham, T
    [J]. PROCEEDINGS OF THE 2000 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2000, : 379 - 383