Multi-Attribute Learning With Highly Imbalanced Data

被引：2

作者：

Beltran, L. Viviana Beltran ^{[1
]}

Coustaty, Mickael ^{[1
]}

Journet, Nicholas ^{[2
]}

Caicedo, Juan C. ^{[3
]}

Doucet, Antoine ^{[1
]}

机构：

[1] Univ La Rochelle, F-17000 La Rochelle, France

[2] Univ Bordeaux, F-33000 Bordeaux, France

[3] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA

来源：

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021年

关键词：

D O I：

10.1109/ICPR48806.2021.9412634

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Data is one of the most important keys for success when studying a simple or a complex phenomenon. With the use of deep learning exploding and its democratization, non-computer science experts may struggle to use highly complex deep learning architectures, even when straightforward models offer them suitable performances. In this article, we study the specific and common problem of data imbalance in real databases as most of the bad performance problems are due to the data itself. We cover two keys aspects. First, we propose ways to deal with the situation when a data set contains different levels of imbalance. Classical imbalanced learning strategies cannot be directly applied when using multi-attribute deep learning models, i.e., multi-task or multi-label architectures. Therefore, one of our contributions is a proposed adaptation to face each one of the problems derived from imbalance. Second, we demonstrate that with little to no imbalance, straightforward deep learning models work well. However, for non-experts, these models can be seen as black boxes, where all efforts are invested in pre-processing the data. To simplify the problem, we perform the classification task without features that are costly to extract, such as part localization which is widely used in the state of the art of attribute classification. We make use of three widely known attribute databases, CUB-200-2011 - CUB as our main use case due to its deeply imbalanced nature, along with two balanced databases: celebA and AwA2. All of them contain multi-attribute annotations. The results of very fine-grained attribute learning demonstrate that in the presence of imbalance, our proposed strategies make it possible to have competitive results against the state of the art while taking advantage of multi-attribute deep learning models. We also noticed an increase in performance while using a specialized loss function (Focal Loss). For CUB, we have competitive results, and for CelebA and AwA2 our strategies over-perform the state of the art.

引用

页码：9219 / 9226

页数：8

共 50 条

[1] Similarity measure for multi-attribute data
Li, CJ
Prabhakaran, B
Zheng, SQ
[J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1149 - 1152
[2] Multi-Attribute Compressive Data Gathering
Chen, Guangshuo
Liu, Xiao-Yang
Kong, Linghe
Lu, Jia-Liang
Wu, Min-You
[J]. 2014 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2014, : 2178 - 2183
[3] Crowdsourced Selection on Multi-Attribute Data
Weng, Xueping
Li, Guoliang
Hu, Huiqi
Feng, Jianhua
[J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 307 - 316
[4] Multi-attribute Learning for Pedestrian Attribute Recognition in Surveillance Scenarios
Li, Dangwei
Chen, Xiaotang
Huang, Kaiqi
[J]. PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 111 - 115
[5] Sparse-Group Lasso for Graph Learning From Multi-Attribute Data
Tugnai, Jitendra K.
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 : 1771 - 1786
[6] Graph estimation from multi-attribute data
Kolar, Mladen
Liu, Han
Xing, Eric P.
[J]. Journal of Machine Learning Research, 2014, 15 : 1713 - 1750
[7] Query and Animate Multi-attribute Trajectory Data
Xu, Jianqiu
Gueting, Ralf Hartmut
[J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2551 - 2554
[8] Graph Estimation From Multi-Attribute Data
Kolar, Mladen
Liu, Han
Xing, Eric P.
[J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 1713 - 1750
[9] A multi-attribute evaluating approach based on analysis and learning of attribute coordinate
Feng, JL
Lu, NZ
[J]. APPLIED COMPUTATIONAL INTELLIGENCE, 2004, : 148 - 151
[10] Genetic learning of multi-attribute interactions in speaker verification
Pham, T
[J]. PROCEEDINGS OF THE 2000 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2000, : 379 - 383

← 1 2 3 4 5 →