Multi-Task Collaborative Attention Network for Pedestrian Attribute Recognition

被引：0

作者：

Cao, Junliang ^{[1
]}

Wei, Hua ^{[1
]}

Sun, Yongli ^{[1
]}

Zhao, Zhifeng ^{[1
]}

Wang, Wei ^{[1
]}

Sun, Guangze ^{[1
]}

Wang, Gang ^{[1
]}

机构：

[1] Xian Fiberhome Software Tech, Xian, Peoples R China

来源：

2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年

关键词：

Pedestrian Attribute Recognition; Feature Division Module; Spatial and Channel Collaborative Attention Module; Multi-Task Collaborative Attention Network; Adaptive-Soups;

D O I：

10.1109/IJCNN54540.2023.10191574

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Pedestrian Attribute Recognition (PAR) is a multi-task attribute leaning problem. Research into person attributes recognition has focused on approaches to describe a person in terms of their appearance. Combination of some attributes is helpful to strengthen each other's learning such as upper clothing style and upper clothing length, while others are not, such as hair style and upper clothing length. Thus, how to effectively combine different task is the key challenges in PAR. To effectively utilizing the relationship between attributes and further improve the effects of PAR, we propose a novel Multi-Task Collaborative Attention Network (MTCAN), which consists of three modules. Specifically, we first design a Feature Division Module (FDM) to focus on reliable and flexible attribute-related regions. Based on the precise attribute-related locations, we further construct a Spatial and Channel Collaborative Attention Module (SCCAM) to facilitate the beneficial features and weaken mutually suppressed features. Thirdly, a newly weights fusion strategy named adaptive-soups is proposed to mine the optimal model which is universal for deep learning models in all fields. Experiments on two pedestrian attribute recognition datasets show that our proposed method achieves superior performance against other state-of-the-art methods.

引用

页数：6

共 50 条

[31] Multi-task learning network for handwritten numeral recognition
Hou, Jinhui
Zeng, Huanqiang
Cai, Lei
Zhu, Jianqing
Chen, Jing
Cai, Canhui
[J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (02) : 843 - 850
[32] Emotion recognition from EEG based on multi-task learning with capsule network and attention mechanism
Li, Chang
Wang, Bin
Zhang, Silin
Liu, Yu
Song, Rencheng
Cheng, Juan
Chen, Xun
[J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 143
[33] Vehicle recognition using multi-task cascaded network
Gong, Hua
Zhang, Yong
Liu, Fang
Xu, Ke
[J]. FIFTH SYMPOSIUM ON NOVEL OPTOELECTRONIC DETECTION TECHNOLOGY AND APPLICATION, 2019, 11023
[34] Dual-branch self-attention network for pedestrian attribute recognition
Liu, Zhenyu
Zhang, Zhang
Li, Da
Zhang, Peng
Shan, Caifeng
[J]. PATTERN RECOGNITION LETTERS, 2022, 163 : 112 - 120
[35] Adaptively Weighted Multi-task Deep Network for Person Attribute Classification
He, Keke
Wang, Zhanxiong
Fu, Yanwei
Feng, Rui
Jiang, Yu-Gang
Xue, Xiangyang
[J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1636 - 1644
[36] Face Attribute Estimation Using Multi-Task Convolutional Neural Network
Kawai, Hiroyarr
Ito, Koichi
Aoki, Takafumi
[J]. JOURNAL OF IMAGING, 2022, 8 (04)
[37] Pedestrian Attributes Recognition in Surveillance Scenarios with Hierarchical Multi-Task CNN Models
Fang, Wenhua
Chen, Jun
Hu, Ruimin
[J]. CHINA COMMUNICATIONS, 2018, 15 (12) : 208 - 219
[38] Pedestrian Attributes Recognition in Surveillance Scenarios with Hierarchical Multi-task CNN Models
Fang, Wenhua
Chen, Jun
Lu, Tao
Hu, Ruimin
[J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 758 - 767
[39] Multi-Task Multi-Attention Transformer for Generative Named Entity Recognition
Mo, Ying
Liu, Jiahao
Tang, Hongyin
Wang, Qifan
Xu, Zenglin
Wang, Jingang
Quan, Xiaojun
Wu, Wei
Li, Zhoujun
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4171 - 4183
[40] MTSAN: Multi-Task Semantic Attention Network for ADAS Applications
Lai, Chun-Yu
Wu, Bo-Xun
Shivanna, Vinay Malligere
Guo, Jiun-In
[J]. IEEE ACCESS, 2021, 9 (50700-50714) : 50700 - 50714

← 1 2 3 4 5 →