Multi-Task Collaborative Attention Network for Pedestrian Attribute Recognition

被引:0
|
作者
Cao, Junliang [1 ]
Wei, Hua [1 ]
Sun, Yongli [1 ]
Zhao, Zhifeng [1 ]
Wang, Wei [1 ]
Sun, Guangze [1 ]
Wang, Gang [1 ]
机构
[1] Xian Fiberhome Software Tech, Xian, Peoples R China
关键词
Pedestrian Attribute Recognition; Feature Division Module; Spatial and Channel Collaborative Attention Module; Multi-Task Collaborative Attention Network; Adaptive-Soups;
D O I
10.1109/IJCNN54540.2023.10191574
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pedestrian Attribute Recognition (PAR) is a multi-task attribute leaning problem. Research into person attributes recognition has focused on approaches to describe a person in terms of their appearance. Combination of some attributes is helpful to strengthen each other's learning such as upper clothing style and upper clothing length, while others are not, such as hair style and upper clothing length. Thus, how to effectively combine different task is the key challenges in PAR. To effectively utilizing the relationship between attributes and further improve the effects of PAR, we propose a novel Multi-Task Collaborative Attention Network (MTCAN), which consists of three modules. Specifically, we first design a Feature Division Module (FDM) to focus on reliable and flexible attribute-related regions. Based on the precise attribute-related locations, we further construct a Spatial and Channel Collaborative Attention Module (SCCAM) to facilitate the beneficial features and weaken mutually suppressed features. Thirdly, a newly weights fusion strategy named adaptive-soups is proposed to mine the optimal model which is universal for deep learning models in all fields. Experiments on two pedestrian attribute recognition datasets show that our proposed method achieves superior performance against other state-of-the-art methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Multi-task learning network for handwritten numeral recognition
    Hou, Jinhui
    Zeng, Huanqiang
    Cai, Lei
    Zhu, Jianqing
    Chen, Jing
    Cai, Canhui
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (02) : 843 - 850
  • [32] Emotion recognition from EEG based on multi-task learning with capsule network and attention mechanism
    Li, Chang
    Wang, Bin
    Zhang, Silin
    Liu, Yu
    Song, Rencheng
    Cheng, Juan
    Chen, Xun
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 143
  • [33] Vehicle recognition using multi-task cascaded network
    Gong, Hua
    Zhang, Yong
    Liu, Fang
    Xu, Ke
    [J]. FIFTH SYMPOSIUM ON NOVEL OPTOELECTRONIC DETECTION TECHNOLOGY AND APPLICATION, 2019, 11023
  • [34] Dual-branch self-attention network for pedestrian attribute recognition
    Liu, Zhenyu
    Zhang, Zhang
    Li, Da
    Zhang, Peng
    Shan, Caifeng
    [J]. PATTERN RECOGNITION LETTERS, 2022, 163 : 112 - 120
  • [35] Adaptively Weighted Multi-task Deep Network for Person Attribute Classification
    He, Keke
    Wang, Zhanxiong
    Fu, Yanwei
    Feng, Rui
    Jiang, Yu-Gang
    Xue, Xiangyang
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1636 - 1644
  • [36] Face Attribute Estimation Using Multi-Task Convolutional Neural Network
    Kawai, Hiroyarr
    Ito, Koichi
    Aoki, Takafumi
    [J]. JOURNAL OF IMAGING, 2022, 8 (04)
  • [37] Pedestrian Attributes Recognition in Surveillance Scenarios with Hierarchical Multi-Task CNN Models
    Fang, Wenhua
    Chen, Jun
    Hu, Ruimin
    [J]. CHINA COMMUNICATIONS, 2018, 15 (12) : 208 - 219
  • [38] Pedestrian Attributes Recognition in Surveillance Scenarios with Hierarchical Multi-task CNN Models
    Fang, Wenhua
    Chen, Jun
    Lu, Tao
    Hu, Ruimin
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2018, PT II, 2018, 11165 : 758 - 767
  • [39] Multi-Task Multi-Attention Transformer for Generative Named Entity Recognition
    Mo, Ying
    Liu, Jiahao
    Tang, Hongyin
    Wang, Qifan
    Xu, Zenglin
    Wang, Jingang
    Quan, Xiaojun
    Wu, Wei
    Li, Zhoujun
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4171 - 4183
  • [40] MTSAN: Multi-Task Semantic Attention Network for ADAS Applications
    Lai, Chun-Yu
    Wu, Bo-Xun
    Shivanna, Vinay Malligere
    Guo, Jiun-In
    [J]. IEEE ACCESS, 2021, 9 (50700-50714) : 50700 - 50714