Clothing Parsing Based on Multi-Scale Fusion and Improved Self-Attention Mechanism

Citations: 0
Authors
陈诺 [1]
王绍宇 [1]
陆然 [1]
李文萱 [1]
覃志东 [1]
石秀金 [1]
Affiliations
[1] College of Computer Science and Technology, Donghua University
Keywords
DOI
10.19884/j.1672-5220.202303008
Chinese Library Classification (CLC) number
TS941 [Clothing industry]; TP391.41; TP18 [Artificial intelligence theory]
Discipline classification codes
080203; 081104; 0812; 0821; 082104; 0835; 1405
Abstract
Existing deep learning-based methods lack long-range association and spatial location information, so they cannot always recover fine details and accurate boundaries in complex clothing images. This paper presents a convolutional structure with multi-scale fusion to optimize clothing feature extraction, together with a self-attention module to capture long-range association information. Through the down-scaling projection operation of the multi-scale framework, the structure lets the self-attention mechanism participate directly in the exchange of information. In addition, the improved self-attention module extracts 2-dimensional relative position information to compensate for its limited ability to extract spatial position features from clothing images. Experimental results on the colorful fashion parsing dataset (CFPD) show that the proposed network structure achieves 53.68% mean intersection over union (mIoU) and improves performance on the clothing parsing task.
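For readers unfamiliar with the reported metric, the following is a minimal sketch of how mean intersection over union can be computed for a predicted label map against a ground-truth label map. It is not the authors' evaluation code: the function name `mean_iou`, the toy label maps, the three-class setup, and the choice to skip classes absent from both maps are all assumptions made for illustration, and the abstract does not say whether the paper averages per image or over the whole dataset.

```python
from typing import Sequence


def mean_iou(pred: Sequence[Sequence[int]],
             target: Sequence[Sequence[int]],
             num_classes: int) -> float:
    """Average the per-class IoU over classes present in pred or target."""
    intersection = [0] * num_classes  # pixels where both maps agree on class c
    union = [0] * num_classes         # pixels where either map says class c
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p == t:
                intersection[p] += 1
                union[p] += 1
            else:
                union[p] += 1
                union[t] += 1
    # Assumption: classes absent from both maps are left out of the mean.
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    if not ious:
        raise ValueError("no class present in prediction or target")
    return sum(ious) / len(ious)


# Toy 2x3 label maps with three hypothetical classes (0 = background).
pred = [[0, 1, 1],
        [0, 2, 2]]
target = [[0, 1, 2],
          [0, 2, 2]]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.4f}")
```

In this toy case the per-class IoU values are 1, 1/2, and 2/3, so a single mislabeled pixel already lowers two class scores at once, which is why the metric is sensitive to boundary errors of the kind the abstract describes.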
Pages: 661-666
Page count: 6