Clothing Parsing Based on Multi-Scale Fusion and Improved Self-Attention Mechanism

被引:0
|
作者
陈诺 [1 ]
王绍宇 [1 ]
陆然 [1 ]
李文萱 [1 ]
覃志东 [1 ]
石秀金 [1 ]
机构
[1] College of Computer Science and Technology, Donghua University
关键词
D O I
10.19884/j.1672-5220.202303008
中图分类号
TS941 [服装工业]; TP391.41 []; TP18 [人工智能理论];
学科分类号
080203 ; 081104 ; 0812 ; 0821 ; 082104 ; 0835 ; 1405 ;
摘要
Due to the lack of long-range association and spatial location information, fine details and accurate boundaries of complex clothing images cannot always be obtained by using the existing deep learning-based methods. This paper presents a convolutional structure with multi-scale fusion to optimize the step of clothing feature extraction and a self-attention module to capture long-range association information. The structure enables the self-attention mechanism to directly participate in the process of information exchange through the down-scaling projection operation of the multi-scale framework. In addition, the improved self-attention module introduces the extraction of 2-dimensional relative position information to make up for its lack of ability to extract spatial position features from clothing images. The experimental results based on the colorful fashion parsing dataset(CFPD) show that the proposed network structure achieves 53.68% mean intersection over union(mIoU) and has better performance on the clothing parsing task.
引用
收藏
页码:661 / 666
页数:6
相关论文
共 50 条
  • [31] A froth image segmentation method via generative adversarial networks with multi-scale self-attention mechanism
    Zhong, Yuze
    Tang, Zhaohui
    Zhang, Hu
    Xie, Yongfang
    Gao, Xiaoliang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19663 - 19682
  • [32] MFANet: Multi-scale feature fusion network with attention mechanism
    Wang, Gaihua
    Gan, Xin
    Cao, Qingcheng
    Zhai, Qianyu
    VISUAL COMPUTER, 2023, 39 (07): : 2969 - 2980
  • [33] MFANet: Multi-scale feature fusion network with attention mechanism
    Gaihua Wang
    Xin Gan
    Qingcheng Cao
    Qianyu Zhai
    The Visual Computer, 2023, 39 : 2969 - 2980
  • [34] Texture Classification Using Improved ResNet based on Multi-scale Attention Mechanism
    Lu, Qiu
    Chen, Haotian
    Yang, Tiejun
    THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021), 2021, 11878
  • [35] A froth image segmentation method via generative adversarial networks with multi-scale self-attention mechanism
    Yuze Zhong
    Zhaohui Tang
    Hu Zhang
    Yongfang Xie
    Xiaoliang Gao
    Multimedia Tools and Applications, 2024, 83 : 19663 - 19682
  • [36] A Novel Clothing Attribute Representation Network-Based Self-Attention Mechanism
    Chun, Yutong
    Wang, Chuansheng
    He, Mingke
    IEEE ACCESS, 2020, 8 (08): : 201762 - 201769
  • [37] MSSA-Net: A novel multi-scale feature fusion and global self-attention network for lesion segmentation
    Huang, Zhaohong
    Zhang, Xiangchen
    Zhang, Guowei
    Cai, Guorong
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (21):
  • [38] A rolling bearing fault diagnosis method for imbalanced data based on multi-scale self-attention mechanism and novel loss function
    Ruiru, Qiang
    Xiaoqiang, Zhao
    Insight: Non-Destructive Testing and Condition Monitoring, 2024, 66 (11): : 690 - 701
  • [39] Multi-scale convolution networks for seismic event classification with windowed self-attention
    Yongming Huang
    Yi Xie
    Wei Liu
    Yongsheng Ma
    Fajun Miao
    Guobao Zhang
    Journal of Seismology, 2025, 29 (1) : 257 - 268
  • [40] Multi-scale self-attention generative adversarial network for pathology image restoration
    Liang, Meiyan
    Zhang, Qiannan
    Wang, Guogang
    Xu, Na
    Wang, Lin
    Liu, Haishun
    Zhang, Cunlin
    VISUAL COMPUTER, 2023, 39 (09): : 4305 - 4321