Clothing Parsing Based on Multi-Scale Fusion and Improved Self-Attention Mechanism

被引:0
|
作者
陈诺 [1 ]
王绍宇 [1 ]
陆然 [1 ]
李文萱 [1 ]
覃志东 [1 ]
石秀金 [1 ]
机构
[1] College of Computer Science and Technology, Donghua University
关键词
D O I
10.19884/j.1672-5220.202303008
中图分类号
TS941 [服装工业]; TP391.41 []; TP18 [人工智能理论];
学科分类号
080203 ; 081104 ; 0812 ; 0821 ; 082104 ; 0835 ; 1405 ;
摘要
Due to the lack of long-range association and spatial location information, fine details and accurate boundaries of complex clothing images cannot always be obtained by using the existing deep learning-based methods. This paper presents a convolutional structure with multi-scale fusion to optimize the step of clothing feature extraction and a self-attention module to capture long-range association information. The structure enables the self-attention mechanism to directly participate in the process of information exchange through the down-scaling projection operation of the multi-scale framework. In addition, the improved self-attention module introduces the extraction of 2-dimensional relative position information to make up for its lack of ability to extract spatial position features from clothing images. The experimental results based on the colorful fashion parsing dataset(CFPD) show that the proposed network structure achieves 53.68% mean intersection over union(mIoU) and has better performance on the clothing parsing task.
引用
收藏
页码:661 / 666
页数:6
相关论文
共 50 条
  • [21] Automated detection of sleep-arousal using multi-scale convolution and self-attention mechanism
    Li F.
    Xu Y.
    Zhang B.
    Cong F.
    [J]. Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2023, 40 (01): : 27 - 34
  • [22] Coarse-to-Fine bone age regression by using multi-scale self-attention mechanism
    Wu, Guanyu
    Wang, Ziming
    Peng, Jian
    Gao, Shaobing
    [J]. Biomedical Signal Processing and Control, 2025, 100
  • [23] Footprint Pressure Image Retrieval Algorithm Based on Multi-scale Self-attention Convolution
    Zhu M.
    Wang T.
    Wang N.
    Tang J.
    Lu X.
    [J]. Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2020, 33 (12): : 1097 - 1103
  • [24] Multi-scale quaternion CNN and BiGRU with cross self-attention feature fusion for fault diagnosis of bearing
    Liu, Huanbai
    Zhang, Fanlong
    Tan, Yin
    Huang, Lian
    Li, Yan
    Huang, Guoheng
    Luo, Shenghong
    Zeng, An
    [J]. MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (08)
  • [25] Video Salient Object Detection Using Multi-Scale Self-Attention
    [J]. Liu, Jiahao (jiahao.liu@akane.waseda.jp), 1600, Institute of Electrical and Electronics Engineers Inc.
  • [26] MSGSA: Multi-Scale Guided Self-Attention Network for Crowd Counting
    Sun, Yange
    Li, Meng
    Guo, Huaping
    Zhang, Li
    [J]. ELECTRONICS, 2023, 12 (12)
  • [27] Crowd counting using a self-attention multi-scale cascaded network
    Li, He
    Zhang, Shihui
    Kong, Weihang
    [J]. IET COMPUTER VISION, 2019, 13 (06) : 556 - 561
  • [28] A Multi-Scale Detector Based on Attention Mechanism
    Zhou, Lukuan
    Wang, Wei
    Wang, Qiang
    Sheng, Biyun
    Yang, Wankou
    [J]. 2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 110 - 115
  • [29] Peach Flower Density Detection Based on an Improved CNN Incorporating Attention Mechanism and Multi-Scale Feature Fusion
    Tao, Kun
    Wang, Aichen
    Shen, Yidie
    Lu, Zemin
    Peng, Futian
    Wei, Xinhua
    [J]. HORTICULTURAE, 2022, 8 (10)
  • [30] Flower image classification based on an improved lightweight neural network with multi-scale feature fusion and attention mechanism
    Zeng, Zhigao
    Huang, Cheng
    Zhu, Wenqiu
    Wen, Zhiqiang
    Yuan, Xinpan
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (08) : 13900 - 13920