Multi-Scale Attention Network for Image Cropping

被引:0
|
作者
Lian, Tianpei [1 ]
Xian, Ke [1 ]
Pan, Zhiyu [1 ]
Hong, Chaoyi [1 ]
Cao, Zhiguo [1 ]
Zhong, Weicai [2 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China
[2] Huawei CBG Consumer Cloud Serv Big Data Platform, Xian 710075, Peoples R China
基金
中国国家自然科学基金;
关键词
Image cropping; aesthetic quality; multi-scale attention; deep learning; VISUAL-ATTENTION; MODEL;
D O I
10.1109/CAC51589.2020.9326681
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic image cropping is a completely practical but challenging task which aims to improve the aesthetic quality of an image by removing irrelevant areas. Most previous image cropping methods ignored compositional relationships among different regions of a given image. Global compositional relationships are extremely important for cropping models to decide whether to reserve a certain object of an input image. In this work, we propose a multi-scale attention network (MSANet) to address this issue. We employ three plug-and-play attention modules to catch the context on three different scales. The multi-scale attention (MSA) module ensures that our model perceives objects of different sizes and preserve needed areas. Moreover, we design a border-reserved grid anchor based formulation to better handle the situations where the subjects are at the edge of input images. The cosine similarity loss function is also utilized to acquire stable results. Extensive quantitative and qualitative experimental results show that our model is well aware of the compositional relationships of images. Compared to existing works, our multi-scale attention network achieves state-of-the-art performance with less time and lighter weights.
引用
收藏
页码:2640 / 2645
页数:6
相关论文
共 50 条
  • [31] A Medical Image Segmentation Network with Multi-Scale and Dual-Branch Attention
    Zhu, Cancan
    Cheng, Ke
    Hua, Xuecheng
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (14):
  • [32] Multi-scale recurrent attention gated fusion network for single image dehazing
    Zhang, Xiangfen
    Yang, Shuo
    Zhang, Qingyi
    Yuan, Feiniu
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 101
  • [33] MSA-Net: Multi-scale attention network for image splicing localization
    Caiping Yan
    Huajian Wei
    Zhi Lan
    Hong Li
    [J]. Multimedia Tools and Applications, 2024, 83 : 20587 - 20604
  • [35] A lightweight multi-scale channel attention network for image super-resolution
    Li, Wenbin
    Li, Juefei
    Li, Jinxin
    Huang, Zhiyong
    Zhou, Dengwen
    [J]. NEUROCOMPUTING, 2021, 456 : 327 - 337
  • [36] Image Interpolation Using Multi-Scale Attention-Aware Inception Network
    Ji, Jiahuan
    Zhong, Baojiang
    Ma, Kai-Kuang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9413 - 9428
  • [37] Automatic lumbar spinal MRI image segmentation with a multi-scale attention network
    Haixing Li
    Haibo Luo
    Wang Huan
    Zelin Shi
    Chongnan Yan
    Lanbo Wang
    Yueming Mu
    Yunpeng Liu
    [J]. Neural Computing and Applications, 2021, 33 : 11589 - 11602
  • [38] Underwater Image Enhancement Based on Multi-Scale Feature Fusion and Attention Network
    Liu, Yuzhen
    Liu, Meiyi
    Lin, Sen
    Tao, Zhiyong
    [J]. Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2023, 35 (05): : 685 - 695
  • [39] MSA-Net: Multi-scale attention network for image splicing localization
    Yan, Caiping
    Wei, Huajian
    Lan, Zhi
    Li, Hong
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 20587 - 20604
  • [40] Image super-resolution with multi-scale fractal residual attention network
    Song, Xiaogang
    Liu, Wanbo
    Liang, Li
    Shi, Weiwei
    Xie, Guo
    Lu, Xiaofeng
    Hei, Xinhong
    [J]. COMPUTERS & GRAPHICS-UK, 2023, 113 : 21 - 31