Multi-Scale Attention Network for Image Cropping

Cited by: 0
Authors
Lian, Tianpei [1]
Xian, Ke [1]
Pan, Zhiyu [1]
Hong, Chaoyi [1]
Cao, Zhiguo [1]
Zhong, Weicai [2]
Affiliations
[1] Huazhong Univ Sci & Technol, Sch Artificial Intelligence & Automat, Wuhan 430074, Peoples R China
[2] Huawei CBG Consumer Cloud Serv Big Data Platform, Xian 710075, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Image cropping; aesthetic quality; multi-scale attention; deep learning; VISUAL-ATTENTION; MODEL;
DOI
10.1109/CAC51589.2020.9326681
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Automatic image cropping is a practical but challenging task that aims to improve the aesthetic quality of an image by removing irrelevant areas. Most previous image cropping methods ignore compositional relationships among different regions of a given image. Global compositional relationships are extremely important for cropping models when deciding whether to retain a certain object in an input image. In this work, we propose a multi-scale attention network (MSANet) to address this issue. We employ three plug-and-play attention modules to capture context at three different scales. The multi-scale attention (MSA) module ensures that our model perceives objects of different sizes and preserves the needed areas. Moreover, we design a border-reserved, grid-anchor-based formulation to better handle situations where the subjects are at the edge of input images. A cosine similarity loss function is also used to obtain stable results. Extensive quantitative and qualitative experimental results show that our model is well aware of the compositional relationships of images. Compared to existing works, our multi-scale attention network achieves state-of-the-art performance with less time and lighter weights.
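The abstract names a cosine similarity loss but does not say how it is applied. As a minimal sketch, assuming the loss compares a vector of predicted scores for candidate crops against a vector of ground-truth scores (an assumption, not a detail given in this record), the general form 1 - cos(p, t) can be written as follows. The function name `cosine_similarity_loss` and the `eps` guard are illustrative, not taken from the paper.

```python
import math


def cosine_similarity_loss(predicted, target, eps=1e-8):
    """Return 1 - cos(predicted, target) for two equal-length score vectors.

    Hypothetical helper: the paper's exact loss definition is not given
    in the abstract, so this shows only the generic cosine form.
    """
    if len(predicted) != len(target):
        raise ValueError("score vectors must have the same length")
    dot = sum(p * t for p, t in zip(predicted, target))
    norm_p = math.sqrt(sum(p * p for p in predicted))
    norm_t = math.sqrt(sum(t * t for t in target))
    # eps guards against division by zero when a vector is all zeros.
    return 1.0 - dot / max(norm_p * norm_t, eps)
```

Because the loss depends only on the angle between the two vectors, it is insensitive to the overall scale of the predicted scores, which may be one reason such a loss gives stable training; that interpretation is a general property of cosine similarity, not a claim from the abstract.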
Pages: 2640-2645
Page count: 6
Related papers
50 in total
  • [1] Multi-scale attention network for image inpainting
    Qin, Jia
    Bai, Huihui
    Zhao, Yao
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 204
  • [2] STACKED MULTI-SCALE ATTENTION NETWORK FOR IMAGE COLORIZATION
    Jiang, Bin
    Xu, Fangqiang
    Xia, Jun
    Yang, Chao
    Huang, Wei
    Huang, Yun
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2225 - 2229
  • [3] Multi-Scale Context Attention Network for Image Retrieval
    Lou, Yihang
    Bai, Yan
    Wang, Shiqi
    Duan, Ling-Yu
    [J]. PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1128 - 1136
  • [4] Multi-scale attention network for image super-resolution
    Wang, Li
    Shen, Jie
    Tang, E.
    Zheng, Shengnan
    Xu, Lizhong
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 80
  • [5] Multi-scale residual attention network for single image dehazing
    Sheng, Jiechao
    Lv, Guoqiang
    Du, Gang
    Wang, Zi
    Feng, Qibin
    [J]. DIGITAL SIGNAL PROCESSING, 2022, 121
  • [6] Underwater Image Enhancement with Multi-Scale Residual Attention Network
    Ueki, Yosuke
    Ikehara, Masaaki
    [J]. 2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [7] Multi-scale recurrent attention network for image motion deblurring
    Wang, Xiangjun
    Ouyang, Wensen
    [J]. Hongwai yu Jiguang Gongcheng/Infrared and Laser Engineering, 2022, 51 (06):
  • [8] Multi-scale network with attention mechanism for underwater image enhancement
    Tao, Ye
    Tang, Jinhui
    Zhao, Xinwei
    Zhou, Chen
    Wang, Chong
    Zhao, Zhonglei
    [J]. NEUROCOMPUTING, 2024, 595
  • [9] Msap: multi-scale attention probabilistic network for underwater image enhancement network
    Chang, Baocai
    Li, Jinjiang
    Wang, Haiyang
    Li, Mengjun
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (SUPPL 1) : 653 - 661
  • [10] Multi-scale Multi-attention Network for Moire Document Image Binarization
    Guo, Yanqing
    Ji, Caijuan
    Zheng, Xin
    Wang, Qianyu
    Luo, Xiangyang
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2021, 90