Image Quality Caption with Attentive and Recurrent Semantic Attractor Network

被引:5
|
作者
Yang, Wen [1 ]
Wu, Jinjian [1 ]
Li, Leida [1 ]
Dong, Weisheng [1 ]
Shi, Guangming [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian, Peoples R China
基金
国家重点研发计划;
关键词
image quality assessment; quality caption; hierarchical semantics; degradations; deep neural network;
D O I
10.1145/3474085.3475603
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a novel quality caption model is inventively developed to assess the image quality with hierarchical semantics. Existing image quality assessment (IQA) methods usually represent image quality with a quantitative value, resulting in inconsistency with human cognition. Generally, human beings are good at perceiving image quality in terms of semantic description rather than quantitative value. Moreover, cognition is a needs-oriented task where hierarchical semantics are extracted. The mediocre quality value fails to reflect degradations on hierarchical semantics. Therefore, a new IQA framework is proposed to describe the quality for needs-oriented cognition. A novel quality caption procedure is firstly introduced, in which the quality is represented as patterns of activation distributed across the diverse degradations on hierarchical semantics. Then, an attentive and recurrent semantic attractor network (ARSANet) is designed to activate the distributed patterns for image quality description. Experiments demonstrate that our method achieves superior performance and is highly compliant with human cognition.
引用
收藏
页码:4501 / 4509
页数:9
相关论文
共 50 条
  • [1] Cascade recurrent neural network for image caption generation
    Wu, Jie
    Hu, Haifeng
    ELECTRONICS LETTERS, 2017, 53 (25) : 1642 - 1643
  • [2] An Image Caption Model Based on the Scene Graph and Semantic Prior Network
    Liu, Weifeng
    Zhang, Nan
    Wang, Yaning
    Di, Wu
    2022 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND INFORMATION SCIENCES (ICCAIS), 2022, : 60 - 66
  • [3] A multi-scale attentive recurrent network for image dehazing
    Wang, Yibin
    Yin, Shibai
    Basu, Anup
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (21-23) : 32539 - 32565
  • [4] A multi-scale attentive recurrent network for image dehazing
    Yibin Wang
    Shibai Yin
    Anup Basu
    Multimedia Tools and Applications, 2021, 80 : 32539 - 32565
  • [5] WSSAMNet: Weakly Supervised Semantic Attentive Medical Image Registration Network
    Nasser, Sahar Almahfouz
    Kurian, Nikhil Cherian
    Meena, Mohit
    Shamsi, Saqib
    Sethi, Amit
    BRAINLESION: GLIOMA, MULTIPLE SCLEROSIS, STROKE AND TRAUMATIC BRAIN INJURIES, BRAINLES 2022, PT II, 2023, 14092 : 15 - 24
  • [6] RECURRENT ATTENTIVE DECOMPOSITION NETWORK FOR LOW-LIGHT IMAGE ENHANCEMENT
    Gao, Haoyu
    Zhang, Lin
    Zhang, Shunli
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3818 - 3822
  • [7] SPICE: Semantic Propositional Image Caption Evaluation
    Anderson, Peter
    Fernando, Basura
    Johnson, Mark
    Gould, Stephen
    COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 382 - 398
  • [8] Image Caption Generation with Local Semantic and Global Information
    Liu, Xing
    Liu, Weibin
    Xing, Weiwei
    2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 680 - 685
  • [9] Adversarial Image Caption Generator Network
    Dehaqi A.M.
    Seydi V.
    Madadi Y.
    SN Computer Science, 2021, 2 (3)
  • [10] Attentive U-recurrent encoder-decoder network for image dehazing
    Yin, Shibai
    Wang, Yibin
    Yang, Yee-Hong
    NEUROCOMPUTING, 2021, 437 : 143 - 156