Image Quality Caption with Attentive and Recurrent Semantic Attractor Network

被引:5
|
作者
Yang, Wen [1 ]
Wu, Jinjian [1 ]
Li, Leida [1 ]
Dong, Weisheng [1 ]
Shi, Guangming [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian, Peoples R China
基金
国家重点研发计划;
关键词
image quality assessment; quality caption; hierarchical semantics; degradations; deep neural network;
D O I
10.1145/3474085.3475603
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a novel quality caption model is inventively developed to assess the image quality with hierarchical semantics. Existing image quality assessment (IQA) methods usually represent image quality with a quantitative value, resulting in inconsistency with human cognition. Generally, human beings are good at perceiving image quality in terms of semantic description rather than quantitative value. Moreover, cognition is a needs-oriented task where hierarchical semantics are extracted. The mediocre quality value fails to reflect degradations on hierarchical semantics. Therefore, a new IQA framework is proposed to describe the quality for needs-oriented cognition. A novel quality caption procedure is firstly introduced, in which the quality is represented as patterns of activation distributed across the diverse degradations on hierarchical semantics. Then, an attentive and recurrent semantic attractor network (ARSANet) is designed to activate the distributed patterns for image quality description. Experiments demonstrate that our method achieves superior performance and is highly compliant with human cognition.
引用
收藏
页码:4501 / 4509
页数:9
相关论文
共 50 条
  • [21] Attentive Visual Semantic Specialized Network for Video Captioning
    Perez-Martin, Jesus
    Bustos, Benjamin
    Perez, Jorge
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5767 - 5774
  • [22] Recurrent Attention LSTM Model for Image Chinese Caption Generation
    Zhang, Chaoying
    Dai, Yaping
    Cheng, Yanyan
    Jia, Zhiyang
    Hirota, Kaoru
    2018 JOINT 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 19TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2018, : 808 - 813
  • [23] Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification
    Li, Liang
    Wang, Shuhui
    Jiang, Shuqiang
    Huang, Qingming
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1092 - 1100
  • [24] A Semantic Driven CNN - LSTM Architecture for Personalised Image Caption Generation
    Ignatious, Abisha Anto L.
    Jeevitha, S.
    Madhurambigai, M.
    Hemalatha, M.
    2019 11TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC 2019), 2019, : 356 - 362
  • [25] Cross2Self-attentive Bidirectional Recurrent Neural Network with BERT for Biomedical Semantic Text Similarity
    Li, Zhengguang
    Lin, Hongfei
    Shen, Chen
    Zheng, Wei
    Yang, Zhihao
    Wang, Jian
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1051 - 1054
  • [26] An Image Caption Model Incorporating High-level Semantic Features
    Luo, Zhiwang
    Hu, Jiwei
    Liu, Quan
    Deng, Jiamei
    ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
  • [27] Transformer model incorporating local graph semantic attention for image caption
    Qian, Kui
    Pan, Yuchen
    Xu, Hao
    Tian, Lei
    VISUAL COMPUTER, 2024, 40 (09): : 6533 - 6544
  • [28] A NOVEL SEMANTIC ATTRIBUTE-BASED FEATURE FOR IMAGE CAPTION GENERATION
    Wang, Wei
    Ding, Yuxuan
    Tian, Chunna
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 3081 - 3085
  • [29] On Multimodal Semantic Consistency Detection of News Articles with Image Caption Pairs
    Chen, Yuwei
    Chang, Ming-Ching
    2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 355 - 356
  • [30] Adaptive Text Denoising Network for Image Caption Editing
    Yuan, Mengqi
    Bao, Bing-Kun
    Tan, Zhiyi
    Xu, Changsheng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)