Image Quality Caption with Attentive and Recurrent Semantic Attractor Network

被引：5

作者：

Yang, Wen ^{[1
]}

Wu, Jinjian ^{[1
]}

Li, Leida ^{[1
]}

Dong, Weisheng ^{[1
]}

Shi, Guangming ^{[1
]}

机构：

[1] Xidian Univ, Sch Artificial Intelligence, Xian, Peoples R China

来源：

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年

基金：

国家重点研发计划;

关键词：

image quality assessment; quality caption; hierarchical semantics; degradations; deep neural network;

D O I：

10.1145/3474085.3475603

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a novel quality caption model is inventively developed to assess the image quality with hierarchical semantics. Existing image quality assessment (IQA) methods usually represent image quality with a quantitative value, resulting in inconsistency with human cognition. Generally, human beings are good at perceiving image quality in terms of semantic description rather than quantitative value. Moreover, cognition is a needs-oriented task where hierarchical semantics are extracted. The mediocre quality value fails to reflect degradations on hierarchical semantics. Therefore, a new IQA framework is proposed to describe the quality for needs-oriented cognition. A novel quality caption procedure is firstly introduced, in which the quality is represented as patterns of activation distributed across the diverse degradations on hierarchical semantics. Then, an attentive and recurrent semantic attractor network (ARSANet) is designed to activate the distributed patterns for image quality description. Experiments demonstrate that our method achieves superior performance and is highly compliant with human cognition.

引用

页码：4501 / 4509

页数：9

共 50 条

[21] Attentive Visual Semantic Specialized Network for Video Captioning
Perez-Martin, Jesus
Bustos, Benjamin
Perez, Jorge
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5767 - 5774
[22] Recurrent Attention LSTM Model for Image Chinese Caption Generation
Zhang, Chaoying
Dai, Yaping
Cheng, Yanyan
Jia, Zhiyang
Hirota, Kaoru
2018 JOINT 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 19TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2018, : 808 - 813
[23] Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification
Li, Liang
Wang, Shuhui
Jiang, Shuqiang
Huang, Qingming
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1092 - 1100
[24] A Semantic Driven CNN - LSTM Architecture for Personalised Image Caption Generation
Ignatious, Abisha Anto L.
Jeevitha, S.
Madhurambigai, M.
Hemalatha, M.
2019 11TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (ICOAC 2019), 2019, : 356 - 362
[25] Cross2Self-attentive Bidirectional Recurrent Neural Network with BERT for Biomedical Semantic Text Similarity
Li, Zhengguang
Lin, Hongfei
Shen, Chen
Zheng, Wei
Yang, Zhihao
Wang, Jian
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1051 - 1054
[26] An Image Caption Model Incorporating High-level Semantic Features
Luo, Zhiwang
Hu, Jiwei
Liu, Quan
Deng, Jiamei
ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
[27] Transformer model incorporating local graph semantic attention for image caption
Qian, Kui
Pan, Yuchen
Xu, Hao
Tian, Lei
VISUAL COMPUTER, 2024, 40 (09): : 6533 - 6544
[28] A NOVEL SEMANTIC ATTRIBUTE-BASED FEATURE FOR IMAGE CAPTION GENERATION
Wang, Wei
Ding, Yuxuan
Tian, Chunna
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 3081 - 3085
[29] On Multimodal Semantic Consistency Detection of News Articles with Image Caption Pairs
Chen, Yuwei
Chang, Ming-Ching
2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 355 - 356
[30] Adaptive Text Denoising Network for Image Caption Editing
Yuan, Mengqi
Bao, Bing-Kun
Tan, Zhiyi
Xu, Changsheng
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)

← 1 2 3 4 5 →