SPIQ: A Self-Supervised Pre-Trained Model for Image Quality Assessment

被引：0

作者：

Chen, Pengfei ^{[1
]}

Li, Leida ^{[2
]}

Wu, Qingbo ^{[3
]}

Wu, Jinjian ^{[2
]}

机构：

[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China

[2] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China

[3] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2022年 / 29卷

关键词：

Distortion; Feature extraction; Task analysis; Transformers; Training; Predictive models; Image quality; Blind image quality assessment; self-supervised pre-training; contrastive learning; INDEX;

D O I：

10.1109/LSP.2022.3145326

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Blind image quality assessment (BIQA) has witnessed a flourishing progress due to the rapid advances in deep learning technique. The vast majority of prior BIQA methods try to leverage models pre-trained on ImageNet to mitigate the data shortage problem. These well-trained models, however, can be sub-optimal when applied to BIQA task that varies considerably from the image classification domain. To address this issue, we make the first attempt to leverage the plentiful unlabeled data to conduct self-supervised pre-training for BIQA task. Based on the distorted images generated from the high-quality samples using the designed distortion augmentation strategy, the proposed pre-training is implemented by a feature representation prediction task. Specifically, patch-wise feature representations corresponding to a certain grid are integrated to make prediction for the representation of the patch below it. The prediction quality is then evaluated using a contrastive loss to capture quality-aware information for BIQA task. Experimental results conducted on KADID-10 k and KonIQ-10 k databases demonstrate that the learned pre-trained model can significantly benefit the existing learning based IQA models.

引用

页码：513 / 517

页数：5

共 50 条

[31] Self-Supervised Pre-Trained Speech Representation Based End-to-End Mispronunciation Detection and Diagnosis of Mandarin
Shen, Yunfei
Liu, Qingqing
Fan, Zhixing
Liu, Jiajun
Wumaier, Aishan
[J]. IEEE ACCESS, 2022, 10 : 106451 - 106462
[32] END-TO-END SPOKEN LANGUAGE UNDERSTANDING USING TRANSFORMER NETWORKS AND SELF-SUPERVISED PRE-TRAINED FEATURES
Morais, Edmilson
Kuo, Hong-Kwang J.
Thomas, Samuel
Tuske, Zoltan
Kingsbury, Brian
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7483 - 7487
[33] Contrastive Self-Supervised Pre-Training for Video Quality Assessment
Chen, Pengfei
Li, Leida
Wu, Jinjian
Dong, Weisheng
Shi, Guangming
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 458 - 471
[34] Image quality assessment based on self-supervised learning and knowledge distillation
Sang, Qingbing
Shu, Ziru
Liu, Lixiong
Hu, Cong
Wu, Qin
[J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 90
[35] Toward Leveraging Pre-Trained Self-Supervised Frontends for Automatic Singing Voice Understanding Tasks: Three Case Studies
Yamamoto, Yuya
[J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1745 - 1752
[36] Applying Self-Supervised Learning to Image Quality Assessment in Chest CT Imaging
Pouget, Eleonore
Dedieu, Veronique
[J]. BIOENGINEERING-BASEL, 2024, 11 (04):
[37] Improved Image Quality Assessment by Utilizing Pre-Trained Architecture Features with Unified Learning Mechanism
Ryu, Jihyoung
[J]. APPLIED SCIENCES-BASEL, 2023, 13 (04):
[38] Pre-Trained Image Processing Transformer
Chen, Hanting
Wang, Yunhe
Guo, Tianyu
Xu, Chang
Deng, Yiping
Liu, Zhenhua
Ma, Siwei
Xu, Chunjing
Xu, Chao
Gao, Wen
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12294 - 12305
[39] Grading the severity of diabetic retinopathy using an ensemble of self-supervised pre-trained convolutional neural networks: ESSP-CNNs
Parsa, Saeed
Khatibi, Toktam
[J]. Multimedia Tools and Applications, 2024, 83 (42) : 89837 - 89870
[40] Targeted Image Reconstruction by Sampling Pre-trained Diffusion Model
Zheng, Jiageng
[J]. INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, INTELLISYS 2023, 2024, 822 : 552 - 560

← 1 2 3 4 5 →