Max360IQ: Blind omnidirectional image quality assessment with multi-axis attention

被引:0
|
作者
Yan, Jiebin [1 ]
Tan, Ziwen [1 ]
Fang, Yuming [1 ]
Rao, Jiale [1 ]
Zuo, Yifan [1 ]
机构
[1] Jiangxi Univ Finance & Econ, Sch Comp & Artificial Intelligence, Nanchang, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Omnidirectional images; Perceptual quality assessment; Multi-axis attention;
D O I
10.1016/j.patcog.2025.111429
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Omnidirectional image, also called 360-degree image, is able to capture the entire 360-degree scene, thereby providing more realistic immersive feelings for users than general 2D image and stereoscopic image. Meanwhile, this feature brings great challenges to measuring the perceptual quality of omnidirectional images, which is closely related to users' quality of experience, especially when the omnidirectional images suffer from non-uniform distortion. In this paper, we propose a novel and effective blind omnidirectional image quality assessment (BOIQA) model with multi-axis attention (Max360IQ), which can proficiently measure not only the quality of uniformly distorted omnidirectional images but also the quality of non-uniformly distorted omnidirectional images. Specifically, the proposed Max360IQ is mainly composed of a backbone with stacked multi-axis attention modules for capturing both global and local spatial interactions of extracted viewports, a multi-scale feature integration (MSFI) module to fuse multi-scale features and a quality regression module with deep semantic guidance for predicting the quality of omnidirectional images. Experimental results demonstrate that the proposed Max360IQ outperforms the state-of-the-art Assessor360 by 3.6% in terms of SRCC on the JUFE database with non-uniform distortion, and gains improvement of 0.4% and 0.8% in terms of SRCC on the OIQA and CVIQ databases, respectively. The source code is available at https://github.com/WenJuing/Max360IQ.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] AM-BQA: Enhancing blind image quality assessment using attention retractable features and multi-dimensional learning
    Adhikari, Astha
    Lee, Sang-Woong
    IMAGE AND VISION COMPUTING, 2024, 147
  • [32] Blind image quality assessment based on progressive multi-task learning
    Li, Aobo
    Wu, Jinjian
    Tian, Shiwei
    Li, Leida
    Dong, Weisheng
    Shi, Guangming
    NEUROCOMPUTING, 2022, 500 (307-318) : 307 - 318
  • [33] Blind 360-degree image quality assessment via saliency-guided convolution neural network
    Qiu, Miaomiao
    Shao, Feng
    OPTIK, 2021, 240
  • [34] Multitask Deep Neural Network With Knowledge-Guided Attention for Blind Image Quality Assessment
    Zhou, Tianwei
    Tan, Songbai
    Zhao, Baoquan
    Yue, Guanghui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7577 - 7588
  • [35] MSFE: BLIND IMAGE QUALITY ASSESSMENT BASED ON MULTI-STAGE FEATURE ENCODING
    Jiang, Qiuping
    Shao, Feng
    Jiang, Gangyi
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3160 - 3164
  • [36] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment
    Yang, Sidi
    Wu, Tianhe
    Shi, Shuwei
    Lao, Shanshan
    Gong, Yuan
    Cao, Mingdeng
    Wang, Jiahao
    Yang, Yujiu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 1190 - 1199
  • [37] Multi-dataset learning with channel modulation loss for blind image quality assessment
    Li H.
    Yan Z.
    Fan X.
    Multimedia Tools and Applications, 2024, 83 (40) : 88159 - 88178
  • [38] Blind image quality assessment based on sparse representation of multi-level information
    Li B.
    Tian M.
    Zhang W.
    Wang X.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2021, 49 (08): : 40 - 45
  • [39] Three-stream network and improved attention mechanism-based blind image quality assessment
    Zheng, Jing
    Cui, Ziguan
    Gan, Zongliang
    Tang, Guijin
    Liu, Feng
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (01)
  • [40] Adaptive Patch Labeling and Multi-label Feature Selection for 360-Degree Image Quality Assessment
    Sendjasni, Abderrezzaq
    Larabi, Mohamed-Chaker
    Benkabou, Seif-Eddine
    2023 IEEE 25TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, MMSP, 2023,