Max360IQ: Blind omnidirectional image quality assessment with multi-axis attention

Authors
Yan, Jiebin [1]
Tan, Ziwen [1]
Fang, Yuming [1]
Rao, Jiale [1]
Zuo, Yifan [1]
Affiliations
[1] Jiangxi Univ Finance & Econ, Sch Comp & Artificial Intelligence, Nanchang, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
Omnidirectional images; Perceptual quality assessment; Multi-axis attention
DOI
10.1016/j.patcog.2025.111429
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
An omnidirectional image, also called a 360-degree image, captures the entire 360-degree scene and thereby gives users a more realistic, immersive experience than general 2D and stereoscopic images. This same property makes it challenging to measure the perceptual quality of omnidirectional images, which is closely tied to users' quality of experience, especially when the images suffer from non-uniform distortion. In this paper, we propose a novel and effective blind omnidirectional image quality assessment (BOIQA) model with multi-axis attention (Max360IQ), which measures the quality of both uniformly and non-uniformly distorted omnidirectional images. Specifically, Max360IQ is mainly composed of a backbone with stacked multi-axis attention modules that capture both global and local spatial interactions of extracted viewports, a multi-scale feature integration (MSFI) module that fuses multi-scale features, and a quality regression module with deep semantic guidance that predicts the quality of omnidirectional images. Experimental results demonstrate that Max360IQ outperforms the state-of-the-art Assessor360 by 3.6% in terms of SRCC on the JUFE database with non-uniform distortion, and improves SRCC by 0.4% and 0.8% on the OIQA and CVIQ databases, respectively. The source code is available at https://github.com/WenJuing/Max360IQ.
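The abstract does not spell out how the multi-axis attention modules combine local and global interactions. As a rough illustration only, the sketch below assumes a MaxViT-style scheme in which tokens of a viewport feature map are grouped in two ways before attention is applied within each group: contiguous windows (local axis) and evenly strided grids (global axis). The function names `block_partition` and `grid_partition` and the sizes used are hypothetical and are not taken from the Max360IQ source code.

```python
# Minimal sketch of MaxViT-style token grouping (an assumption, not the
# authors' implementation). Each function returns groups of (row, col)
# coordinates; attention would be computed among the tokens of one group.

def block_partition(height, width, window):
    """Local axis: non-overlapping window x window blocks of adjacent tokens."""
    assert height % window == 0 and width % window == 0
    groups = []
    for top in range(0, height, window):
        for left in range(0, width, window):
            groups.append([(top + i, left + j)
                           for i in range(window)
                           for j in range(window)])
    return groups


def grid_partition(height, width, grid):
    """Global axis: grid x grid sets of tokens spread evenly over the map."""
    assert height % grid == 0 and width % grid == 0
    stride_h, stride_w = height // grid, width // grid
    groups = []
    for off_h in range(stride_h):
        for off_w in range(stride_w):
            groups.append([(off_h + i * stride_h, off_w + j * stride_w)
                           for i in range(grid)
                           for j in range(grid)])
    return groups


if __name__ == "__main__":
    # Hypothetical 4 x 4 feature map with group size 2.
    print(block_partition(4, 4, 2)[0])  # neighbouring tokens
    print(grid_partition(4, 4, 2)[0])   # tokens two positions apart
```

Under this assumption, both partitions produce groups of the same size, so the same attention operator can be reused for the local and the global pass; only the grouping of tokens changes.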
Pages: 9
Related papers
50 in total
  • [21] MSA-MaxNet: Multi-Scale Attention Enhanced Multi-Axis Vision Transformer Network for Medical Image Segmentation
    Wu, Wei
    Huang, Junfeng
    Zhang, Mingxuan
    Li, Yichen
    Yu, Qijia
    Zhao, Qi
    JOURNAL OF CELLULAR AND MOLECULAR MEDICINE, 2024, 28 (24)
  • [22] Blind image quality assessment via learnable attention-based pooling
    Gu, Jie
    Meng, Gaofeng
    Xiang, Shiming
    Pan, Chunhong
    PATTERN RECOGNITION, 2019, 91 : 332 - 344
  • [23] Omnidirectional Image Quality Assessment by Distortion Discrimination Assisted Multi-Stream Network
    Zhou, Yu
    Sun, Yanjing
    Li, Leida
    Gu, Ke
    Fang, Yuming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 1767 - 1777
  • [24] Spherical Triangle Mesh Representation and Multi-channel Residual Graph Convolution Network Based Blind Omnidirectional Image Quality Assessment
    Chao, Yong
    Song, Yong
    Jiang, Zhidi
    Ye, Ziwei
    Cao, Liuyan
    Yu, Mei
    Jiang, Gangyi
    OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY VIII, 2021, 11897
  • [25] MAFBLiF: Multi-Scale Attention Feature Fusion-Based Blind Light Field Image Quality Assessment
    Zhou, Rui
    Jiang, Gangyi
    Cui, Yueli
    Chen, Yeyao
    Xu, Haiyong
    Luo, Ting
    Yu, Mei
    IEEE TRANSACTIONS ON BROADCASTING, 2024, 70 (04) : 1266 - 1278
  • [26] Dual-Level Blind Omnidirectional Image Quality Assessment Network Based on Human Visual Perception
    Liu, Deyang
    Zhang, Lu
    Wan, Lifei
    Yao, Wei
    Ma, Jian
    Zhang, Youzhi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (09) : 1076 - 1084
  • [27] Blind Image Quality Assessment Based on Multi-scale KLT
    Yang, Chao
    Zhang, Xinfeng
    An, Ping
    Shen, Liquan
    Kuo, C.-C. Jay
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 1557 - 1566
  • [28] Blind quality assessment of multi-focus image fusion algorithms
    Nava, Rodrigo
    Escalante-Ramirez, Boris
    Cristobal, Gabriel
    OPTICS, PHOTONICS, AND DIGITAL TECHNOLOGIES FOR MULTIMEDIA APPLICATIONS, 2010, 7723
  • [29] PW-360IQA: Perceptually-Weighted Multichannel CNN for Blind 360-Degree Image Quality Assessment
    Sendjasni, Abderrezzaq
    Larabi, Mohamed-Chaker
    SENSORS, 2023, 23 (09)
  • [30] ADGNet: Attention Discrepancy Guided Deep Neural Network for Blind Image Quality Assessment
    Ma, Xiaoyu
    Wang, Yaqi
    Liu, Chang
    Zhang, Suiyu
    Yu, Dingguo
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022 : 1309 - 1318