Omnidirectional image quality assessment with local-global vision transformers

被引:0
|
作者
Tofighi, Nafiseh Jabbari [1 ,2 ]
Elfkir, Mohamed Hedi [3 ]
Imamoglu, Nevrez [4 ]
Ozcinar, Cagri [5 ]
Erdem, Aykut [1 ,2 ]
Erdem, Erkut [3 ]
机构
[1] Koc Univ, Dept Comp Engn, Istanbul, Turkiye
[2] Koc Univ Is Bank AI Ctr, Istanbul, Turkiye
[3] Hacettepe Univ, Dept Comp Engn, Ankara, Turkiye
[4] Natl Inst Adv Ind Sci & Technol, Digital Architecture Res Ctr, Tokyo, Japan
[5] Msk Ai, London, England
关键词
360-degree images; Image quality assessment; Vision transformers; NATURAL SCENE STATISTICS; SIMILARITY INDEX; PERCEPTION; DEVIATION; EFFICIENT;
D O I
10.1016/j.imavis.2024.105151
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rising popularity of omnidirectional images (ODIs) in virtual reality applications, the need for specialized image quality assessment (IQA) methods becomes increasingly critical. Traditional IQA approaches, designed for rectilinear images, often fail to evaluate ODIs accurately due to their 360 -degree scene representation. Addressing this, we introduce the Local - Global Transformer for 360 -degree Image Quality Assessment (LGT360IQ). This novel framework features dual branches tailored to mimic top -down and bottom -up visual attention mechanisms, adapted for the spherical characteristics of ODIs. The local branch processes tangent viewports from salient regions within the equirectangular projection image, extracting detailed features for granular quality assessment. In parallel, the global branch utilizes a task -dependent token sampling strategy for holistic image feature processing and quality score prediction. This integrated approach combines local and global information, offering an effective IQA method for ODIs. Our extensive evaluation across three benchmark ODI datasets, CVIQ, OIQA, and ODI, demonstrates LGT360IQ superior performance and establishes its role in advancing the field of IQA for omnidirectional images.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Local-global coordination with transformers for referring image segmentation
    Liu, Fang
    Kong, Yuqiu
    Zhang, Lihe
    Feng, Guang
    Yin, Baocai
    [J]. NEUROCOMPUTING, 2023, 522 : 39 - 52
  • [2] BLIND OMNIDIRECTIONAL IMAGE QUALITY ASSESSMENT: INTEGRATING LOCAL STATISTICS AND GLOBAL SEMANTICS
    Zhou, Wei
    Wang, Zhou
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1405 - 1409
  • [3] Distilling Vision Transformers for no-reference Perceptual CT Image Quality Assessment
    Baldeon-Calisto, Maria G.
    Rivera-Velastegui, Francisco
    Lai-Yuen, Susana K.
    Riofrio, Daniel
    Perez-Perez, Noel
    Benitez, Diego
    Flores-Moyano, Ricardo
    [J]. MEDICAL IMAGING 2024: IMAGE PROCESSING, 2024, 12926
  • [4] Local-global visual interaction attention for image captioning
    Wang, Changzhi
    Gu, Xiaodong
    [J]. DIGITAL SIGNAL PROCESSING, 2022, 130
  • [5] Enhancing Local-Global Representation Learning for Image Restoration
    Cui, Yuning
    Knoll, Alois
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (04) : 6522 - 6530
  • [6] Effective Local-Global Transformer for Natural Image Matting
    Hu, Liangpeng
    Kong, Yating
    Li, Jide
    Li, Xiaoqiang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (08) : 3888 - 3898
  • [7] A Local-Global Interactive Vision Transformer for Aerial Scene Classification
    Peng, Ting
    Yi, Jingjun
    Fang, Yuan
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [8] Combining Global and Local Variation for Image Quality Assessment
    Gao, Min-Juan
    Dang, Hong-She
    Wei, Li-Li
    Wang, Hai-Long
    Zhang, Xuan-De
    [J]. Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (12): : 2662 - 2671
  • [9] Perceptual Image Quality Assessment with Transformers
    Cheon, Manri
    Yoon, Sung-Jun
    Kang, Byungyeon
    Lee, Junwoo
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 433 - 442
  • [10] DistilIQA: Distilling Vision Transformers for no-reference perceptual CT image quality assessment
    Baldeon-Calisto, Maria
    Rivera-Velastegui, Francisco
    Lai-Yuen, Susana K.
    Riofrío, Daniel
    Pérez-Pérez, Noel
    Benítez, Diego
    Flores-Moyano, Ricardo
    [J]. Computers in Biology and Medicine, 2024, 177