Image Quality Distortion Classification Using Vision Transformer

被引:0
|
作者
Lynn, Nay Chi [1 ]
Shimamura, Tetsuya [1 ]
机构
[1] Saitama Univ, Saitama, Japan
关键词
D O I
10.1007/978-3-031-57840-3_32
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose a method for classifying image quality distortions to identify common types of distortions typically present in images, utilizing a vision transformer. The method aims to enhance quality-related image processing approaches by identifying specific distortions as the initial step in distortion-based blind image quality assessment (BIQA). This simplifies the quality reconstruction process by tailoring it to the prior knowledge of distortion types, thereby aiding in improving image classification and potentially reducing biases caused by certain distortions. The proposed method is experimented on common benchmark image quality assessment (IQA) databases, including LIVE2008, TID2013, and KADID-10k. To generalize the performance with a larger database, we distorted images using four general distortion types: Gaussian noise, Gaussian blur, JPEG compression, and contrast degradation, applied to the ImageNet-1k database. The experimental results demonstrate that the proposed method outperforms other solutions in terms of accuracy
引用
收藏
页码:353 / 361
页数:9
相关论文
共 50 条
  • [1] Encoding laparoscopic image to words using vision transformer for distortion classification and ranking in laparoscopic videos
    Nouar AlDahoul
    Hezerul Abdul Karim
    Mhd Adel Momo
    Myles Joshua Toledo Tan
    Jamie Ledesma Fermin
    Multimedia Tools and Applications, 2025, 84 (10) : 7159 - 7181
  • [2] Image Classification Using Vision Transformer for EtC Images
    Hamano, Genki
    Imaizumi, Shoko
    Kiya, Hitoshi
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1506 - 1513
  • [3] The Application of Vision Transformer in Image Classification
    He, Zhixuan
    2022 THE 6TH INTERNATIONAL CONFERENCE ON VIRTUAL AND AUGMENTED REALITY SIMULATIONS, ICVARS 2022, 2022, : 56 - 63
  • [4] Privacy-Preserving Image Classification Using Vision Transformer
    Qi, Zheng
    MaungMaung, AprilPyone
    Kinoshita, Yuma
    Kiya, Hitoshi
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 543 - 547
  • [5] Hyperspectral Image Classification Using Groupwise Separable Convolutional Vision Transformer Network
    Zhao, Zhuoyi
    Xu, Xiang
    Li, Shutao
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 17
  • [6] Vision Transformer With Contrastive Learning for Hyperspectral Image Classification
    Zhou, Heng
    Zhang, Xin
    Zhang, Chunlei
    Ma, Qiaoyu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [7] CSiT: A Multiscale Vision Transformer for Hyperspectral Image Classification
    He, Wenxuan
    Huang, Weiliang
    Liao, Shuhong
    Xu, Zhen
    Yan, Jingwen
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 9266 - 9277
  • [8] Hyperspectral image classification with embedded linear vision transformer
    Yunfei Tan
    Ming Li
    Longfa Yuan
    Chaoshan Shi
    Yonghang Luo
    Guihao Wen
    Earth Science Informatics, 2025, 18 (1)
  • [9] Compressed-Domain Vision Transformer for Image Classification
    Ji, Ruolei
    Karam, Lina J.
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2024, 14 (02) : 299 - 310
  • [10] CrisisViT: A Robust Vision Transformer for Crisis Image Classification
    Long, Zijun
    McCreadie, Richard
    Imran, Muhammad
    Proceedings of the International ISCRAM Conference, 2023, 2023-text : 309 - 319