Distilling Vision Transformers for no-reference Perceptual CT Image Quality Assessment

被引:0
|
作者
Baldeon-Calisto, Maria G. [1 ,2 ,5 ]
Rivera-Velastegui, Francisco [3 ]
Lai-Yuen, Susana K. [4 ]
Riofrio, Daniel [5 ]
Perez-Perez, Noel [5 ]
Benitez, Diego [5 ]
Flores-Moyano, Ricardo [5 ]
机构
[1] Univ San Francisco Quito USFQ, Dept Ingn Ind, Quito 170157, Ecuador
[2] Univ San Francisco Quito USFQ, Inst Innovac Prod & Logist CATENA USFQ, Quito 170157, Ecuador
[3] Univ Int Ecuador UIDE, Dept Invest & Postgrad, Quito, Ecuador
[4] Univ S Florida, Dept Ind & Management Syst, Tampa, FL USA
[5] Univ San Francisco Quito USFQ, Colegio Ciencias & Ingn Politecn, Quito 170157, Ecuador
来源
关键词
Vision Transformers; Transformer model distillation; Medical Image Classification; Low-dose Computed Tomography; Image Quality Assessment;
D O I
10.1117/12.3004838
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Image quality assessment of CT scans is of utmost importance in balancing radiation dose and image quality. Nonetheless, estimating the image quality of CT scans is a highly subjective task that cannot be adequately captured by a single quantitative metric. In this work, we present a novel vision Transformer network for no-reference CT image quality assessment. Our network combines convolutional operations and multi-head self-attention mechanisms by adding a powerful convolutional stem in the beginning of the traditional ViT network. To enhance the performance and efficiency of the network, we introduce a distillation methodology, comprised of two sequential steps. In Step I, we construct a "teacher ensemble network" by training five Vision Transformer networks using a five-fold division schema. In Step II, we train a single vision Transformer, referred to as the "student network", by using the teacher's predictions as new labels. The student network is also optimized using the original labeled dataset. The effectiveness of the proposed model is evaluated on the task of predicting image quality scores from low-dose abdominal CT images from the LDCTIQAC2023 Grand Challenge. Our model demonstrates remarkable performance, ranking 6th during the testing phase of the challenge. Additionally, our experiments highlight the effectiveness of incorporating a convolutional stem in the ViT architecture and the distillation methodology.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] DistilIQA: Distilling Vision Transformers for no-reference perceptual CT image quality assessment
    Baldeon-Calisto, Maria
    Rivera-Velastegui, Francisco
    Lai-Yuen, Susana K.
    Riofrío, Daniel
    Pérez-Pérez, Noel
    Benítez, Diego
    Flores-Moyano, Ricardo
    [J]. Computers in Biology and Medicine, 2024, 177
  • [2] A no-reference perceptual image quality assessment database for learned image codecs
    Zhang, Jiaqi
    Fang, Zhigao
    Yu, Lu
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 88
  • [3] No-reference color image quality assessment: from entropy to perceptual quality
    Xiaoqiao Chen
    Qingyi Zhang
    Manhui Lin
    Guangyi Yang
    Chu He
    [J]. EURASIP Journal on Image and Video Processing, 2019
  • [4] No-reference color image quality assessment: from entropy to perceptual quality
    Chen, Xiaoqiao
    Zhang, Qingyi
    Lin, Manhui
    Yang, Guangyi
    He, Chu
    [J]. EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2019, 2019 (01)
  • [5] Combining CNN and transformers for full-reference and no-reference image quality assessment
    Zeng, Chao
    Kwong, Sam
    [J]. NEUROCOMPUTING, 2023, 549
  • [6] No-reference perceptual CT image quality assessment based on a self-supervised learning framework
    Lee, Wonkyeong
    Cho, Eunbyeol
    Kim, Wonjin
    Choi, Hyebin
    Beck, Kyongmin Sarah
    Yoon, Hyun Jung
    Baek, Jongduk
    Choi, Jang-Hwan
    [J]. MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (04):
  • [7] No-Reference Image Quality Assessment Based on the Fusion of Statistical and Perceptual Features
    Varga, Domonkos
    [J]. JOURNAL OF IMAGING, 2020, 6 (08)
  • [8] A No-Reference Image Quality Assessment
    Kemalkar, Aniket K.
    Bairagi, Vinayak K.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN COMPUTING, COMMUNICATION AND NANOTECHNOLOGY (ICE-CCN'13), 2013, : 462 - 465
  • [9] No-Reference Stereoscopic Image Quality Assessment Based on Image Distortion and Stereo Perceptual Information
    Shen, Liquan
    Fang, Ruigang
    Yao, Yang
    Geng, Xianqiu
    Wu, Dapeng
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2019, 3 (01): : 59 - 72
  • [10] Perceptual Image Quality Assessment with Transformers
    Cheon, Manri
    Yoon, Sung-Jun
    Kang, Byungyeon
    Lee, Junwoo
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 433 - 442