An Overview of Vision Transformers for Image Processing: A Survey

Cited by: 0
Authors
Kameswari, Ch. Sita [1]
Kavitha, J. [2]
Reddy, T. Srinivas [3]
Chinthaguntla, Balaswamy [4]
Jagatheesaperumal, Senthil Kumar [5]
Gaftandzhieva, Silvia [6]
Doneva, Rositsa [6]
Affiliations
[1] Keshav Mem Inst Technol, Dept Comp Sci & Engn AI&ML, Hyderabad, India
[2] BVRIT HYDERABAD Coll Engn Women, Dept Informat Technol, Hyderabad, India
[3] Malla Reddy Engn Coll, Dept Elect & Commun Engn, Secunderabad, India
[4] Sheshadri Rao Gudlavalleru Engn Coll, Dept Elect & Commun Engn, Gudlavalleru, India
[5] Mepco Schlenk Engn Coll, Dept Elect & Commun Engn, Sivakasi 626005, India
[6] Univ Plovdiv Paisii Hilendarski, Plovdiv, Bulgaria
Keywords
Vision transformers; image processing; natural language processing; image;
DOI
10.14569/IJACSA.2023.0140830
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Image processing technology has become increasingly essential in the education sector, with universities and educational institutions exploring innovative ways to enhance their teaching techniques and provide a better learning experience for their students. Vision transformer-based models have been highly successful in various domains of artificial intelligence, including natural language processing and computer vision, and this success has generated significant interest from academic and industrial researchers. These models have outperformed other networks, such as convolutional and recurrent networks, on visual benchmarks, making them promising candidates for image processing applications. This article presents a comprehensive survey of vision transformer models for image processing and computer vision, focusing on their potential applications for student verification in university systems. The models can analyze data such as student ID cards and facial images to verify students accurately in real time, a capability that is becoming increasingly vital as online learning continues to gain traction. By accurately verifying the identity of students, universities and educational institutions can ensure that students have access to the learning materials and resources necessary for their academic success.
Pages: 273-289
Page count: 17