GAN-based Image Translation Model with Self-Attention for Nighttime Dashcam Data Augmentation

被引:2
|
作者
Sultana, Rebeka [1 ]
Ohashi, Gosuke [2 ]
机构
[1] Shizuoka Univ, Grad Sch Sci & Technol, Hamamatsu 4328561, Japan
[2] Shizuoka Univ, Dept Elect & Elect Engn, Hamamatsu 4328561, Japan
关键词
GAN; image-to-image translation; self-attention; data augmen-tation; nighttime dashcam image; object detection; ADAS;
D O I
10.1587/transfun.2022IMP0004
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
High-performance deep learning-based object detection models can reduce traffic accidents using dashcam images during nighttime driving. Deep learning requires a large-scale dataset to obtain a highperformance model. However, existing object detection datasets are mostly daytime scenes and a few nighttime scenes. Increasing the nighttime dataset is laborious and time-consuming. In such a case, it is possible to convert daytime images to nighttime images by image-to-image translation model to augment the nighttime dataset with less effort so that the translated dataset can utilize the annotations of the daytime dataset. Therefore, in this study, a GAN-based image-to-image translation model is proposed by incorporating self-attention with cycle consistency and content/style separation for nighttime data augmentation that shows high fidelity to annotations of the daytime dataset. Experimental results highlight the effectiveness of the proposed model compared with other models in terms of translated images and FID scores. Moreover, the high fidelity of translated images to the annotations is verified by a small object detection model according to detection results and mAP. Ablation studies confirm the effectiveness of self-attention in the proposed model. As a contribution to GAN-based data augmentation, the source code of the proposed image translation model is publicly available at https://github.com/subecky/Image-Translation-With-Self-Attention
引用
收藏
页码:1202 / 1210
页数:9
相关论文
共 50 条
  • [21] GAN with opposition-based blocks and channel self-attention mechanism for image synthesis
    Liu, Gang
    Ke, Aihua
    Wu, Xinyun
    Zhang, Haifeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
  • [22] Self-attention StarGAN for Multi-domain Image-to-Image Translation
    He, Ziliang
    Yang, Zhenguo
    Mao, Xudong
    Lv, Jianming
    Li, Qing
    Liu, Wenyin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 537 - 549
  • [23] Self-Ensembling with GAN-based Data Augmentation for Domain Adaptation in Semantic Segmentation
    Choi, Jaehoon
    Kim, Taekyung
    Kim, Changick
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6829 - 6839
  • [24] Nighttime Lane Detection Based on Retinex Theory and Self-Attention Distillation
    Wang, Jingpin
    Ge, Yuan
    Han, Chao
    Ye, Gang
    Zhao, Jie
    Chang, Tingting
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 1801 - 1806
  • [25] GAN-based unpaired image-to-image translation for maritime imagery
    Mediavilla, Chelsea
    Sato, Jonathan
    Manzanares, Mitch
    Dotter, Marissa
    Parameswaran, Shibin
    GEOSPATIAL INFORMATICS X, 2020, 11398
  • [26] Improving satellite image classification accuracy using GAN-based data augmentation and vision transformers
    Ayyub Alzahem
    Wadii Boulila
    Anis Koubaa
    Zahid Khan
    Ibrahim Alturki
    Earth Science Informatics, 2023, 16 : 4169 - 4186
  • [27] COViT-GAN: Vision Transformer for COVID-19 Detection in CT Scan Images with Self-Attention GAN for Data Augmentation
    Ambita, Ara Abigail E.
    Boquio, Eujene Nikka, V
    Naval, Prospero C., Jr.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 587 - 598
  • [28] A GAN-Based Framework Combining Memory and Self-Attention Mechanisms for Video Anomaly Detection in Online Gaming Environments
    Xiong L.-T.
    Ou B.
    Cheng Z.-P.
    Computer-Aided Design and Applications, 2024, 21 (s5): : 91 - 105
  • [29] Improving satellite image classification accuracy using GAN-based data augmentation and vision transformers
    Alzahem, Ayyub
    Boulila, Wadii
    Koubaa, Anis
    Khan, Zahid
    Alturki, Ibrahim
    EARTH SCIENCE INFORMATICS, 2023, 16 (04) : 4169 - 4186
  • [30] GAN-Based Data Augmentation for Visual Finger Spelling Recognition
    Kwolek, Bogdan
    ELEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2018), 2019, 11041