GAN-based Image Translation Model with Self-Attention for Nighttime Dashcam Data Augmentation

被引：2

作者：

Sultana, Rebeka ^{[1
]}

Ohashi, Gosuke ^{[2
]}

机构：

[1] Shizuoka Univ, Grad Sch Sci & Technol, Hamamatsu 4328561, Japan

[2] Shizuoka Univ, Dept Elect & Elect Engn, Hamamatsu 4328561, Japan

来源：

IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES | 2023年 / E106A卷 / 09期

关键词：

GAN; image-to-image translation; self-attention; data augmen-tation; nighttime dashcam image; object detection; ADAS;

D O I：

10.1587/transfun.2022IMP0004

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

High-performance deep learning-based object detection models can reduce traffic accidents using dashcam images during nighttime driving. Deep learning requires a large-scale dataset to obtain a highperformance model. However, existing object detection datasets are mostly daytime scenes and a few nighttime scenes. Increasing the nighttime dataset is laborious and time-consuming. In such a case, it is possible to convert daytime images to nighttime images by image-to-image translation model to augment the nighttime dataset with less effort so that the translated dataset can utilize the annotations of the daytime dataset. Therefore, in this study, a GAN-based image-to-image translation model is proposed by incorporating self-attention with cycle consistency and content/style separation for nighttime data augmentation that shows high fidelity to annotations of the daytime dataset. Experimental results highlight the effectiveness of the proposed model compared with other models in terms of translated images and FID scores. Moreover, the high fidelity of translated images to the annotations is verified by a small object detection model according to detection results and mAP. Ablation studies confirm the effectiveness of self-attention in the proposed model. As a contribution to GAN-based data augmentation, the source code of the proposed image translation model is publicly available at https://github.com/subecky/Image-Translation-With-Self-Attention

引用

页码：1202 / 1210

页数：9

共 50 条

[21] GAN with opposition-based blocks and channel self-attention mechanism for image synthesis
Liu, Gang
Ke, Aihua
Wu, Xinyun
Zhang, Haifeng
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 246
[22] Self-attention StarGAN for Multi-domain Image-to-Image Translation
He, Ziliang
Yang, Zhenguo
Mao, Xudong
Lv, Jianming
Li, Qing
Liu, Wenyin
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: IMAGE PROCESSING, PT III, 2019, 11729 : 537 - 549
[23] Self-Ensembling with GAN-based Data Augmentation for Domain Adaptation in Semantic Segmentation
Choi, Jaehoon
Kim, Taekyung
Kim, Changick
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6829 - 6839
[24] Nighttime Lane Detection Based on Retinex Theory and Self-Attention Distillation
Wang, Jingpin
Ge, Yuan
Han, Chao
Ye, Gang
Zhao, Jie
Chang, Tingting
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 1801 - 1806
[25] GAN-based unpaired image-to-image translation for maritime imagery
Mediavilla, Chelsea
Sato, Jonathan
Manzanares, Mitch
Dotter, Marissa
Parameswaran, Shibin
GEOSPATIAL INFORMATICS X, 2020, 11398
[26] Improving satellite image classification accuracy using GAN-based data augmentation and vision transformers
Ayyub Alzahem
Wadii Boulila
Anis Koubaa
Zahid Khan
Ibrahim Alturki
Earth Science Informatics, 2023, 16 : 4169 - 4186
[27] COViT-GAN: Vision Transformer for COVID-19 Detection in CT Scan Images with Self-Attention GAN for Data Augmentation
Ambita, Ara Abigail E.
Boquio, Eujene Nikka, V
Naval, Prospero C., Jr.
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 587 - 598
[28] A GAN-Based Framework Combining Memory and Self-Attention Mechanisms for Video Anomaly Detection in Online Gaming Environments
Xiong L.-T.
Ou B.
Cheng Z.-P.
Computer-Aided Design and Applications, 2024, 21 (s5): : 91 - 105
[29] Improving satellite image classification accuracy using GAN-based data augmentation and vision transformers
Alzahem, Ayyub
Boulila, Wadii
Koubaa, Anis
Khan, Zahid
Alturki, Ibrahim
EARTH SCIENCE INFORMATICS, 2023, 16 (04) : 4169 - 4186
[30] GAN-Based Data Augmentation for Visual Finger Spelling Recognition
Kwolek, Bogdan
ELEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2018), 2019, 11041

← 1 2 3 4 5 →