I2V-GAN: Unpaired Infrared-to-Visible Video Translation

被引:31
|
作者
Li, Shuang [1 ]
Han, Bingfeng [1 ]
Yu, Zhenjie [1 ]
Liu, Chi Harold [1 ]
Chen, Kai [2 ]
Wang, Shuigen [2 ]
机构
[1] Beijing Inst Technol, Beijing, Peoples R China
[2] Yantai IRay Technol Lt Co, Jinan, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Infrared-to-Visible; Video-to-Video Translation; GANs;
D O I
10.1145/3474085.3475445
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human vision is often adversely affected by complex environmental factors, especially in night vision scenarios. Thus, infrared cameras are often leveraged to help enhance the visual effects via detecting infrared radiation in the surrounding environment, but the infrared videos are undesirable due to the lack of detailed semantic information. In such a case, an effective video-to-video translation method from the infrared domain to the visible light counterpart is strongly needed by overcoming the intrinsic huge gap between infrared and visible fields. To address this challenging problem, we propose an infrared-to-visible (I2V) video translation method I2V-GAN to generate fine-grained and spatial-temporal consistent visible light videos by given unpaired infrared videos. Technically, our model capitalizes on three types of constraints: 1) adversarial constraint to generate synthetic frames that are similar to the real ones, 2) cyclic consistency with the introduced perceptual loss for effective content conversion as well as style preservation, and 3) similarity constraints across and within domains to enhance the content and motion consistency in both spatial and temporal spaces at a fine-grained level. Furthermore, the current public available infrared and visible light datasets are mainly used for object detection or tracking, and some are composed of discontinuous images which are not suitable for video tasks. Thus, we provide a new dataset for infrared-to-visible video translation, which is named IRVI. Specifically, it has 12 consecutive video clips of vehicle and monitoring scenes, and both infrared and visible light videos could be apart into 24352 frames. Comprehensive experiments on IRVI validate that I2V-GAN is superior to the compared state-of-the-art methods in the translation of infrared-to-visible videos with higher fluency and finer semantic details. Moreover, additional experimental results on the flower-to-flower dataset indicate I2V-GAN is also applicable to other video translation tasks. The code and IRVI dataset are available at https://github.com/BIT-DA/I2V-GAN.
引用
收藏
页码:3061 / 3069
页数:9
相关论文
共 50 条
  • [31] Dynamics of the infrared-to-visible upconversion in an Er3+-doped KPb2Br5 crystal -: art. no. 165116
    Garcia-Adeva, AJ
    Balda, R
    Fernández, J
    Nyein, EE
    Hömmerich, U
    PHYSICAL REVIEW B, 2005, 72 (16)
  • [32] Infrared-to-visible upconversion emission in Er3+ doped TeO2-WO3-Bi2O3 glasses with silver nanoparticles
    de Campos, Vitor P. P.
    Kassab, Luciana R. P.
    de Assumpcao, Thiago A. A.
    da Silva, Diego S.
    de Araujo, Cid B.
    JOURNAL OF APPLIED PHYSICS, 2012, 112 (06)
  • [33] Infrared-to-visible frequency up-conversion in trivalent erbium ions doped fluoride glasses (ZnF2-AlF3-PbF2-LiF)
    Qin, GS
    Qin, WP
    Huang, SH
    Wu, CF
    Liu, HQ
    JOURNAL OF RARE EARTHS, 2003, 21 (03) : 315 - 317
  • [34] Infrared-to-Visible Frequency up-Conversion in Trivalent Erbium Ions Doped Fluoride Glasses (ZnF2-AlF3-PbF2-LiF)
    秦冠仕
    秦伟平
    黄世华
    吴长峰
    赵丹
    刘晃清
    Journal of Rare Earths, 2003, (03) : 315 - 317
  • [35] Theoretical lifetimes and Stark broadening parameters for visible-infrared spectral lines of V i in Arcturus
    Isidoro-Garcia, L.
    De Andres-Garcia, I
    Moreno-Conde, D.
    Colon, C.
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2022, 509 (03) : 4538 - 4554
  • [36] Infrared-to-visible upconversion in Yb3+/Er3+ co-doped PbO-GeO2 glass with silver nanoparticles
    Bomfim, F. A.
    Martinelli, J. R.
    Kassab, L. R. P.
    Assumpcao, T. A. A.
    de Araujo, C. B.
    JOURNAL OF NON-CRYSTALLINE SOLIDS, 2010, 356 (44-49) : 2598 - 2601
  • [37] Preparation of Y2O3:Yb,Er infrared-to-visible conversion phosphor fine particles using an emulsion liquid membrane system
    Hirai, T
    Orikoshi, T
    Komasawa, I
    CHEMISTRY OF MATERIALS, 2002, 14 (08) : 3576 - 3583
  • [38] TRANSITION STRENGTHS IN THE VISIBLE-INFRARED ABSORPTION-SPECTRUM OF I2
    TELLINGHUISEN, J
    JOURNAL OF CHEMICAL PHYSICS, 1982, 76 (10): : 4736 - 4744
  • [39] Fabrication and efficient infrared-to-visible upconversion in transparent glass ceramics of Er-Yb co-doped CaF2 nano-crystals
    Kishi, Y
    Tanabe, S
    Tochino, S
    Pezzotti, G
    JOURNAL OF THE AMERICAN CERAMIC SOCIETY, 2005, 88 (12) : 3423 - 3426
  • [40] Performance Modelling and Evaluation of V2I Video Surveillance System
    Pokhrel, Shiva Raj
    Pandey, Ram Chandra
    Joshi, Shashidhar Ram
    2015 TWELFTH INTERNATIONAL CONFERENCE ON WIRELESS AND OPTICAL COMMUNICATIONS NETWORKS (WOCN), 2015,