Deep learning for thermal-RGB image-to-image translation

被引:0
|
作者
Wadsworth, Emma [1 ]
Mahajan, Advait [1 ]
Prasad, Raksha [1 ]
Menon, Rajesh [1 ]
机构
[1] Univ Utah, Elect & Comp Engn Dept, 50 Cent Campus Dr, Salt Lake City, UT 84112 USA
关键词
Deep learning; Thermal infrared image; Colorization; Image processing;
D O I
10.1016/j.infrared.2024.105442
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
Thermal infrared (TIR) and visible (RGB) images offer complementary information, and present different challenges. We report a deep learning approach to translate images from TIR to RGB, and vice-versa, enabling the extraction of information from both modalities with a single camera. We collected a diverse dataset of approximate to 150, , 000 TIR-RGB image pairs, encompassing day, night, dusk, and a variety of scenes. The pix2pix network achieved average structural similarity index (SSIM) of 0.58 for TIR-to-RGB and 0.72 for RGB-to-TIR, outperforming unpaired-data translation. Our contributions include: (1) creating the largest and most diverse publicly available dataset of TIR-RGB image pairs, and (2) demonstrating efficient translation between TIR and RGB image modalities across diverse global priors. This work advances applications including low-light imaging, thermal sensing, and multi-band scene reconstruction with a single camera.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Comparison of Deep Learning Image-to-image Models for Medical Image Translation
    Yang, Zeyu
    Zoellner, Frank G.
    [J]. BILDVERARBEITUNG FUR DIE MEDIZIN 2024, 2024, : 344 - 349
  • [2] Deliberation Learning for Image-to-Image Translation
    He, Tianyu
    Xia, Yingce
    Lin, Jianxin
    Tan, Xu
    He, Di
    Qin, Tao
    Chen, Zhibo
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2484 - 2490
  • [3] Unaligned Image-to-Image Translation by Learning to Reweight
    Xie, Shaoan
    Gong, Mingming
    Xu, Yanwu
    Zhang, Kun
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14154 - 14164
  • [4] Contrastive learning for unsupervised image-to-image translation
    Lee, Hanbit
    Seol, Jinseok
    Lee, Sang-goo
    Park, Jaehui
    Shim, Junho
    [J]. APPLIED SOFT COMPUTING, 2024, 151
  • [5] RL-I2IT: Image-to-Image Translation with Deep Reinforcement Learning
    Wang, Xin
    Luo, Ziwei
    Hu, Jing
    Feng, Chengming
    Hu, Shu
    Zhu, Bin
    Wu, Xi
    Zhu, Hongtu
    Li, Xin
    Lyu, Siwei
    [J]. arXiv, 2023,
  • [6] DualGAN: Unsupervised Dual Learning for Image-to-Image Translation
    Yi, Zili
    Zhang, Hao
    Tan, Ping
    Gong, Minglun
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2868 - 2876
  • [7] Dual Contrastive Learning for Unsupervised Image-to-Image Translation
    Han, Junlin
    Shoeiby, Mehrdad
    Petersson, Lars
    Armin, Mohammad Ali
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 746 - 755
  • [8] Improving Learning time in Unsupervised Image-to-Image Translation
    Min, Tae-Hong
    Kim, Do-Yun
    Choi, Young-June
    [J]. 2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC 2019), 2019, : 455 - 458
  • [9] Theoretical Analysis of Image-to-Image Translation with Adversarial Learning
    Pan, Xudong
    Zhang, Mi
    Ding, Daizong
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [10] TraVeLGAN: Image-to-image Translation by Transformation Vector Learning
    Amodio, Matthew
    Krishnaswamy, Smita
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8975 - 8984