CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

被引:61
|
作者
Zhou, Xingran [1 ,2 ]
Zhang, Bo [2 ]
Zhang, Ting [2 ]
Zhang, Pan [4 ]
Bao, Jianmin [2 ]
Chen, Dong [2 ]
Zhang, Zhongfei [3 ]
Wen, Fang [2 ]
机构
[1] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
[3] SUNY Binghamton, Binghamton, NY 13902 USA
[4] USTC, Hefei, Anhui, Peoples R China
关键词
D O I
10.1109/CVPR46437.2021.01130
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present the full-resolution correspondence learning for cross-domain images, which aids image translation. We adopt a hierarchical strategy that uses the correspondence from coarse level to guide the fine levels. At each hierarchy, the correspondence can be efficiently computed via PatchMatch that iteratively leverages the matchings from the neighborhood. Within each PatchMatch iteration, the ConvGRU module is employed to refine the current correspondence considering not only the matchings of larger context but also the historic estimates. The proposed CoCosNet v2, a GRU-assisted PatchMatch approach, is fully differentiable and highly efficient. When jointly trained with image translation, full-resolution semantic correspondence can be established in an unsupervised manner, which in turn facilitates the exemplar-based image translation. Experiments on diverse translation tasks show that CoCosNet v2 performs considerably better than state-of-the-art literature on producing high-resolution images.
引用
收藏
页码:11460 / 11470
页数:11
相关论文
共 50 条
  • [31] Non-local sparse attention based swin transformer V2 for image super-resolution
    Lv, Ningning
    Yuan, Min
    Xie, Yufei
    Zhan, Kun
    Lu, Fuxiang
    SIGNAL PROCESSING, 2024, 222
  • [32] A depth image acquisition platform based on Kinect V2
    Zhai, Yu
    Qu, Yanlin
    Xu, Peng
    Li, Mengyao
    Han, Shaokun
    AOPC 2021: OPTICAL SENSING AND IMAGING TECHNOLOGY, 2021, 12065
  • [33] 4D-CT deformable image registration using unsupervised recursive cascaded full-resolution residual networks
    Xu, Lei
    Jiang, Ping
    Tsui, Tiffany
    Liu, Junyan
    Zhang, Xiping
    Yu, Lequan
    Niu, Tianye
    BIOENGINEERING & TRANSLATIONAL MEDICINE, 2023, 8 (06)
  • [34] IMAGE TERMINAL GUIDANCE BASED ON YOLO V2 FRAMEWORK
    Lan Yixing
    Peng Ke
    Zhang Weihua
    Liu Xuancen
    FOURTH IAA CONFERENCE ON DYNAMICS AND CONTROL OF SPACE SYSTEMS 2018, PTS I-III, 2018, 165 : 651 - 666
  • [35] Snapshot compressive imaging based digital image correlation: temporally super-resolved full-resolution deformation measurement
    Chen, Wenwu
    Zhang, Bo
    Gu, Liuning
    Liu, Haibo
    Suo, Jinli
    Shao, Xinxing
    OPTICS EXPRESS, 2022, 30 (19): : 33554 - 33573
  • [36] Cross-domain Correspondence Learning for Exemplar-based Image Translation
    Zhang, Pan
    Zhang, Bo
    Chen, Dong
    Yuan, Lu
    Wen, Fang
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5142 - 5152
  • [37] HIGH-RESOLUTION INFRARED-SPECTRA OF V1 + V2 AND V2 + V3 BANDS OF H2 O-16
    FLAUD, JM
    VALENTIN, A
    CAMYPEYR.C
    JOURNAL DE PHYSIQUE, 1972, 33 (8-9): : 741 - &
  • [38] Physiological correlates of perceptual learning in monkey V1 and V2
    Ghose, GM
    Yang, TM
    Maunsell, JHR
    JOURNAL OF NEUROPHYSIOLOGY, 2002, 87 (04) : 1867 - 1888
  • [39] DIIK-Net: A full-resolution cross-domain deep interaction convolutional neural network for MR image reconstruction
    Liu, Yu
    Pang, Yanwei
    Liu, Xiaohan
    Liu, Yiming
    Nie, Jing
    NEUROCOMPUTING, 2023, 517 : 213 - 222
  • [40] fMRI Reveals Visual Statistical Learning in Macaque V2
    Vergnieux, Victor
    Vogels, Rufin
    PERCEPTION, 2019, 48 : 92 - 92