CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

Cited by: 61
Authors
Zhou, Xingran [1, 2]
Zhang, Bo [2]
Zhang, Ting [2]
Zhang, Pan [4]
Bao, Jianmin [2]
Chen, Dong [2]
Zhang, Zhongfei [3]
Wen, Fang [2]
Affiliations
[1] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
[3] SUNY Binghamton, Binghamton, NY 13902 USA
[4] USTC, Hefei, Anhui, Peoples R China
Source
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021
DOI
10.1109/CVPR46437.2021.01130
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present the full-resolution correspondence learning for cross-domain images, which aids image translation. We adopt a hierarchical strategy that uses the correspondence from coarse level to guide the fine levels. At each hierarchy, the correspondence can be efficiently computed via PatchMatch that iteratively leverages the matchings from the neighborhood. Within each PatchMatch iteration, the ConvGRU module is employed to refine the current correspondence considering not only the matchings of larger context but also the historic estimates. The proposed CoCosNet v2, a GRU-assisted PatchMatch approach, is fully differentiable and highly efficient. When jointly trained with image translation, full-resolution semantic correspondence can be established in an unsupervised manner, which in turn facilitates the exemplar-based image translation. Experiments on diverse translation tasks show that CoCosNet v2 performs considerably better than state-of-the-art literature on producing high-resolution images.
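The coarse-to-fine PatchMatch idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it is plain, non-differentiable Python, it matches single per-pixel feature vectors rather than patches, and it omits the learned ConvGRU refinement entirely. All function names (`patchmatch`, `coarse_to_fine`, `upsample_nnf`, and so on) are hypothetical and chosen only for illustration.

```python
import random

def cost(src, tgt, p, q):
    """Squared distance between feature vectors src[p] and tgt[q]."""
    (py, px), (qy, qx) = p, q
    return sum((a - b) ** 2 for a, b in zip(src[py][px], tgt[qy][qx]))

def total_cost(src, tgt, nnf):
    """Sum of matching costs over the whole correspondence field."""
    h, w = len(src), len(src[0])
    return sum(cost(src, tgt, (y, x), nnf[y][x]) for y in range(h) for x in range(w))

def patchmatch(src, tgt, nnf, iters=4, rng=None):
    """Refine nnf in place; nnf[y][x] is the (row, col) in tgt matched to src[y][x]."""
    rng = rng or random.Random(0)
    h, w = len(src), len(src[0])
    th, tw = len(tgt), len(tgt[0])

    def try_candidate(y, x, cand):
        # Accept a candidate only if it is in bounds and strictly better.
        cy, cx = cand
        if 0 <= cy < th and 0 <= cx < tw:
            if cost(src, tgt, (y, x), cand) < cost(src, tgt, (y, x), nnf[y][x]):
                nnf[y][x] = cand

    for it in range(iters):
        step = 1 if it % 2 == 0 else -1  # alternate the scan direction
        ys = list(range(h))[::step]
        xs = list(range(w))[::step]
        for y in ys:
            for x in xs:
                # Propagation: reuse an already-visited neighbour's match, shifted.
                for dy, dx in ((0, step), (step, 0)):
                    ny, nx = y - dy, x - dx
                    if 0 <= ny < h and 0 <= nx < w:
                        my, mx = nnf[ny][nx]
                        try_candidate(y, x, (my + dy, mx + dx))
                # Random search around the current best, with a shrinking radius.
                radius = max(th, tw)
                while radius >= 1:
                    by, bx = nnf[y][x]
                    try_candidate(y, x, (by + rng.randint(-radius, radius),
                                         bx + rng.randint(-radius, radius)))
                    radius //= 2
    return nnf

def downsample(feat):
    """Halve the resolution by keeping every second row and column."""
    return [row[::2] for row in feat[::2]]

def upsample_nnf(coarse, h, w, th, tw):
    """Initialise a fine field from a coarse one: double coordinates, then clamp."""
    return [[(min(2 * coarse[y // 2][x // 2][0] + y % 2, th - 1),
              min(2 * coarse[y // 2][x // 2][1] + x % 2, tw - 1))
             for x in range(w)] for y in range(h)]

def coarse_to_fine(src, tgt, levels=2, seed=0):
    """Match at the coarsest level first, then use that result to guide finer levels."""
    rng = random.Random(seed)
    pyramid = [(src, tgt)]
    for _ in range(levels - 1):
        s, t = pyramid[-1]
        pyramid.append((downsample(s), downsample(t)))
    nnf = None
    for s, t in reversed(pyramid):
        h, w, th, tw = len(s), len(s[0]), len(t), len(t[0])
        if nnf is None:
            nnf = [[(rng.randrange(th), rng.randrange(tw)) for _ in range(w)]
                   for _ in range(h)]
        else:
            nnf = upsample_nnf(nnf, h, w, th, tw)
        patchmatch(s, t, nnf, rng=rng)
    return nnf
```

Here `src` and `tgt` are nested lists of feature tuples, and the returned field holds one target coordinate per source position. Because a candidate is accepted only when it lowers the cost, each pass can only keep or improve the total matching cost.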
Pages: 11460-11470
Page count: 11