CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

被引:61
|
作者
Zhou, Xingran [1 ,2 ]
Zhang, Bo [2 ]
Zhang, Ting [2 ]
Zhang, Pan [4 ]
Bao, Jianmin [2 ]
Chen, Dong [2 ]
Zhang, Zhongfei [3 ]
Wen, Fang [2 ]
机构
[1] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
[3] SUNY Binghamton, Binghamton, NY 13902 USA
[4] USTC, Hefei, Anhui, Peoples R China
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
D O I
10.1109/CVPR46437.2021.01130
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present the full-resolution correspondence learning for cross-domain images, which aids image translation. We adopt a hierarchical strategy that uses the correspondence from coarse level to guide the fine levels. At each hierarchy, the correspondence can be efficiently computed via PatchMatch that iteratively leverages the matchings from the neighborhood. Within each PatchMatch iteration, the ConvGRU module is employed to refine the current correspondence considering not only the matchings of larger context but also the historic estimates. The proposed CoCosNet v2, a GRU-assisted PatchMatch approach, is fully differentiable and highly efficient. When jointly trained with image translation, full-resolution semantic correspondence can be established in an unsupervised manner, which in turn facilitates the exemplar-based image translation. Experiments on diverse translation tasks show that CoCosNet v2 performs considerably better than state-of-the-art literature on producing high-resolution images.
引用
收藏
页码:11460 / 11470
页数:11
相关论文
共 50 条
  • [41] Garbage image classification algorithm based on improved MobileNet v2
    Chen Z.-C.
    Jiao H.-N.
    Yang J.
    Zeng H.-F.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2021, 55 (08): : 1490 - 1499
  • [42] HWNet v2: an efficient word image representation for handwritten documents
    Krishnan, Praveen
    Jawahar, C., V
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (04) : 387 - 405
  • [43] HWNet v2: an efficient word image representation for handwritten documents
    Praveen Krishnan
    C. V. Jawahar
    International Journal on Document Analysis and Recognition (IJDAR), 2019, 22 : 387 - 405
  • [44] Mushroom image classification and recognition based on improved ConvNeXt V2
    Zhang, Shulong
    Zhao, Kexin
    Huo, Yukang
    Yao, Mingyuan
    Xue, Lin
    Wang, Haihua
    JOURNAL OF FOOD SCIENCE, 2025, 90 (03)
  • [45] Contextual modulation of sensitivity to naturalistic image structure in macaque V2
    Ziemba, Corey M.
    Freeman, Jeremy
    Simoncelli, Fero P.
    Movshon, J. Anthony
    JOURNAL OF NEUROPHYSIOLOGY, 2018, 120 (02) : 409 - 420
  • [46] Spectroscopic characterization of the v2=3 and v2 = v4=1 states for 15NH3 from high resolution infrared spectra
    Cane, Elisabetta
    Di Lonardo, Gianfranco
    Fusina, Luciano
    Tamassia, Filippo
    Predoi-Cross, Adriana
    JOURNAL OF QUANTITATIVE SPECTROSCOPY & RADIATIVE TRANSFER, 2020, 250
  • [47] Full parallax three-dimensional display from Kinect v1 and v2
    Hong, Seokmin
    Saavedra, Genaro
    Martinez-Corral, Manuel
    OPTICAL ENGINEERING, 2017, 56 (04)
  • [48] Asymmetric slack contrastive learning for full use of feature information in image translation
    Zhang, Yusen
    Li, Min
    Gou, Yao
    He, Yujie
    KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [49] Development, learning, attention, and grouping by the laminar circuits of V1 and V2
    Grossberg, S
    EUROPEAN JOURNAL OF NEUROSCIENCE, 2000, 12 : 59 - 59
  • [50] Laminar substrates of attention, grouping and perceptual learning in V1 and V2
    Raizada, RDS
    Grossberg, S
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1999, 40 (04) : S645 - S645