CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

被引：61

作者：

Zhou, Xingran ^{[1
,2
]}

Zhang, Bo ^{[2
]}

Zhang, Ting ^{[2
]}

Zhang, Pan ^{[4
]}

Bao, Jianmin ^{[2
]}

Chen, Dong ^{[2
]}

Zhang, Zhongfei ^{[3
]}

Wen, Fang ^{[2
]}

机构：

[1] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China

[2] Microsoft Res Asia, Beijing, Peoples R China

[3] SUNY Binghamton, Binghamton, NY 13902 USA

[4] USTC, Hefei, Anhui, Peoples R China

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

关键词：

D O I：

10.1109/CVPR46437.2021.01130

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present the full-resolution correspondence learning for cross-domain images, which aids image translation. We adopt a hierarchical strategy that uses the correspondence from coarse level to guide the fine levels. At each hierarchy, the correspondence can be efficiently computed via PatchMatch that iteratively leverages the matchings from the neighborhood. Within each PatchMatch iteration, the ConvGRU module is employed to refine the current correspondence considering not only the matchings of larger context but also the historic estimates. The proposed CoCosNet v2, a GRU-assisted PatchMatch approach, is fully differentiable and highly efficient. When jointly trained with image translation, full-resolution semantic correspondence can be established in an unsupervised manner, which in turn facilitates the exemplar-based image translation. Experiments on diverse translation tasks show that CoCosNet v2 performs considerably better than state-of-the-art literature on producing high-resolution images.

引用

页码：11460 / 11470

页数：11

共 50 条

[41] Garbage image classification algorithm based on improved MobileNet v2
Chen Z.-C.
Jiao H.-N.
Yang J.
Zeng H.-F.
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2021, 55 (08): : 1490 - 1499
[42] HWNet v2: an efficient word image representation for handwritten documents
Krishnan, Praveen
Jawahar, C., V
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (04) : 387 - 405
[43] HWNet v2: an efficient word image representation for handwritten documents
Praveen Krishnan
C. V. Jawahar
International Journal on Document Analysis and Recognition (IJDAR), 2019, 22 : 387 - 405
[44] Mushroom image classification and recognition based on improved ConvNeXt V2
Zhang, Shulong
Zhao, Kexin
Huo, Yukang
Yao, Mingyuan
Xue, Lin
Wang, Haihua
JOURNAL OF FOOD SCIENCE, 2025, 90 (03)
[45] Contextual modulation of sensitivity to naturalistic image structure in macaque V2
Ziemba, Corey M.
Freeman, Jeremy
Simoncelli, Fero P.
Movshon, J. Anthony
JOURNAL OF NEUROPHYSIOLOGY, 2018, 120 (02) : 409 - 420
[46] Spectroscopic characterization of the v2=3 and v2 = v4=1 states for 15NH3 from high resolution infrared spectra
Cane, Elisabetta
Di Lonardo, Gianfranco
Fusina, Luciano
Tamassia, Filippo
Predoi-Cross, Adriana
JOURNAL OF QUANTITATIVE SPECTROSCOPY & RADIATIVE TRANSFER, 2020, 250
[47] Full parallax three-dimensional display from Kinect v1 and v2
Hong, Seokmin
Saavedra, Genaro
Martinez-Corral, Manuel
OPTICAL ENGINEERING, 2017, 56 (04)
[48] Asymmetric slack contrastive learning for full use of feature information in image translation
Zhang, Yusen
Li, Min
Gou, Yao
He, Yujie
KNOWLEDGE-BASED SYSTEMS, 2024, 299
[49] Development, learning, attention, and grouping by the laminar circuits of V1 and V2
Grossberg, S
EUROPEAN JOURNAL OF NEUROSCIENCE, 2000, 12 : 59 - 59
[50] Laminar substrates of attention, grouping and perceptual learning in V1 and V2
Raizada, RDS
Grossberg, S
INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 1999, 40 (04) : S645 - S645

← 1 2 3 4 5 →