CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation

被引：61

作者：

Zhou, Xingran ^{[1
,2
]}

Zhang, Bo ^{[2
]}

Zhang, Ting ^{[2
]}

Zhang, Pan ^{[4
]}

Bao, Jianmin ^{[2
]}

Chen, Dong ^{[2
]}

Zhang, Zhongfei ^{[3
]}

Wen, Fang ^{[2
]}

机构：

[1] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China

[2] Microsoft Res Asia, Beijing, Peoples R China

[3] SUNY Binghamton, Binghamton, NY 13902 USA

[4] USTC, Hefei, Anhui, Peoples R China

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

关键词：

D O I：

10.1109/CVPR46437.2021.01130

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present the full-resolution correspondence learning for cross-domain images, which aids image translation. We adopt a hierarchical strategy that uses the correspondence from coarse level to guide the fine levels. At each hierarchy, the correspondence can be efficiently computed via PatchMatch that iteratively leverages the matchings from the neighborhood. Within each PatchMatch iteration, the ConvGRU module is employed to refine the current correspondence considering not only the matchings of larger context but also the historic estimates. The proposed CoCosNet v2, a GRU-assisted PatchMatch approach, is fully differentiable and highly efficient. When jointly trained with image translation, full-resolution semantic correspondence can be established in an unsupervised manner, which in turn facilitates the exemplar-based image translation. Experiments on diverse translation tasks show that CoCosNet v2 performs considerably better than state-of-the-art literature on producing high-resolution images.

引用

页码：11460 / 11470

页数：11

共 50 条

[31] Non-local sparse attention based swin transformer V2 for image super-resolution
Lv, Ningning
Yuan, Min
Xie, Yufei
Zhan, Kun
Lu, Fuxiang
SIGNAL PROCESSING, 2024, 222
[32] A depth image acquisition platform based on Kinect V2
Zhai, Yu
Qu, Yanlin
Xu, Peng
Li, Mengyao
Han, Shaokun
AOPC 2021: OPTICAL SENSING AND IMAGING TECHNOLOGY, 2021, 12065
[33] 4D-CT deformable image registration using unsupervised recursive cascaded full-resolution residual networks
Xu, Lei
Jiang, Ping
Tsui, Tiffany
Liu, Junyan
Zhang, Xiping
Yu, Lequan
Niu, Tianye
BIOENGINEERING & TRANSLATIONAL MEDICINE, 2023, 8 (06)
[34] IMAGE TERMINAL GUIDANCE BASED ON YOLO V2 FRAMEWORK
Lan Yixing
Peng Ke
Zhang Weihua
Liu Xuancen
FOURTH IAA CONFERENCE ON DYNAMICS AND CONTROL OF SPACE SYSTEMS 2018, PTS I-III, 2018, 165 : 651 - 666
[35] Snapshot compressive imaging based digital image correlation: temporally super-resolved full-resolution deformation measurement
Chen, Wenwu
Zhang, Bo
Gu, Liuning
Liu, Haibo
Suo, Jinli
Shao, Xinxing
OPTICS EXPRESS, 2022, 30 (19): : 33554 - 33573
[36] Cross-domain Correspondence Learning for Exemplar-based Image Translation
Zhang, Pan
Zhang, Bo
Chen, Dong
Yuan, Lu
Wen, Fang
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5142 - 5152
[37] HIGH-RESOLUTION INFRARED-SPECTRA OF V1 + V2 AND V2 + V3 BANDS OF H2 O-16
FLAUD, JM
VALENTIN, A
CAMYPEYR.C
JOURNAL DE PHYSIQUE, 1972, 33 (8-9): : 741 - &
[38] Physiological correlates of perceptual learning in monkey V1 and V2
Ghose, GM
Yang, TM
Maunsell, JHR
JOURNAL OF NEUROPHYSIOLOGY, 2002, 87 (04) : 1867 - 1888
[39] DIIK-Net: A full-resolution cross-domain deep interaction convolutional neural network for MR image reconstruction
Liu, Yu
Pang, Yanwei
Liu, Xiaohan
Liu, Yiming
Nie, Jing
NEUROCOMPUTING, 2023, 517 : 213 - 222
[40] fMRI Reveals Visual Statistical Learning in Macaque V2
Vergnieux, Victor
Vogels, Rufin
PERCEPTION, 2019, 48 : 92 - 92

← 1 2 3 4 5 →