Video Harmonization with Triplet Spatio-Temporal Variation Patterns

Authors
Guo, Zonghui [1]
Han, Xinyu [2]
Zhang, Jie [1,3]
Shan, Shiguang [1,3]
Zheng, Haiyong [2]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Ocean Univ China, Coll Elect Engn, Qingdao, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
DOI
10.1109/CVPR52733.2024.01814
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Video harmonization is an important and challenging task that aims to obtain visually realistic composite videos by automatically adjusting the foreground's appearance to harmonize with the background. Inspired by the short-term and long-term gradual adjustment process of manual harmonization, we present a Video Triplet Transformer framework that models three spatio-temporal variation patterns within videos, i.e., short-term spatial, long-term global, and long-term dynamic, for video-to-video tasks such as video harmonization. Specifically, for short-term harmonization, we adjust the foreground appearance to be consistent with the background in the spatial dimension based on neighboring frames; for long-term harmonization, we not only explore global appearance variations to enhance temporal consistency but also alleviate motion offset constraints to align similar contextual appearances dynamically. Extensive experiments and ablation studies demonstrate the effectiveness of our method, which achieves state-of-the-art performance in video harmonization, video enhancement, and video demoireing tasks. We also propose a temporal consistency metric to better evaluate the harmonized videos. Code is available at https://github.com/zhenglab/VideoTripletTransformer.
Pages: 19177-19186
Page count: 10