Video Colorization Based on Variational Autoencoder

被引:0
|
作者
Zhang, Guangzi [1 ]
Hong, Xiaolin [1 ]
Liu, Yan [1 ]
Qian, Yulin [1 ]
Cai, Xingquan [1 ]
机构
[1] North China Univ Technol, Sch Informat Sci & Technol, Beijing 100144, Peoples R China
关键词
video colorization; temporal consistency; variational autoencoder;
D O I
10.3390/electronics13122412
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces a variational autoencoder network designed for video colorization using reference images, addressing the challenge of colorizing black-and-white videos. Although recent techniques perform well in some scenarios, they often struggle with color inconsistencies and artifacts in videos that feature complex scenes and long durations. To tackle this, we propose a variational autoencoder framework that incorporates spatio-temporal information for efficient video colorization. To improve temporal consistency, we unify semantic correspondence with color propagation, allowing for simultaneous guidance in colorizing grayscale video frames. Additionally, the variational autoencoder learns spatio-temporal feature representations by mapping video frames into a latent space through an encoder network. The decoder network then transforms these latent features back into color images. Compared to traditional coloring methods, our approach accurately captures temporal relationships between video frames, providing precise colorization while ensuring video consistency. To further enhance video quality, we apply a specialized loss function that constrains the generated output, ensuring that the colorized video remains spatio-temporally consistent and natural. Experimental results demonstrate that our method significantly improves the video colorization process.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Network Embedding via Community Based Variational Autoencoder
    Shi, Wei
    Huang, Ling
    Wang, Chang-Dong
    Li, Juan-Hui
    Tang, Yong
    Fu, Chengzhou
    [J]. IEEE ACCESS, 2019, 7 : 25323 - 25333
  • [42] Automatic video colorization based on contrastive learning and optical flow
    Xiao S.
    Wang Y.
    Wang Y.
    [J]. Multimedia Tools and Applications, 2024, 83 (21) : 59985 - 60001
  • [43] Video anomaly detection and localization via Gaussian Mixture Fully Convolutional Variational Autoencoder
    Fan, Yaxiang
    Wen, Gongjian
    Li, Deren
    Qiu, Shaohua
    Levine, Martin D.
    Xiao, Fei
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 195
  • [44] Unsupervised Anomaly Video Detection via a Double-Flow ConvLSTM Variational Autoencoder
    Wang, Lin
    Tan, Haishu
    Zhou, Fuqiang
    Zuo, Wangxia
    Sun, Pengfei
    [J]. IEEE ACCESS, 2022, 10 : 44278 - 44289
  • [45] Video colorization method based on fusion of multi-source colorization results using dual reference frames
    Meng, Hua
    Tang, Jinhui
    Dai, Longquan
    [J]. Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition), 2024, 54 (01): : 183 - 191
  • [46] Laughter synthesis: A comparison between Variational autoencoder and Autoencoder
    Mansouri, Nadia
    Lachiri, Zied
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP'2020), 2020,
  • [48] SVCNet: Scribble-Based Video Colorization Network With Temporal Aggregation
    Zhao, Yuzhi
    Po, Lai-Man
    Liu, Kangcheng
    Wang, Xuehui
    Yu, Wing-Yin
    Xian, Pengfei
    Zhang, Yujia
    Liu, Mengyang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4443 - 4458
  • [49] A fast colorization algorithm for infrared video
    He, Mengchi
    Gu, Xiaojing
    Gu, Xingsheng
    [J]. Communications in Computer and Information Science, 2014, 462 : 282 - 292
  • [50] A Fast Colorization Algorithm for Infrared Video
    He, Mengchi
    Gu, Xiaojing
    Gu, Xingsheng
    [J]. COMPUTATIONAL INTELLIGENCE, NETWORKED SYSTEMS AND THEIR APPLICATIONS, 2014, 462 : 282 - 292