Enabling Translatability of Generative Face Video Coding: A Unified Face Feature Transcoding Framework

被引:2
|
作者
Yin, Shanzhi [1 ]
Chen, Bolin [1 ]
Wang, Shiqi [1 ]
Ye, Yan [2 ]
机构
[1] City Univ Hong Kong, Hong Kong, Peoples R China
[2] Alibaba Grp, Hangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/DCC58796.2024.00019
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Generative face video coding (GFVC) can achieve high-quality visual face communication at ultra-low bit-rate ranges via strong facial prior learning and realistic generation. However, different kinds of feature representations hinder the interoperability of GFVC, as the bitstream generated from one type of feature representation can only be correctly understood by the corresponding decoder. In this paper, we make the first attempt to propose a face feature transcoding framework that enables translatability in GFVC. By integrating a face feature transcoder at the decoder side, received face features can be translated to decoder-specific ones for subsequent face reconstruction. Furthermore, the translation between different types of face features can be achieved using a unified transcoding framework, facilitating seamless interoperability between different facial representations and their associated decoders. Experimental results demonstrate that three main-stream GFVC codecs, each utilizing different face features, can be effectively adapted to one another while retaining promising coding performance, largely extending the generality of the GFVC system. The project page can be found at https://github.com/xyzysz/GFVC_Software-Decoder_Interoperability.
引用
收藏
页码:113 / 122
页数:10
相关论文
共 50 条
  • [21] MobileFaceSwap: A Lightweight Framework for Video Face Swapping
    Xu, Zhiliang
    Hong, Zhibin
    Ding, Changxing
    Zhu, Zhen
    Han, Junyu
    Liu, Jingtuo
    Ding, Errui
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2973 - 2981
  • [22] Feature-Improving Generative Adversarial Network for Face Frontalization
    Rong, Changle
    Zhang, Xingming
    Lin, Yubei
    IEEE ACCESS, 2020, 8 : 68842 - 68851
  • [23] Face Feature Weighted Fusion Based on Fuzzy Membership Degree for Video Face Recognition
    Choi, Jae Young
    Plataniotis, Konstantinos N.
    Ro, Yong Man
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (04): : 1270 - 1282
  • [24] KIEGLFN: A unified acne grading framework on face images
    Lin, Yi
    Jiang, Jingchi
    Ma, Zhaoyang
    Chen, Dongxin
    Guan, Yi
    You, Haiyan
    Cheng, Xue
    Liu, Bingmei
    Luo, Gongning
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 221
  • [25] Unified Framework of Face Hallucination across Multiple Modalities
    Ma, Xiang
    Liu, Junhui
    Li, Wenmin
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2014), 2015, 9445
  • [26] Face detection for multidimensional description adaptive video coding
    Jiang, Xiaojun
    Shi, Yunhui
    Sun, Yanfeng
    Yin, Baocai
    Niu, Xiuyan
    Journal of Information and Computational Science, 2008, 5 (05): : 2361 - 2368
  • [27] Geometry Guided Feature Aggregation in Video Face Recognition
    Peng, Baoyun
    Jin, Xiao
    Wu, Yichao
    Li, Dongsheng
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2670 - 2677
  • [28] Face Recognition from Still Images to Video Sequences: A Local-Feature-Based Framework
    Chen, Shaokang
    Mau, Sandra
    Harandi, Mehrtash T.
    Sanderson, Conrad
    Bigdeli, Abbas
    Lovell, Brian C.
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2011,
  • [29] Face Recognition from Still Images to Video Sequences: A Local-Feature-Based Framework
    Shaokang Chen
    Sandra Mau
    Mehrtash T. Harandi
    Conrad Sanderson
    Abbas Bigdeli
    Brian C. Lovell
    EURASIP Journal on Image and Video Processing, 2011
  • [30] Beyond Keypoint Coding: Temporal Evolution Inference with Compact Feature Representation for Talking Face Video Compression
    Chen, Bolin
    Wang, Zhao
    Li, Binzhe
    Lin, Rongqun
    Wang, Shiqi
    Ye, Yan
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 13 - 22