3-D model-based frame interpolation for distributed video coding of static scenes

被引:6
|
作者
Maitre, Matthieu [1 ]
Guillemot, Christine
Morin, Luce
机构
[1] Univ Illinois, Beckman Inst, Urbana, IL 61801 USA
[2] Inst Rech Informat & Syst Aleatoires, F-35042 Rennes, France
关键词
distributed source coding (DSC); distributed video coding (DVC); image-based rendering (IBR); motion-adaptive key frames; point tracking; structure-from-motion (SfM);
D O I
10.1109/TIP.2007.894272
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper addresses the problem of side information extraction for distributed coding of videos captured by a camera moving in a 3-D static environment. Examples of targeted applications are augmented reality, remote-controlled robots operating in hazardous environments, or remote exploration by drones. It explores the benefits of the structure-from-motion paradigm for distributed coding of this type of video content. Two interpolation methods constrained by the scene geometry, based either on block matching along epipolar lines or on 3-D mesh fitting, are first developed. These techniques are based on a robust algorithm for sub-pel matching of feature points, which leads to semi-dense correspondences between key frames. However, their rate-distortion (RD) performances are limited by misalignments between the side information and the actual Wyner-Ziv (WZ) frames due to the assumption of linear motion between key frames. To cope with this problem, two feature point tracking techniques are introduced, which recover the camera parameters of the WZ frames. A first technique, in which the frames remain encoded separately, performs tracking at the decoder and leads to significant RD performance gains. A second technique further improves the RD performances by allowing a limited tracking at the encoder. As an additional benefit, statistics on tracks allow the encoder to adapt the key frame frequency to the video motion content.
引用
收藏
页码:1246 / 1257
页数:12
相关论文
共 50 条
  • [1] Towards 3-D model-based video coding
    Soligon, O
    Le Mehaute, A
    Roux, C
    [J]. ANNALES DES TELECOMMUNICATIONS-ANNALS OF TELECOMMUNICATIONS, 1998, 53 (5-6): : 229 - 241
  • [2] Fast and automatic facial feature extraction for 3-D model-based video coding
    Sheng, Y
    Sadka, AH
    Kondoz, AM
    [J]. Digital Media: Processing Multimedia Interactive Services, 2003, : 387 - 390
  • [3] MODEL-BASED RATE ALLOCATION IN DISTRIBUTED VIDEO CODING SYSTEMS
    Priziment, Yevgeny
    Malah, David
    [J]. 2008 IEEE 25TH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, VOLS 1 AND 2, 2008, : 701 - 705
  • [4] Stereo video coding based on frame estimation and interpolation
    Luo, Y
    Zhang, ZY
    An, P
    [J]. IEEE TRANSACTIONS ON BROADCASTING, 2003, 49 (01) : 14 - 21
  • [5] 3D SCENE MODEL BASED FRAME PREDICTION IN VIDEO CODING
    Golestani, Hossein Bakhshi
    Wien, Mathias
    Ohm, Jens-Rainer
    [J]. 2017 INTERNATIONAL CONFERENCE ON 3D IMMERSION (IC3D), 2017,
  • [6] Model-Based Joint Bit Allocation Between Texture Videos and Depth Maps for 3-D Video Coding
    Yuan, Hui
    Chang, Yilin
    Huo, Junyan
    Yang, Fuzheng
    Lu, Zhaoyang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (04) : 485 - 497
  • [7] Video Frame Interpolation Using 3-D Total Variation Regularized Completion
    Yu, Zhefei
    Wang, Zhangyang
    Hu, Zeng
    Ling, Qing
    Li, Houqiang
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 857 - 860
  • [8] Using a 3-D shape model for video coding
    Chang, CL
    Eisert, P
    Girod, B
    [J]. VISION MODELING, AND VISUALIZATION 2002, PROCEEDINGS, 2002, : 291 - 297
  • [9] Automatic 3-d face model adaptation for model-based coding of videophone sequences
    Kampmann, M
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2002, 12 (03) : 172 - 182
  • [10] 3-D MOTION ESTIMATION IN MODEL-BASED FACIAL IMAGE-CODING
    LI, HB
    ROIVAINEN, P
    FORCHHEIMER, R
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1993, 15 (06) : 545 - 555