Tracking a face for knowledge-based coding of videophone sequences

被引:10
|
作者
Zhang, L
机构
关键词
face tracking; face model; global head motion compensation; shape update;
D O I
10.1016/S0923-5965(97)00020-9
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Tracking a face is one of the important topics for knowledge-based coding of videophone sequences and also for the representation of 3D objects within MPEG-4 Synthetic/Natural Hybrid Coding (SNHC). Up to now, the face model has been tracked by global head motion compensation. Because the 3D head model shape affects the accuracy of motion estimation, an inaccurate head model shape reduces the accuracy of face tracking. In this paper, a new algorithm for tracking a face combining global head motion compensation and the update of the face model Candide during the sequence is proposed. As a first stage of the proposed algorithm, face tracking only by global head motion compensation is used. After that, the 2D center positions of the eyes and the mouth of a person in the image sequence are estimated using template matching and feature point extraction techniques. Then, the shape of the face model Candide is updated during the sequence using these estimated 2D center positions. This proposed algorithm has been applied to typical videophone sequences with a spatial resolution corresponding to CIF and a frame rate of 10 Hz. For evaluation, error criteria have been introduced which give position errors of the eyes and the mouth averaged over a whole sequence. The experimental results show that the proposed algorithm reduces the average position errors for the eyes and the mouth by 48% and 53%, respectively, compared to face tracking by global head motion compensation only. (C) 1997 Elsevier Science B.V.
引用
收藏
页码:93 / 114
页数:22
相关论文
共 50 条
  • [1] Segmentation of a head into face, ears, neck and hair for knowledge-based analysis-synthesis coding of videophone sequences
    Kampmann, M
    1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 2, 1998, : 876 - 880
  • [2] Local motion tracking in semantic-based coding of videophone sequences
    Antoszczyszyn, PM
    Hannah, JM
    Grant, PM
    SIXTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND ITS APPLICATIONS, VOL 1, 1997, (443): : 46 - 50
  • [3] VQ CODING FOR VIDEOPHONE APPLICATIONS ADOPTING KNOWLEDGE-BASED TECHNIQUES - IMPLEMENTATION ON PARALLEL ARCHITECTURES
    BRACCINI, C
    GRATTAROLA, A
    LAVAGETTO, F
    ZAPPATORE, S
    EUROPEAN TRANSACTIONS ON TELECOMMUNICATIONS, 1992, 3 (02): : 137 - 144
  • [4] VQ coding for videophone applications adopting knowledge-based techniques. Implementation on parallel architectures
    Braccini, Carlo
    Grattarola, Aldo
    Lavagetto, Fabio
    Zappatore, Sandro
    European transactions on telecommunications and related technologies, 1992, 3 (02): : 45 - 52
  • [5] Automatic 3-d face model adaptation for model-based coding of videophone sequences
    Kampmann, M
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2002, 12 (03) : 172 - 182
  • [6] Automatic adaptation of a face model using action units for semantic coding of videophone sequences
    Zhang, L
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1998, 8 (06) : 781 - 795
  • [7] Fuzzy-controlled perceptual coding of videophone sequences
    Leone, A
    Bellini, A
    Guerrieri, R
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1997, 5 (02) : 294 - 303
  • [8] Image segmentation for facial image coding of videophone sequences
    Herodotou, N
    Venetsanopoulos, AN
    DSP 97: 1997 13TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2: SPECIAL SESSIONS, 1997, : 233 - 236
  • [9] Synthesis of facial expressions for semantic coding of videophone sequences
    Kampmann, M
    Nagel, B
    COMPUTER GRAPHICS INTERNATIONAL, PROCEEDINGS, 1998, : 512 - 519
  • [10] Commonsense knowledge-based face detection
    Kouzani, AZ
    He, F
    Sammut, K
    INES'97 : 1997 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT ENGINEERING SYSTEMS, PROCEEDINGS, 1997, : 215 - 220