AUTOMATIC FACE LOCATION DETECTION AND TRACKING FOR MODEL-ASSISTED CODING OF VIDEO TELECONFERENCING SEQUENCES AT LOW BIT-RATES

被引:75
|
作者
ELEFTHERIADIS, A [1 ]
JACQUIN, A [1 ]
机构
[1] COLUMBIA UNIV,DEPT ELECT ENGN,NEW YORK,NY 10027
关键词
FACE TRACKING; MODEL-BASED CODING; TELECONFERENCING; VIDEO CODING;
D O I
10.1016/0923-5965(95)00028-U
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a novel and practical way to integrate techniques from computer vision to low bit-rate coding systems for video teleconferencing applications. Our focus is to locate and track the faces of persons in typical head-and-shoulders video sequences, and to exploit the face location information in a 'classical' video coding/decoding system, The motivation is to enable the system to selectively encode various image areas and to produce psychologically pleasing coded images where faces are sharper, We refer to this approach as model-assisted coding. We propose a totally automatic, low-complexity algorithm, which robustly performs face detection and tracking. A priori assumptions regarding sequence content are minimal and the algorithm operates accurately even in cases of partial occlusion by moving objects. Face location information is exploited by a low bit-rate 3D subband-based video coder which uses both a novel model-assisted pixel-based motion compensation scheme, as well as model-assisted dynamic bit allocation with object-selective quantization. By transferring a small fraction of the total available bit-rate from the non-facial to the facial area, the coder produces images with better-rendered facial features. The improvement was found to be perceptually significant on video sequences coded at 96 kbps for an input luminance signal in CIF format, The technique is applicable to any video coding scheme that allows for fine-grain quantizer selection (e.g. MPEG, H.261), and can maintain full decoder compatibility.
引用
收藏
页码:231 / 248
页数:18
相关论文
共 13 条
  • [1] AUTOMATIC FACE LOCATION DETECTION FOR MODEL-ASSISTED RATE CONTROL IN H.261-COMPATIBLE CODING OF VIDEO
    ELEFTHERIADIS, A
    JACQUIN, A
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 1995, 7 (4-6) : 435 - 455
  • [2] Automatic face location detection and tracking for model-based video coding
    Ngan, KN
    Rudianto, RL
    [J]. ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 1098 - 1101
  • [3] Fast video coding at low bit-rates for mobile devices
    Jindal, M
    Prasad, RSV
    Ramkishor, K
    [J]. ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 483 - 487
  • [4] Contourless region-based video coding for very low bit-rates
    Salgado, L
    Garcia, N
    Menendez, JM
    Rendon, E
    [J]. 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 299 - 303
  • [5] Motion-adaptive modelling of scene content for very low bit rate model-assisted coding of video
    Rabiner, W
    Jacquin, A
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 1997, 8 (03) : 250 - 262
  • [6] Video coding of model-based at very low bit rates
    Fu, XP
    Wang, Z
    [J]. VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2003, PTS 1-3, 2003, 5150 : 1224 - 1231
  • [7] Spatio-temporal model-assisted very low-bit-rate coding with compatibility
    Lee, JB
    Eleftheriadis, A
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (12) : 1517 - 1532
  • [8] A Long Term Harmonic plus Noise Model for Narrow-Band Speech Coding at Very Low Bit-Rates
    Ben Ali, Faten
    Djaziri-Larbi, Sonia
    [J]. 2017 40TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2017, : 372 - 376
  • [9] A low-complexity face-assisted coding scheme for low bit-rate video telephony
    Lin, Chia-Wen
    Chang, Yao-Jen
    Chen, Yung-Chang
    [J]. IEICE Transactions on Information and Systems, 2003, E86-D (01) : 101 - 108
  • [10] A low-complexity face-assisted coding scheme for low bit-rate video telephony
    Lin, CW
    Chang, YJ
    Chen, YC
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (01): : 101 - 108