Real-time object segmentation and coding for selective-quality video communications

被引:4
|
作者
Challapali, K
Brodsky, T
Lin, YT
Yan, Y
Chen, RY
机构
[1] Philips Res, Briarcliff Manor, NY 10510 USA
[2] ActivEye Inc, Briarcliff Manor, NY 10510 USA
[3] Polycom Inc, Austin, TX 78746 USA
关键词
MPEG-4; multi-object coding; object segmentation; rate control; real-time content extraction;
D O I
10.1109/TCSVT.2004.828337
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The MPEG-4 standard enables the representation of video as a collection of objects. This paper describes an automatic system that exploits such a representation. Our system consists of two parts: real-time content extraction algorithms and a real-time multi-object rate control method. We present two approaches to content extraction: foreground segmentation based on two cameras and face segmentation based on a single camera. The main contributions of this paper are: 1) under a stereo camera setup, we improve a disparity estimation algorithm to obtain crisp and smooth boundaries of foreground objects; 2) for a single camera scenario, we propose a novel algorithm for face detection and tracking, combining facial color and structure information; and 3) we develop a constant-quality variable bitrate (CQ-VBR) control algorithm that guarantees the quality specification for each object obtained from the two content extraction methods. Both segmentation algorithms run in real-time on a low-cost media processor, and have been tested extensively in various indoor environments. The CQ-VBR control algorithm is a useful tool for the evaluation of object-based coding. For low-bit-rate applications, we can achieve significant reduction in the overall bitrate, while maintaining the same visual quality of the foreground/face object as compared to conventional frame-based coding. Based on tests conducted on several sequences of different complexity levels, the bit-rate savings can be up to 48%. The satisfactory foreground segmentation (results presented) permits porting a live foreground object into arbitrary scenes to create composite video.
引用
收藏
页码:813 / 824
页数:12
相关论文
共 50 条
  • [31] Adaptive quality control for real-time MPEG-4 video communications
    Chuaywong, S
    Kamolphiwong, S
    Kamolphiwong, T
    [J]. International Symposium on Communications and Information Technologies 2005, Vols 1 and 2, Proceedings, 2005, : 299 - 303
  • [32] A PARALLEL ARCHITECTURE FOR REAL-TIME VIDEO CODING
    DESA, L
    SILVA, V
    PERDIGAO, F
    FARIA, S
    ASSUNCAO, P
    [J]. MICROPROCESSING AND MICROPROGRAMMING, 1990, 30 (1-5): : 439 - 445
  • [33] Complexity control for real-time video coding
    Akyol, Emrah
    Mukherjee, Debargha
    Liu, Yuxin
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 77 - +
  • [34] Real-time segmentation of video on a multiprocessor platform
    Arapis, C
    Gibbs, S
    Breiteneder, C
    [J]. PARALLEL COMPUTING, 1997, 23 (12) : 1777 - 1792
  • [35] A real-time object detection algorithm for video
    Lu, Shengyu
    Wang, Beizhan
    Wang, Hongji
    Chen, Lihao
    Ma Linjian
    Zhang, Xiaoyan
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2019, 77 : 398 - 408
  • [36] Real-time Moving Object Tracking in Video
    Kodjo, Amedome Min-Dianey
    Yang Jinhua
    [J]. 2012 INTERNATIONAL CONFERENCE ON OPTOELECTRONICS AND MICROELECTRONICS (ICOM), 2012, : 580 - 584
  • [37] Real-Time Prediction of Segmentation Quality
    Robinson, Robert
    Oktay, Ozan
    Bai, Wenjia
    Valindria, Vanya V.
    Sanghvi, Mihir M.
    Aung, Nay
    Paiva, Jose M.
    Zemrak, Filip
    Fung, Kenneth
    Lukaschuk, Elena
    Lee, Aaron M.
    Carapella, Valentina
    Kim, Young Jin
    Kainz, Bernhard
    Piechnik, Stefan K.
    Neubauer, Stefan
    Petersen, Steffen E.
    Page, Chris
    Rueckert, Daniel
    Glocker, Ben
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2018, PT IV, 2018, 11073 : 578 - 585
  • [38] Error resilient video coding schemes for real-time and low-bitrate mobile communications
    Imura, K
    Machida, Y
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 1999, 14 (6-8) : 519 - 530
  • [39] Real-time video quality monitoring
    Liu, Tao
    Narvekar, Niranjan
    Wang, Beibei
    Ding, Ran
    Zou, Dekun
    Cash, Glenn
    Bhagavathy, Sitaram
    Bloom, Jeffrey
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2011, : 1 - 18
  • [40] Real-time video quality monitoring
    Tao Liu
    Niranjan Narvekar
    Beibei Wang
    Ran Ding
    Dekun Zou
    Glenn Cash
    Sitaram Bhagavathy
    Jeffrey Bloom
    [J]. EURASIP Journal on Advances in Signal Processing, 2011