Region-of-Interest Based Conversational HEVC Coding with Hierarchical Perception Model of Face

被引:71
|
作者
Xu, Mai [1 ]
Deng, Xin [1 ]
Li, Shengxi [1 ]
Wang, Zulin [1 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
基金
美国国家科学基金会;
关键词
HEVC; perceptual video compression; teleconferencing; rate distortion; FOVEATED VIDEO; COMMUNICATION;
D O I
10.1109/JSTSP.2014.2314864
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a region-of-interest (ROI) based HEVC coding approach for conversational videos, with a novel hierarchical perception model of face (HP model), to improve the perceived visual quality of state-of-the-art HEVC standard. In contrast to the previous ROI-based video coding approaches, this novel HP model allows the unequal importance of facial features (e. g., the eyes and mouth) within the facial region, by generating a pixel-wise weight map. Benefitting from such a perception model, the adaptive coding tree unit (CTU) partition structure is developed to alleviate the encoding complexity of HEVC, without any degradation of the visual quality in facial regions, especially in the regions of facial features. Subsequently, for the rate control in HEVC a weight-based unified rate-quantization (URQ) scheme, instead of the conventional pixel-based URQ scheme, is proposed to adaptively adjust the value of quantization parameter (QP). Such an adaptive adjustment of QPs is capable of allocating more bits to the face/facial features with respect to our HP model, and as a result, the visual quality of face, in particular facial features, can be enhanced for conversational HEVC coding. Finally, the experimental results show that the perceived visual quality of our approach is greatly improved, with even less encoding time, for conversational video coding on the HEVC platform.
引用
收藏
页码:475 / 489
页数:15
相关论文
共 50 条
  • [1] Region-of-Interest Coding based on Fovea and Hierarchical Trees
    Galan-Hernandez, J. C.
    Alarcon-Aquino, V.
    Ramirez-Cortes, J. M.
    Starostenko, O.
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2013, 42 (04): : 343 - 352
  • [2] Region-of-interest video coding based on face detection
    Chen, JW
    Chen, MJ
    Chi, MC
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002, PROCEEDING, 2002, 2532 : 1201 - 1211
  • [3] Complexity Control of HEVC Based on Region-of-Interest Attention Model
    Deng, Xin
    Xu, Mai
    Li, Shengxi
    Wang, Zulin
    [J]. 2014 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING CONFERENCE, 2014, : 225 - 228
  • [4] Region-of-interest coding based on set partitioning in hierarchical trees
    Park, KH
    Park, HW
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2002, 12 (02) : 106 - 113
  • [5] Region-of-interest coding based on set partitioning in hierarchical trees
    Park, KH
    Lee, CS
    Park, HW
    [J]. 2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2001, : 804 - 807
  • [6] Region-of-interest image coding based on EBCOT
    Yang, H
    Long, M
    Tai, HM
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2005, 152 (05): : 590 - 596
  • [7] Hierarchical region-of-interest detection
    Lin, Huibao
    Si, Jennie
    Abousleman, Glen P.
    [J]. OPTICAL ENGINEERING, 2006, 45 (07)
  • [8] REGION-OF-INTEREST ENCRYPTION IN HEVC COMPRESSED VIDEO
    Tew, Yiqi
    Wong, KokSheik
    Phan, Raphael C. -W.
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2016, : 3 - 4
  • [9] Lossy/lossless Region-of-Interest image coding based on set partitioning in hierarchical trees
    Atsumi, E
    Farvardin, N
    [J]. 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 1, 1998, : 87 - 91
  • [10] Face Region Based Conversational Video Coding
    Xiong, Bing
    Fan, Xiaojiu
    Zhu, Ce
    Jing, Xuan
    Peng, Qiang
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (07) : 917 - 931