Region-of-Interest Based Conversational HEVC Coding with Hierarchical Perception Model of Face

被引:71
|
作者
Xu, Mai [1 ]
Deng, Xin [1 ]
Li, Shengxi [1 ]
Wang, Zulin [1 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
基金
美国国家科学基金会;
关键词
HEVC; perceptual video compression; teleconferencing; rate distortion; FOVEATED VIDEO; COMMUNICATION;
D O I
10.1109/JSTSP.2014.2314864
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a region-of-interest (ROI) based HEVC coding approach for conversational videos, with a novel hierarchical perception model of face (HP model), to improve the perceived visual quality of state-of-the-art HEVC standard. In contrast to the previous ROI-based video coding approaches, this novel HP model allows the unequal importance of facial features (e. g., the eyes and mouth) within the facial region, by generating a pixel-wise weight map. Benefitting from such a perception model, the adaptive coding tree unit (CTU) partition structure is developed to alleviate the encoding complexity of HEVC, without any degradation of the visual quality in facial regions, especially in the regions of facial features. Subsequently, for the rate control in HEVC a weight-based unified rate-quantization (URQ) scheme, instead of the conventional pixel-based URQ scheme, is proposed to adaptively adjust the value of quantization parameter (QP). Such an adaptive adjustment of QPs is capable of allocating more bits to the face/facial features with respect to our HP model, and as a result, the visual quality of face, in particular facial features, can be enhanced for conversational HEVC coding. Finally, the experimental results show that the perceived visual quality of our approach is greatly improved, with even less encoding time, for conversational video coding on the HEVC platform.
引用
收藏
页码:475 / 489
页数:15
相关论文
共 50 条
  • [21] Region-of-interest based rate control for UAV video coding
    Zhao C.-L.
    Dai M.
    Xiong J.-Y.
    [J]. Optoelectronics Letters, 2016, 12 (3) : 216 - 220
  • [22] Automatic face detection and tracking for H.263 compatible region-of-interest coding
    Menser, B
    Wien, M
    [J]. IMAGE AND VIDEO COMMUNICATIONS AND PROCESSING 2000, 2000, 3974 : 882 - 891
  • [23] Research on Application of Region-of-Interest in Face Detection
    Zhou, Deng-feng
    Ye, Shui-sheng
    Hu, Shao-hua
    [J]. 2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 192 - 194
  • [24] New region-of-interest image coding method
    Wang, Ruixin
    Wu, Jin
    Chen, Min
    Chen, Jingjing
    [J]. REMOTE SENSING AND GIS DATA PROCESSING AND APPLICATIONS; AND INNOVATIVE MULTISPECTRAL TECHNOLOGY AND APPLICATIONS, PTS 1 AND 2, 2007, 6790
  • [25] Region-of-Interest Image Coding Based on Perceptually Optimized Bitplane Realignment
    Zhang, Yan
    Gu, Hai-ming
    [J]. ICECT: 2009 INTERNATIONAL CONFERENCE ON ELECTRONIC COMPUTER TECHNOLOGY, PROCEEDINGS, 2009, : 495 - 498
  • [26] Region-of-Interest Based Pixel Domain Wyner-Ziv Coding
    Jung, Chunsung
    Jun, Dongsan
    Oh, Jieun
    Park, Hyunwook
    Ha, Jeongseok
    [J]. MILITARY COMMUNICATIONS CONFERENCE, 2010 (MILCOM 2010), 2010, : 283 - 286
  • [27] STACKELBERG GAME BASED RATE ALLOCATION FOR HEVC REGION OF INTEREST CODING
    Liu, Zizheng
    Pan, Xiang
    Li, Yiming
    Chen, Zhenzhong
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [28] Foreground/background bit allocation for region-of-interest coding
    Chai, D
    Ngan, KN
    Bouzerdoum, A
    [J]. 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2000, : 923 - 926
  • [29] Multiple region-of-interest support in scalable video coding
    Bae, TM
    Thang, TC
    Kim, DY
    Ro, YM
    Kang, JW
    Kim, JG
    [J]. ETRI JOURNAL, 2006, 28 (02) : 239 - 242
  • [30] Error-resilient region-of-interest video coding
    Jerbi, A
    Wang, H
    Shirani, S
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (09) : 1175 - 1181