Quality Enhancement of Screen Content Video using Dual-input CNN

Authors
Huang, Ziyin [1]
Cao, Yue [1]
Tsang, Sik-Ho [2]
Chan, Yui-Lam [1]
Lam, Kin-Man [1]
Affiliations
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Hong Kong, Peoples R China
[2] Ctr Adv Reliabil & Safety Ltd CAiRS, Hong Kong Sci Pk, Hong Kong, Peoples R China
Keywords
convolutional neural network; deep learning; HEVC; quality enhancement; SCC
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, video quality enhancement techniques have advanced significantly, from traditional methods such as the deblocking filter (DF) and sample adaptive offset (SAO) to deep learning-based approaches. Although screen content coding (SCC) has become an important extension of High Efficiency Video Coding (HEVC), existing approaches mainly focus on improving the quality of natural sequences in HEVC rather than screen content (SC) sequences in SCC. We therefore propose a dual-input model for quality enhancement in SCC. The first input feeds the main branch, which takes the image; the second feeds the mask branch, which takes side information extracted from the coded bitstream. Specifically, the mask branch uses coding unit (CU) information and mode information as input to help the convolutional network in the main branch further improve video quality and, in turn, coding efficiency. Moreover, because few SC videos are available, a new SCC dataset, named PolyUSCC, is established. Compared with conventional SCC, adding our mask branch to two state-of-the-art models, DnCNN and DCAD, further reduces BD-rate by 3.81% and 3.07%, respectively.
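The abstract does not say how the CU and mode information are encoded for the mask branch. As a rough illustration only, the sketch below rasterizes a list of CUs into two planes aligned with the frame (a CU-boundary plane and a mode plane) and pairs them with the decoded image. The CU record layout, the mode numbering, and the names `build_mask_planes` and `stack_inputs` are all hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

# Hypothetical CU record: (x, y, size, mode_id). The field layout and the
# mode numbering are assumptions for illustration, not the paper's format.
CU = Tuple[int, int, int, int]


def build_mask_planes(width: int, height: int, cus: List[CU]):
    """Rasterize CU side information into two planes aligned with the frame.

    Returns (boundary, mode): boundary[y][x] is 1 on CU edges and 0 elsewhere;
    mode[y][x] holds the mode id of the CU covering that pixel (0 if none).
    """
    boundary = [[0] * width for _ in range(height)]
    mode = [[0] * width for _ in range(height)]
    for x0, y0, size, mode_id in cus:
        # Clip each CU to the frame so partial CUs at the border are handled.
        x1 = min(x0 + size, width)
        y1 = min(y0 + size, height)
        for y in range(y0, y1):
            for x in range(x0, x1):
                mode[y][x] = mode_id
                if x in (x0, x1 - 1) or y in (y0, y1 - 1):
                    boundary[y][x] = 1
    return boundary, mode


def stack_inputs(frame, boundary, mode):
    """Pair the decoded frame (main-branch input) with the mask planes."""
    return {"main": frame, "mask": [boundary, mode]}


# Usage: an 8x8 frame covered by four 4x4 CUs with assumed mode ids 1, 2, 3, 1.
frame = [[128] * 8 for _ in range(8)]
cus = [(0, 0, 4, 1), (4, 0, 4, 2), (0, 4, 4, 3), (4, 4, 4, 1)]
boundary, mode = build_mask_planes(8, 8, cus)
inputs = stack_inputs(frame, boundary, mode)
```

In a dual-input network of the kind the abstract describes, `inputs["main"]` would go to the main branch and `inputs["mask"]` to the mask branch; how the two feature streams are fused is not specified in the abstract and is left out here.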
Pages: 797-803
Number of pages: 7