Intra Prediction Method for Depth Video Coding by Block Clustering through Deep Learning

被引:0
|
作者
Lee, Dong-seok [1 ]
Kwon, Soon-kak [2 ]
机构
[1] Dong Eui Univ, AI Grand ICT Res Ctr, Busan 47340, South Korea
[2] Dong Eui Univ, Dept Comp Software Engn, Busan 47340, South Korea
基金
新加坡国家研究基金会;
关键词
intra prediction; depth video coding; deep learning; 1D CNN; clustering; COMPRESSION; NETWORK;
D O I
10.3390/s22249656
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In this paper, we propose an intra-picture prediction method for depth video by a block clustering through a neural network. The proposed method solves a problem that the block that has two or more clusters drops the prediction performance of the intra prediction for depth video. The proposed neural network consists of both a spatial feature prediction network and a clustering network. The spatial feature prediction network utilizes spatial features in vertical and horizontal directions. The network contains a 1D CNN layer and a fully connected layer. The 1D CNN layer extracts the spatial features for a vertical direction and a horizontal direction from a top block and a left block of the reference pixels, respectively. 1D CNN is designed to handle time-series data, but it can also be applied to find the spatial features by regarding a pixel order in a certain direction as a timestamp. The fully connected layer predicts the spatial features of the block to be coded through the extracted features. The clustering network finds clusters from the spatial features which are the outputs of the spatial feature prediction network. The network consists of 4 CNN layers. The first 3 CNN layers combine two spatial features in the vertical and horizontal directions. The last layer outputs the probabilities that pixels belong to the clusters. The pixels of the block are predicted by the representative values of the clusters that are the average of the reference pixels belonging to the clusters. For the intra prediction for various block sizes, the block is scaled to the size of the network input. The prediction result through the proposed network is scaled back to the original size. In network training, the mean square error is used as a loss function between the original block and the predicted block. A penalty for output values far from both ends is introduced to the loss function for clear network clustering. In the simulation results, the bit rate is saved by up to 12.45% under the same distortion condition compared with the latest video coding standard.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Deep region segmentation-based intra prediction for depth video coding
    Zhang, Jing
    Hou, Yonghong
    Zhang, Zhe
    Jin, Dengchao
    Zhang, Peihan
    Li, Ge
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (25) : 35953 - 35964
  • [2] Deep region segmentation-based intra prediction for depth video coding
    Jing Zhang
    Yonghong Hou
    Zhe Zhang
    Dengchao Jin
    Peihan Zhang
    Ge Li
    Multimedia Tools and Applications, 2022, 81 : 35953 - 35964
  • [3] Geometry-based Block Partitioning for Efficient Intra Prediction in Depth Video Coding
    Kang, Min-Koo
    Lee, Jaejoon
    Lee, Jin Young
    Ho, Yo-Sung
    VISUAL INFORMATION PROCESSING AND COMMUNICATION, 2010, 7543
  • [4] Fast Depth Video Coding with Intra Prediction on VVC
    Wei, Hongan
    Zhou, Binqian
    Fang, Ying
    Xu, Yiwen
    Zhao, Tiesong
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (07): : 3018 - 3038
  • [5] HIGH PRIORITY INTRA CODING METHOD FOR DEPTH VIDEO CODING
    Oh, Kwan-Jung
    Lee, Jaejoon
    Park, Du-Sik
    2012 PICTURE CODING SYMPOSIUM (PCS), 2012, : 45 - 48
  • [6] Deep Learning-Based Chroma Prediction for Intra Versatile Video Coding
    Zhu, Linwei
    Zhang, Yun
    Wang, Shiqi
    Kwong, Sam
    Jin, Xin
    Qiao, Yu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3168 - 3181
  • [7] Deep Learning based Angular Intra-Prediction for Lossless HEVC Video Coding
    Huang, Hongyue
    Schiopu, Ionut
    Munteanu, Adrian
    2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 579 - 579
  • [8] Depth Intra Skip Prediction for 3D Video Coding
    Oh, Kwan-Jung
    Lee, Jaejoon
    Park, Du-Sik
    2012 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2012,
  • [9] ADAPTIVE GEOMETRY-BASED INTRA PREDICTION FOR DEPTH VIDEO CODING
    Kang, Min-Koo
    Lee, Cheon
    Lee, Jin Young
    Ho, Yo-Sung
    2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 1230 - 1235
  • [10] Global-Context Aggregated Intra Prediction Network for Depth Video Coding
    Zhang, Jing
    Hou, Yonghong
    Peng, Bo
    Pan, Zhaoqing
    Li, Ge
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (08) : 3159 - 3163