Quality assessment for view synthesis using low-level and mid-level structural representation

被引:7
|
作者
Zhou, Yu [1 ]
Li, Leida [1 ]
Ling, Suiyi [2 ]
Le Callet, Patrick [2 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China
[2] Univ Nantes, F-44300 Nantes, France
关键词
View synthesis; Quality evaluation; Low-level; Mid-level; Structural representation; IMAGES;
D O I
10.1016/j.image.2019.03.005
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
View synthesis is the most important technique in multi-view and free-viewpoint videos. The whole view synthesis includes the acquisition and processing of texture and depth images, and the virtual view rendering stage. Existing quality metrics for view synthesis have limited ability for the whole synthesis process for the following reasons. First, they are dedicated to a single stage of view synthesis, overlooking the commonality of all the possible distortions introduced in the whole process. Moreover, they only extract low-level features for quality assessment, ignoring the perceptual degradation caused by the mid-level contours that are destructed by heavy distortions in texture/depth images and the imperfect view rendering, which represent the spatial distribution/connection of adjacent contour pixels. Inspired by the above facts, this paper presents a quality metric for view synthesis using both Low-level and Mid-level Structural representation (LMS), aiming to accurately evaluate the distortions in the whole view synthesis process. Specifically, the scale space is first constructed to mimic the hierarchical property of the human visual system. Then, the statistics of gradient orientation is integrated with the statistics of gradient intensity for the low-level structural representation, which is motivated by the importance of the orientation selectivity mechanism to visual perception. Further, the mid-level structure is represented using bag of words for contour description based on the sparse coding of the primary visual cortex. Then the distances of both the low-level and mid-level features between the synthesized and reference images are calculated. Finally, two distances are integrated to generate the whole quality score. Extensive experiments on two public view synthesis databases demonstrate the superiority of the proposed method to the state-of-the-arts in evaluating the quality of the whole view synthesis.
引用
收藏
页码:309 / 321
页数:13
相关论文
共 50 条
  • [1] The role of low-level shear, mid-level shear, and buoyancy in the intensity of modelled low-level mesocyclones
    Wicker, LJ
    [J]. 19TH CONFERENCE ON SEVERE LOCAL STORMS, 1998, : 222 - 225
  • [2] Multi-view Facial Expression Recognition Based on Fusing Low-level and Mid-level Features
    Bi, Mingyue
    Ma, Xin
    Song, Rui
    Rong, Xuewen
    Li, Yibin
    [J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9083 - 9088
  • [3] Occlusion Boundaries from Motion: Low-Level Detection and Mid-Level Reasoning
    Stein, Andrew N.
    Hebert, Martial
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2009, 82 (03) : 325 - 357
  • [4] Occlusion Boundaries from Motion: Low-Level Detection and Mid-Level Reasoning
    Andrew N. Stein
    Martial Hebert
    [J]. International Journal of Computer Vision, 2009, 82 : 325 - 357
  • [5] Pedestrian re-identification based on fusing low-level and mid-level features
    Wang Li
    [J]. CHINESE OPTICS, 2016, 9 (05): : 540 - 546
  • [6] Merging segmentations of low-level and mid-level time series for audio class discovery
    Radhakrishnan, Regunathan
    Divakaran, Ajay
    [J]. 2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5, 2006, : 64 - +
  • [7] Depth and lightness: Mid-level model tested against high- and low-level models
    Gilchrist, A.
    Radonjic, A.
    Todorovic, D.
    [J]. PERCEPTION, 2006, 35 : 181 - 181
  • [8] SuperFloxels: A Mid-level Representation for Video Sequences
    Ravichandran, Avinash
    Wang, Chaohui
    Raptis, Michalis
    Soatto, Stefano
    [J]. COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 131 - 140
  • [9] Image Classification Using Mixed-Order Structural Representation based on Mid-Level Feature
    Jiang, Bing
    Song, Yan
    Dai, Li-Rong
    [J]. 2013 SIXTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2013, : 144 - 149
  • [10] Gameplay genre video classification by using mid-level video representation
    de Souza, Renato Augusto
    de Almeida, Raquel Pereira
    Moldovan, Arghir-Nicolae
    do Patrocinio, Zenilton Kleber G., Jr.
    Guimaraes, Silvio Jamil F.
    [J]. 2016 29TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2016, : 188 - 194