Quality assessment for view synthesis using low-level and mid-level structural representation

Cited by: 7

Authors:

Zhou, Yu [1]

Li, Leida [1]

Ling, Suiyi [2]

Le Callet, Patrick [2]

Affiliations:

[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China

[2] Univ Nantes, F-44300 Nantes, France

Source:

SIGNAL PROCESSING-IMAGE COMMUNICATION | 2019 / Vol. 74

Keywords:

View synthesis; Quality evaluation; Low-level; Mid-level; Structural representation; IMAGES

DOI:

10.1016/j.image.2019.03.005

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification code:

0808; 0809

Abstract:

View synthesis is the most important technique in multi-view and free-viewpoint video. The complete view synthesis process comprises the acquisition and processing of texture and depth images, followed by the virtual view rendering stage. Existing quality metrics for view synthesis have limited ability to assess the whole process, for two reasons. First, they are dedicated to a single stage of view synthesis and overlook what is common to all the distortions that may be introduced across the whole process. Second, they extract only low-level features for quality assessment and ignore the perceptual degradation of mid-level contours, which represent the spatial distribution and connection of adjacent contour pixels and are destroyed by heavy distortions in the texture/depth images and by imperfect view rendering. Motivated by these observations, this paper presents a quality metric for view synthesis that uses both Low-level and Mid-level Structural representation (LMS), aiming to accurately evaluate the distortions in the whole view synthesis process. Specifically, a scale space is first constructed to mimic the hierarchical property of the human visual system. Then, the statistics of gradient orientation are integrated with the statistics of gradient intensity to form the low-level structural representation, motivated by the importance of the orientation selectivity mechanism to visual perception. Next, the mid-level structure is represented using a bag of words for contour description, based on the sparse coding of the primary visual cortex. The distances between the synthesized and reference images are then calculated for both the low-level and mid-level features. Finally, the two distances are integrated to generate the overall quality score. Extensive experiments on two public view synthesis databases demonstrate the superiority of the proposed method over state-of-the-art methods in evaluating the quality of the whole view synthesis process.
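
The low-level part of the pipeline described in the abstract (scale space, gradient intensity and orientation statistics, feature distance) can be illustrated with a rough sketch. This is not the authors' implementation: the 2x2 block-average pyramid, the 8-bin histograms, the L1 distance, and all function names (`downsample`, `gradient_histograms`, `low_level_features`, `feature_distance`) are assumptions chosen for brevity, and the mid-level bag-of-words contour representation is omitted entirely. Images are assumed to be 2-D grayscale arrays with values in [0, 1].

```python
import numpy as np

def downsample(img):
    """Halve resolution by averaging 2x2 blocks (assumed stand-in for a scale space)."""
    h, w = img.shape[0] // 2 * 2, img.shape[1] // 2 * 2
    img = img[:h, :w]
    return img.reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def gradient_histograms(img, bins=8):
    """Return normalized histograms of gradient magnitude and orientation."""
    gy, gx = np.gradient(img.astype(float))
    mag = np.hypot(gx, gy)
    ori = np.arctan2(gy, gx)  # orientation in [-pi, pi]
    # For inputs in [0, 1], each derivative is at most 1, so magnitude <= sqrt(2).
    mag_hist, _ = np.histogram(mag, bins=bins, range=(0.0, np.sqrt(2.0)))
    # Weight orientation votes by magnitude so flat regions contribute nothing.
    ori_hist, _ = np.histogram(ori, bins=bins, range=(-np.pi, np.pi), weights=mag)
    eps = 1e-12  # avoids division by zero on constant images
    mag_hist = mag_hist / (mag_hist.sum() + eps)
    ori_hist = ori_hist / (ori_hist.sum() + eps)
    return np.concatenate([mag_hist, ori_hist])

def low_level_features(img, scales=3, bins=8):
    """Concatenate gradient statistics over several scales."""
    feats = []
    current = img.astype(float)
    for _ in range(scales):
        feats.append(gradient_histograms(current, bins))
        current = downsample(current)
    return np.concatenate(feats)

def feature_distance(ref, syn, scales=3, bins=8):
    """L1 distance between the features of a reference and a synthesized image."""
    f_ref = low_level_features(ref, scales, bins)
    f_syn = low_level_features(syn, scales, bins)
    return float(np.abs(f_ref - f_syn).sum())
```

Under these assumptions, `feature_distance(ref, syn)` is zero when the two images are identical and grows as their gradient statistics diverge. The abstract does not state how the low-level and mid-level distances are combined into the final score, so that step is not sketched here.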

Citation

Pages: 309-321

Page count: 13

50 entries in total

[1] The role of low-level shear, mid-level shear, and buoyancy in the intensity of modelled low-level mesocyclones
Wicker, LJ
[J]. 19TH CONFERENCE ON SEVERE LOCAL STORMS, 1998, : 222 - 225
[2] Multi-view Facial Expression Recognition Based on Fusing Low-level and Mid-level Features
Bi, Mingyue
Ma, Xin
Song, Rui
Rong, Xuewen
Li, Yibin
[J]. 2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 9083 - 9088
[3] Occlusion Boundaries from Motion: Low-Level Detection and Mid-Level Reasoning
Stein, Andrew N.
Hebert, Martial
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2009, 82 (03) : 325 - 357
[4] Occlusion Boundaries from Motion: Low-Level Detection and Mid-Level Reasoning
Andrew N. Stein
Martial Hebert
[J]. International Journal of Computer Vision, 2009, 82 : 325 - 357
[5] Pedestrian re-identification based on fusing low-level and mid-level features
Wang Li
[J]. CHINESE OPTICS, 2016, 9 (05) : 540 - 546
[6] Merging segmentations of low-level and mid-level time series for audio class discovery
Radhakrishnan, Regunathan
Divakaran, Ajay
[J]. 2006 FORTIETH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1-5, 2006, : 64 - +
[7] Depth and lightness: Mid-level model tested against high- and low-level models
Gilchrist, A.
Radonjic, A.
Todorovic, D.
[J]. PERCEPTION, 2006, 35 : 181 - 181
[8] SuperFloxels: A Mid-level Representation for Video Sequences
Ravichandran, Avinash
Wang, Chaohui
Raptis, Michalis
Soatto, Stefano
[J]. COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 131 - 140
[9] Image Classification Using Mixed-Order Structural Representation based on Mid-Level Feature
Jiang, Bing
Song, Yan
Dai, Li-Rong
[J]. 2013 SIXTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2013, : 144 - 149
[10] Gameplay genre video classification by using mid-level video representation
de Souza, Renato Augusto
de Almeida, Raquel Pereira
Moldovan, Arghir-Nicolae
do Patrocinio, Zenilton Kleber G., Jr.
Guimaraes, Silvio Jamil F.
[J]. 2016 29TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI), 2016, : 188 - 194
