COTS: A Multipurpose RGB-D Dataset for Saliency and Image Manipulation Applications

被引:3
|
作者
Seychell, Dylan [1 ]
Debono, Carl James [1 ]
Bugeja, Mark [2 ]
Borg, Jeremy [2 ]
Sacco, Matthew [1 ]
机构
[1] Univ Malta, Dept Comp & Commun Engn, Msida 2080, Malta
[2] Univ Malta, Dept Artificial Intelligence, Msida 2080, Malta
关键词
Saliency detection; Object detection; Image segmentation; Lighting; Computer vision; Cameras; Semantics; Dataset; RGB-D; salient object detection; inpainting; blending; segmentation; OBJECT DETECTION; MODEL;
D O I
10.1109/ACCESS.2021.3055647
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many modern computer vision systems include several modules that perform different processing operations packaged as a single pipeline architecture. This generally introduces a challenge in the evaluation process since most datasets provide evaluation data for just one of the operations. In this paper, we present an RGB-D dataset that was designed from first principles to cater for applications that involve salient object detection, segmentation, inpainting and blending techniques. This addresses a gap in the evaluation of image inpainting and blending applications that generally rely on subjective evaluation due to the lack of availability of comparative data. A set of experiments were carried out to demonstrate how the COTS dataset can be used to evaluate these different applications. This dataset includes a variety of scenes, where each scene is captured multiple times, each time adding a new object to the previous scene. This allows for a comparative analysis at pixel level in image inpainting and blending applications. Moreover, all objects were manually labeled in order to offer the possibility of salient object detection even in scenes that contain multiple objects. An online test with 1267 participants was also carried out, and this dataset also includes the click coordinates of users' selection for every image, introducing a user interaction dimension in the same RGB-D dataset. This dataset was also validated using state of the art techniques, obtaining an F-beta of 0.957 in salient object detection and a mean (Intersection over Union) IoU of 0.942 in Segmentation. Results demonstrate that the COTS dataset introduces novel possibilities for the evaluation of computer vision applications.
引用
收藏
页码:21481 / 21497
页数:17
相关论文
共 50 条
  • [1] RGB-D Saliency Detection: Dataset and Algorithm for Robot Vision
    Yuan, Xia
    Yue, Juan
    Zhang, Yanan
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 1028 - 1033
  • [2] Selective Features for RGB-D Saliency
    Zhu, Lei
    Cao, Zhiguo
    Fang, Zhiwen
    Xiao, Yang
    Wu, Jin
    Deng, Huiping
    Liu, Jing
    [J]. 2015 CHINESE AUTOMATION CONGRESS (CAC), 2015, : 512 - 517
  • [3] RGB-D image saliency detection from 3D perspective
    Liu, Zhengyi
    Song, Tengfei
    Xie, Feng
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (06) : 6787 - 6804
  • [4] RGB-D image saliency detection from 3D perspective
    Zhengyi Liu
    Tengfei Song
    Feng Xie
    [J]. Multimedia Tools and Applications, 2019, 78 : 6787 - 6804
  • [5] SALIENT OBJECT DETECTION FOR RGB-D IMAGE VIA SALIENCY EVOLUTION
    Guo, Jingfan
    Ren, Tongwei
    Bei, Jia
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [6] Robust RGB-D Fusion for Saliency Detection
    Wu, Zongwei
    Gobichettipalayam, Shriarulmozhivarman
    Tamadazte, Brahim
    Allibert, Guillaume
    Paudel, Danda Pani
    Demonceaux, Cedric
    [J]. 2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022, : 403 - 413
  • [7] Uncertainty Inspired RGB-D Saliency Detection
    Zhang, Jing
    Fan, Deng-Ping
    Dai, Yuchao
    Anwar, Saeed
    Saleh, Fatemeh
    Aliakbarian, Sadegh
    Barnes, Nick
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5761 - 5779
  • [8] Learning an Intrinsic Image Decomposer Using Synthesized RGB-D Dataset
    Han, Guangyun
    Xie, Xiaohua
    Lai, Jianhuang
    Zheng, Wei-Shi
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (06) : 753 - 757
  • [9] A Volumetric Saliency Guided Image Summarization for RGB-D Indoor Scene Classification
    Meena, Preeti
    Kumar, Himanshu
    Yadav, Sandeep Kumar
    [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (11) : 10917 - 10929
  • [10] Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images
    Wang, Xiaoqiang
    Zhu, Lei
    Tang, Siliang
    Fu, Huazhu
    Li, Ping
    Wu, Fei
    Yang, Yi
    Zhuang, Yueting
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1107 - 1119