ClearPose: Large-scale Transparent Object Dataset and Benchmark

被引:11
|
作者
Chen, Xiaotong [1 ]
Zhang, Huijie [1 ]
Yu, Zeren [1 ]
Opipari, Anthony [1 ]
Jenkins, Odest Chadwicke [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
来源
关键词
Transparent objects; Depth completion; Pose estimation; Dataset and benchmark;
D O I
10.1007/978-3-031-20074-8_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transparent objects are ubiquitous in household settings and pose distinct challenges for visual sensing and perception systems. The optical properties of transparent objects leave conventional 3D sensors alone unreliable for object depth and pose estimation. These challenges are highlighted by the shortage of large-scale RGB-Depth datasets focusing on transparent objects in real-world settings. In this work, we contribute a large-scale real-world RGB-Depth transparent object dataset named ClearPose to serve as a benchmark dataset for segmentation, scene-level depth completion and object-centric pose estimation tasks. The ClearPose dataset contains over 350K labeled real-world RGB-Depth frames and 5M instance annotations covering 63 household objects. The dataset includes object categories commonly used in daily life under various lighting and occluding conditions as well as challenging test scenarios such as cases of occlusion by opaque or translucent objects, non-planar orientations, presence of liquids, etc. We benchmark several state-of-the-art depth completion and object pose estimation deep neural networks on ClearPose. The dataset and benchmarking source code is available at https://githuh.com/opipari/ClearPose.
引用
收藏
页码:381 / 396
页数:16
相关论文
共 50 条
  • [41] BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
    Cai, Likun
    Zhang, Zhi
    Zhu, Yi
    Zhang, Li
    Li, Mu
    Xue, Xiangyang
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4776 - 4786
  • [42] A Dataset and Benchmark for Large-scale Multi-modal Face Anti-spoofing
    Zhang, Shifeng
    Wang, Xiaobo
    Liu, Ajian
    Zhao, Chenxu
    Wan, Jun
    Escalera, Sergio
    Shi, Hailin
    Wang, Zezheng
    Li, Stan Z.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 919 - 928
  • [43] LaSOT: A High-quality Benchmark for Large-scale Single Object Tracking
    Fan, Heng
    Lin, Liting
    Yang, Fan
    Chu, Peng
    Deng, Ge
    Yu, Sijia
    Bai, Hexin
    Xu, Yong
    Liao, Chunyuan
    Ling, Haibin
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5369 - 5378
  • [44] LaSOT: A High-quality Large-scale Single Object Tracking Benchmark
    Heng Fan
    Hexin Bai
    Liting Lin
    Fan Yang
    Peng Chu
    Ge Deng
    Sijia Yu
    Mingzhen Harshit
    Juehuan Huang
    Yong Liu
    Chunyuan Xu
    Lin Liao
    Haibin Yuan
    [J]. International Journal of Computer Vision, 2021, 129 : 439 - 461
  • [45] CSPC-Dataset: New LiDAR Point Cloud Dataset and Benchmark for Large-Scale Scene Semantic Segmentation
    Tong, Guofeng
    Li, Yong
    Chen, Dong
    Sun, Qi
    Cao, Wei
    Xiang, Guiqiu
    [J]. IEEE ACCESS, 2020, 8 : 87695 - 87718
  • [46] MCMOD: The Multi-Category Large-Scale Dataset for Maritime Object Detection
    Sun, Zihao
    Hu, Xiao
    Qi, Yining
    Huang, Yongfeng
    Li, Songbin
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (01): : 1657 - 1669
  • [47] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding
    Zhu, Chenchen
    Xiao, Fanyi
    Alvarado, Andres
    Babaei, Yasmine
    Hu, Jiabo
    El-Mohri, Hichem
    Culatana, Sean Chang
    Sumbaly, Roshan
    Yan, Zhicheng
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 20053 - 20063
  • [48] OmniArt: A Large-scale Artistic Benchmark
    Strezoski, Gjorgji
    Worring, Marcel
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2018, 14 (04)
  • [49] DMDD: A Large-Scale Dataset for Dataset Mentions Detection
    Pan, Huitong
    Zhang, Qi
    Dragut, Eduard
    Caragea, Cornelia
    Latecki, Longin Jan
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2023, 11 : 1132 - 1146
  • [50] Large-Scale Indoor Visual-Geometric Multimodal Dataset and Benchmark for Novel View Synthesis
    Cao, Junming
    Zhao, Xiting
    Schwertfeger, Soren
    [J]. SENSORS, 2024, 24 (17)