One-Shot Object Affordance Detection in the Wild

被引:0
|
作者
Wei Zhai
Hongchen Luo
Jing Zhang
Yang Cao
Dacheng Tao
机构
[1] University of Science and Technology of China,Institute of Artificial Intelligence
[2] The University of Sydney,undefined
[3] JD Explore Academy,undefined
[4] Hefei Comprehensive National Science Center,undefined
来源
关键词
Affordance detection; One-Shot learning; Human purpose estimation and transfer;
D O I
暂无
中图分类号
学科分类号
摘要
Affordance detection refers to identifying the potential action possibilities of objects in an image, which is a crucial ability for robot perception and manipulation. To empower robots with this ability in unseen scenarios, we first study the challenging one-shot affordance detection problem in this paper, i.e., given a support image that depicts the action purpose, all objects in a scene with the common affordance should be detected. To this end, we devise a One-Shot Affordance Detection Network (OSAD-Net) that firstly estimates the human action purpose and then transfers it to help detect the common affordance from all candidate images. Through collaboration learning, OSAD-Net can capture the common characteristics between objects having the same underlying affordance and learn a good adaptation capability for perceiving unseen affordances. Besides, we build a large-scale purpose-driven affordance dataset v2 (PADv2) by collecting and labeling 30k images from 39 affordance and 103 object categories. With complex scenes and rich annotations, our PADv2 dataset can be used as a test bed to benchmark affordance detection methods and may also facilitate downstream vision tasks, such as scene understanding, action recognition, and robot manipulation. Specifically, we conducted comprehensive experiments on PADv2 dataset by including 11 advanced models from several related research fields. Experimental results demonstrate the superiority of our model over previous representative ones in terms of both objective metrics and visual quality. The benchmark suite is available at https://github.com/lhc1224/OSAD_Net.
引用
收藏
页码:2472 / 2500
页数:28
相关论文
共 50 条
  • [1] One-Shot Object Affordance Detection in the Wild
    Zhai, Wei
    Luo, Hongchen
    Zhang, Jing
    Cao, Yang
    Tao, Dacheng
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (10) : 2472 - 2500
  • [2] One-Shot Affordance Detection
    Luo, Hongchen
    Zhai, Wei
    Zhang, Jing
    Cao, Yang
    Tao, Dacheng
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 895 - 901
  • [3] One-Shot Unsupervised Domain Adaptation for Object Detection
    Wan, Zhiqiang
    Li, Lusi
    Li, Hepeng
    He, Haibo
    Ni, Zhen
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] Adaptive Image Transformer for One-Shot Object Detection
    Chen, Ding-Jie
    Hsieh, He-Yen
    Liu, Tyng-Luh
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12242 - 12251
  • [5] OSCD: A one-shot conditional object detection framework
    Fu, Kun
    Zhang, Tengfei
    Zhang, Yue
    Sun, Xian
    [J]. NEUROCOMPUTING, 2021, 425 : 243 - 255
  • [6] Augmentative contrastive learning for one-shot object detection
    Du, Yaoyang
    Liu, Fang
    Jiao, Licheng
    Hao, Zehua
    Li, Shuo
    Liu, Xu
    Liu, Jing
    [J]. NEUROCOMPUTING, 2022, 513 : 13 - 24
  • [7] One-Shot Object Detection in Heterogeneous Artwork Datasets
    Madhu, Prathmesh
    Meyer, Anna
    Zinnen, Mathias
    Muhrenberg, Lara
    Suckow, Dirk
    Bendschus, Torsten
    Reinhardt, Corinna
    Bell, Peter
    Verstegen, Ute
    Kosti, Ronak
    Maier, Andreas
    Christlein, Vincent
    [J]. 2022 ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2022,
  • [8] LAPLACIAN OBJECT: ONE-SHOT OBJECT DETECTION BY LOCALITY PRESERVING PROJECTION
    Biswas, Sujoy Kumar
    Milanfar, Peyman
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 4062 - 4066
  • [9] AROS: Affordance Recognition with One-Shot Human Stances
    Pacheco-Ortega, Abel
    Mayol-Cuevas, Walterio
    [J]. FRONTIERS IN ROBOTICS AND AI, 2023, 10
  • [10] Balanced and Hierarchical Relation Learning for One-shot Object Detection
    Yang, Hanqing
    Cai, Sijia
    Sheng, Hualian
    Deng, Bing
    Huang, Jianqiang
    Hua, Xian-Sheng
    Tang, Yong
    Zhang, Yu
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7581 - 7590