Zero-Shot Object Detection with Textual Descriptions

被引:0
|
作者
Li, Zhihui [1 ]
Yao, Lina [1 ]
Zhang, Xiaoqin [2 ]
Wang, Xianzhi [3 ]
Kanhere, Salil [1 ]
Zhang, Huaxiang [4 ]
机构
[1] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW, Australia
[2] Wenzhou Univ, Coll Math & Informat Sci, Wenzhou, Peoples R China
[3] Univ Technol Sydney, Sch Software, Sydney, NSW, Australia
[4] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object detection is important in real-world applications. Existing methods mainly focus on object detection with sufficient labelled training data or zero-shot object detection with only concept names. In this paper, we address the challenging problem of zero-shot object detection with natural language description, which aims to simultaneously detect and recognize novel concept instances with textual descriptions. We propose a novel deep learning framework to jointly learn visual units, visual-unit attention and word-level attention, which are combined to achieve word-proposal affinity by an element-wise multiplication. To the best of our knowledge, this is the first work on zero-shot object detection with textual descriptions. Since there is no directly related work in the literature, we investigate plausible solutions based on existing zero-shot object detection for a fair comparison. We conduct extensive experiments on three challenging benchmark datasets. The extensive experimental results confirm the superiority of the proposed model.
引用
收藏
页码:8690 / 8697
页数:8
相关论文
共 50 条
  • [1] Zero-Shot Object Detection with Textual Descriptions Using Convolutional Neural Networks
    Zhang, Licheng
    Wang, Xianzhi
    Yao, Lina
    Zheng, Feng
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [2] Zero-Shot Object Detection
    Bansal, Ankan
    Sikka, Karan
    Sharma, Gaurav
    Chellappa, Rama
    Divakaran, Ajay
    [J]. COMPUTER VISION - ECCV 2018, PT I, 2018, 11205 : 397 - 414
  • [3] ZERO-SHOT OBJECT DETECTION WITH TRANSFORMERS
    Zheng, Ye
    Cui, Li
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 444 - 448
  • [4] Zero-Shot Camouflaged Object Detection
    Li, Haoran
    Feng, Chun-Mei
    Xu, Yong
    Zhou, Tao
    Yao, Lina
    Chang, Xiaojun
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5126 - 5137
  • [5] Write a Classifier: Zero-Shot Learning Using Purely Textual Descriptions
    Elhoseiny, Mohamed
    Saleh, Babak
    Elgammal, Ahmed
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 2584 - 2591
  • [6] Zero-Shot Object Detection for Indoor Robots
    Abdalwhab, Abdalwhab
    Liu, Huaping
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [7] Transductive Learning for Zero-Shot Object Detection
    Rahman, Shafin
    Khan, Salman
    Barnes, Nick
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6081 - 6090
  • [8] Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions
    Ba, Jimmy Lei
    Swersky, Kevin
    Fidler, Sanja
    Salakhutdinov, Ruslan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4247 - 4255
  • [9] Synthetic Feature Assessment for Zero-Shot Object Detection
    Dai, Xinmiao
    Wang, Chong
    Li, Haohe
    Lin, Sunqi
    Dong, Li
    Wu, Jiafei
    Wang, Jun
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 444 - 449
  • [10] Zero-Shot Object Counting
    Xu, Jingyi
    Le, Hieu
    Nguyen, Vu
    Ranjan, Viresh
    Samaras, Dimitris
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15548 - 15557