Described Object Detection: Liberating Object Detection with Flexible Expressions

被引:0
|
作者
Xie, Chi [1 ]
Zhang, Zhao [2 ]
Wu, Yixuan [3 ]
Zhu, Feng [2 ]
Zhao, Rui [2 ]
Liang, Shuang [1 ]
机构
[1] Tongji Univ, Shanghai, Peoples R China
[2] Sensetime Res, Hong Kong, Peoples R China
[3] Zhejiang Univ, Hangzhou, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
LANGUAGE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting objects based on language information is a popular task that includes Open-Vocabulary object Detection (OVD) and Referring Expression Comprehension (REC). In this paper, we advance them to a more practical setting called Described Object Detection (DOD) by expanding category names to flexible language expressions for OVD and overcoming the limitation of REC only grounding the pre-existing object. We establish the research foundation for DOD by constructing a Description Detection Dataset (D3). This dataset features flexible language expressions, whether short category names or long descriptions, and annotating all described objects on all images without omission. By evaluating previous SOTA methods on D3, we find some troublemakers that fail current REC, OVD, and bi-functional methods. REC methods struggle with confidence scores, rejecting negative instances, and multi-target scenarios, while OVD methods face constraints with long and complex descriptions. Recent bi-functional methods also do not work well on DOD due to their separated training procedures and inference strategies for REC and OVD tasks. Building upon the aforementioned findings, we propose a baseline that largely improves REC methods by reconstructing the training data and introducing a binary classification sub-task, outperforming existing methods. Data and code are available at this URL and related works are tracked in this repo.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Camouflaged Object Detection
    Fan, Deng-Ping
    Ji, Ge-Peng
    Sun, Guolei
    Cheng, Ming-Ming
    Shen, Jianbing
    Shao, Ling
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 2774 - 2784
  • [22] Saliency and Object Detection
    Kong, Phutphalla
    Mancas, Matei
    Kheang, Seng
    Gosselin, Bernard
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE (ICPRAI 2018), 2018, : 523 - 528
  • [23] Active object detection
    de Croon, G.
    VISAPP 2007: Proceedings of the Second International Conference on Computer Vision Theory and Applications, Volume IU/MTSV, 2007, : 97 - 103
  • [24] Object Detection in Flatland
    Jovanovic, Natasa
    Korst, Jan
    Pronk, Verus
    2009 THIRD INTERNATIONAL CONFERENCE ON ADVANCED ENGINEERING COMPUTING AND APPLICATIONS IN SCIENCES (ADVCOMP 2009), 2009, : 95 - +
  • [25] Object Detection in Invoices
    Bulzan, Andrei -Stefan
    Cernazanu-Glavan, Cosmin
    2022 26TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2022, : 656 - 662
  • [26] ADN for object detection
    Wang, Jinding
    Hu, Haifeng
    Lu, Xinlong
    IET COMPUTER VISION, 2020, 14 (02) : 65 - 72
  • [27] THE OBJECT DETECTION EFFECT
    PURCELL, DG
    STEWART, AL
    GIACOLETTI, AM
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1987, 25 (05) : 347 - 347
  • [28] Spinning object detection
    Oliver Graydon
    Nature Photonics, 2013, 7 (10) : 762 - 762
  • [29] Object race detection
    von Praun, C
    Gross, TR
    ACM SIGPLAN NOTICES, 2001, 36 (11) : 70 - 82
  • [30] OBJECT DETECTION ENHANCEMENT
    SILVESTRO, FB
    PHOTOGRAMMETRIC ENGINEERING, 1969, 35 (06): : 555 - +