Described Object Detection: Liberating Object Detection with Flexible Expressions

Authors
Xie, Chi [1]
Zhang, Zhao [2]
Wu, Yixuan [3]
Zhu, Feng [2]
Zhao, Rui [2]
Liang, Shuang [1]
Affiliations
[1] Tongji Univ, Shanghai, Peoples R China
[2] Sensetime Res, Hong Kong, Peoples R China
[3] Zhejiang Univ, Hangzhou, Peoples R China
Funding
Natural Science Foundation of Shanghai; National Natural Science Foundation of China
Keywords
LANGUAGE;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Detecting objects based on language information is a popular task that includes Open-Vocabulary Object Detection (OVD) and Referring Expression Comprehension (REC). In this paper, we advance both to a more practical setting called Described Object Detection (DOD) by expanding the category names of OVD to flexible language expressions and by overcoming the limitation that REC grounds only pre-existing objects. We establish the research foundation for DOD by constructing a Description Detection Dataset (D3). This dataset features flexible language expressions, whether short category names or long descriptions, and annotates all described objects on all images without omission. By evaluating previous SOTA methods on D3, we identify several failure cases that trouble current REC, OVD, and bi-functional methods. REC methods struggle with confidence scores, rejecting negative instances, and multi-target scenarios, while OVD methods are constrained by long and complex descriptions. Recent bi-functional methods also do not work well on DOD because their training procedures and inference strategies for the REC and OVD tasks are separate. Building on these findings, we propose a baseline that substantially improves REC methods by reconstructing the training data and introducing a binary classification sub-task, outperforming existing methods. Data and code are available at this URL and related works are tracked in this repo.
Pages: 13