Eliminating and mining strategies for open-world object proposal

被引:0
|
作者
Wang, Cheng [1 ,2 ]
Wang, Guoli [3 ]
Zhang, Qian [3 ]
Guo, Peng [2 ]
Liu, Wenyu [2 ]
Wang, Xinggang [2 ]
机构
[1] Huazhong Univ Sci & Technol, Inst Artificial Intelligence, Wuhan 430074, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Peoples R China
[3] Horizon Robot, Beijing 100086, Peoples R China
基金
中国国家自然科学基金;
关键词
Open-world object proposal; Eliminate ambiguity; Mine pseudo labels;
D O I
10.1016/j.neucom.2024.128026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object proposal serves as a crucial pre -task of many image and video understanding applications. However, modern approaches for object proposal are typically based on closed -world assumptions, focusing only on predefined categories. This approach cannot meet the diverse needs of real -world applications. To address this limitation, we introduce two strategies, namely the eliminating strategy and the mining strategy, to robustly train the Object Localization Network (OLN) for open -world object proposal. The eliminating strategy takes into account the spatial configuration between labeled boxes, thereby eliminating box anchors that overlap with multiple objects. The mining strategy employs a pseudo -label guided self -training scheme, enabling the mining of object boxes in novel categories. Without bells and whistles, our proposed method outperforms previous state-of-the-art methods on large-scale benchmarks, including COCO, Objects365, and UVO. The source codes are available at https://github.com/hustvl/EM-OLN.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] A New Multinetwork Mean Distillation Loss Function for Open-World Domain Incremental Object Detection
    Yang, Jing
    Yuan, Kun
    Chen, Suhao
    Li, Qinglang
    Li, Shaobo
    Zhang, Xiuhua
    Li, Bin
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023
  • [32] Open-World Object Manipulation using Pre-Trained Vision-Language Models
    Stone, Austin
    Xiao, Ted
    Lu, Yao
    Gopalakrishnan, Keerthana
    Lee, Kuang-Huei
    Quan Vuong
    Wohlhart, Paul
    Kirmani, Sean
    Zitkovich, Brianna
    Xia, Fei
    Finn, Chelsea
    Hausman, Karol
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [33] OW-Adapter: Human-Assisted Open-World Object Detection with a Few Examples
    Jamonnak, Suphanut
    Guo, Jiajing
    He, Wenbin
    Gou, Liang
    Ren, Liu
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (01) : 694 - 704
  • [34] Open-World Learning for Traffic Scenarios Categorisation
    Balasubramanian, Lakshman
    Wurst, Jonas
    Botsch, Michael
    Deng, Ke
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (05): : 3506 - 3521
  • [35] Open-World Mission Specification for Reactive Robots
    Maniatopoulos, Spyros
    Blair, Matthew
    Finucane, Cameron
    Kress-Gazit, Hadas
    2014 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2014, : 4328 - 4334
  • [36] Distributed Affordance: An Open-World Assumption for Hypermedia
    Verborgh, Ruben
    Hausenblas, Michael
    Steiner, Thomas
    Mannens, Erik
    Van de Walle, Rik
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 1399 - 1406
  • [37] Open-World Planning Algorithm Based on Logic
    Gao, Jie
    Liu, Ya-song
    Bian, Rui
    PROCEEDINGS OF THE 2015 CHINESE INTELLIGENT SYSTEMS CONFERENCE, VOL 1, 2016, 359 : 77 - 84
  • [38] Open-World Virtual Reality Headset Tracking
    Humphreys, Todd E.
    Kor, Ronnie Xian Thong
    Iannucci, Peter A.
    Yoder, James E.
    PROCEEDINGS OF THE 33RD INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS+ 2020), 2020, : 2931 - 2947
  • [39] Toward open-world software: Issues and challenges
    Baresi, Luciano
    Di Nitto, Ellsabetta
    Ghezzi, Carlo
    COMPUTER, 2006, 39 (10) : 36 - +
  • [40] Toward Open-World Software: Issue and challenges
    Baresi, Luciano
    Di Nitto, Elisabetta
    Ghezzi, Carlo
    30TH ANNUAL IEEE/NASA SOFTWARE ENGINEERING WORKSHOP, PROCEEDINGS, 2006, : 249 - 249