HASSOD: Hierarchical Adaptive Self-Supervised Object Detection

被引:0
|
作者
Cao, Shengcao [1 ]
Joshi, Dhiraj [2 ]
Gui, Liang-Yan [1 ]
Wang, Yu-Xiong [1 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
[2] IBM Res, Armonk, NY USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The human visual perception system demonstrates exceptional capabilities in learning without explicit supervision and understanding the part-to-whole composition of objects. Drawing inspiration from these two abilities, we propose Hierarchical Adaptive Self-Supervised Object Detection (HASSOD), a novel approach that learns to detect objects and understand their compositions without human supervision. HASSOD employs a hierarchical adaptive clustering strategy to group regions into object masks based on self-supervised visual representations, adaptively determining the number of objects per image. Furthermore, HASSOD identifies the hierarchical levels of objects in terms of composition, by analyzing coverage relations between masks and constructing tree structures. This additional self-supervised learning task leads to improved detection performance and enhanced interpretability. Lastly, we abandon the inefficient multi-round self-training process utilized in prior methods and instead adapt the Mean Teacher framework from semi-supervised learning, which leads to a smoother and more efficient training process. Through extensive experiments on prevalent image datasets, we demonstrate the superiority of HASSOD over existing methods, thereby advancing the state of the art in self-supervised object detection. Notably, we improve Mask AR from 20.2 to 22.5 on LVIS, and from 17.0 to 26.0 on SA-1B. Project page: https://HASSOD- NeurIPS23.github.io.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Object Detection with Self-Supervised Scene Adaptation
    Zhang, Zekun
    Hoai, Minh
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 21589 - 21599
  • [2] Self-Supervised Object Detection from Egocentric Videos
    Akiva, Peri
    Huang, Jing
    Liang, Kevin J.
    Kovvuri, Rama
    Chen, Xingyu
    Feiszli, Matt
    Dana, Kristin
    Hassner, Tal
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5202 - 5214
  • [3] Self-Supervised Reinforcement Learning for Active Object Detection
    Fang, Fen
    Liang, Wenyu
    Wu, Yan
    Xu, Qianli
    Lim, Joo-Hwee
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04): : 10224 - 10231
  • [4] Hierarchical Detection of Network Anomalies : A Self-Supervised Learning Approach
    Kye, Hyoseon
    Kim, Miru
    Kwon, Minhae
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1908 - 1912
  • [5] Hierarchical Detection of Network Anomalies : A Self-Supervised Learning Approach
    Kye, Hyoseon
    Kim, Miru
    Kwon, Minhae
    [J]. IEEE Signal Processing Letters, 2022, 29 : 1908 - 1912
  • [6] Single-shot self-supervised object detection in microscopy
    Midtvedt, Benjamin
    Pineda, Jesus
    Skarberg, Fredrik
    Olsen, Erik
    Bachimanchi, Harshith
    Wesen, Emelie
    Esbjorner, Elin K. K.
    Selander, Erik
    Hook, Fredrik
    Midtvedt, Daniel
    Volpe, Giovanni
    [J]. NATURE COMMUNICATIONS, 2022, 13 (01)
  • [7] Self-Supervised Object Detection and Retrieval Using Unlabeled Videos
    Amrani, Elad
    Ben-Ari, Rami
    Shapira, Inbar
    Hakim, Tal
    Bronstein, Alex
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4100 - 4108
  • [8] Single-shot self-supervised object detection in microscopy
    Benjamin Midtvedt
    Jesús Pineda
    Fredrik Skärberg
    Erik Olsén
    Harshith Bachimanchi
    Emelie Wesén
    Elin K. Esbjörner
    Erik Selander
    Fredrik Höök
    Daniel Midtvedt
    Giovanni Volpe
    [J]. Nature Communications, 13
  • [9] Self-Supervised Object Detection via Generative Image Synthesis
    Mustikovela, Siva Karthik
    De Mello, Shalini
    Prakash, Aayush
    Iqbal, Umar
    Liu, Sifei
    Thu Nguyen-Phuoc
    Rother, Carsten
    Kautz, Jan
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8589 - 8598
  • [10] Self-Supervised Feature Augmentation for Large Image Object Detection
    Pan, Xingjia
    Tang, Fan
    Dong, Weiming
    Gu, Yang
    Song, Zhichao
    Meng, Yiping
    Xu, Pengfei
    Deussen, Oliver
    Xu, Changsheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 6745 - 6758