Joint learning of object and action detectors

被引:24
|
作者
Kalogeiton, Vicky [1 ,2 ]
Weinzaepfel, Philippe [3 ]
Ferrari, Vittorio [2 ]
Schmid, Cordelia [1 ]
机构
[1] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP,LJK, F-38000 Grenoble, France
[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[3] Naver Labs Europe, Meylan, France
关键词
D O I
10.1109/ICCV.2017.219
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While most existing approaches for detection in videos focus on objects or human actions separately, we aim at jointly detecting objects performing actions, such as cat eating or dog jumping. We introduce an end-to-end multi-task objective that jointly learns object-action relationships. We compare it with different training objectives, validate its effectiveness for detecting objects-actions in videos, and show that both tasks of object and action detection benefit from this joint learning. Moreover, the proposed architecture can be used for zero-shot learning of actions: our multitask objective leverages the commonalities of an action performed by different objects, e.g. dog and cat jumping, enabling to detect actions of an object without training with these object-actions pairs. In experiments on the A2D dataset [50], we obtain state-of-the-art results on segmentation of object-action pairs. We finally apply our multitask architecture to detect visual relationships between objects in images of the VRD dataset [24].
引用
收藏
页码:2001 / 2010
页数:10
相关论文
共 50 条
  • [31] Visual Object Tracking via Joint Learning Method
    Tian, Wei
    Lv, Jingyuan
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 1163 - 1167
  • [32] Joint Learning for Multi-class Object Detection
    Fard, Hamidreza Odabai
    Chaouch, Mohamed
    Quoc-cuong Pham
    Vacavant, Antoine
    Chateau, Thierry
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 104 - 112
  • [33] Joint Learning of Semantic Alignment and Object Landmark Detection
    Jeon, Sangryul
    Min, Dongbo
    Kim, Seungryong
    Sohn, Kwanghoon
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 7293 - 7302
  • [34] Deep Joint Task Learning for Generic Object Extraction
    Wang, Xiaolong
    Zhang, Liliang
    Lin, Liang
    Liang, Zhujin
    Zuo, Wangmeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [35] Joint Inductive and Transductive Learning for Video Object Segmentation
    Mao, Yunyao
    Wang, Ning
    Zhou, Wengang
    Li, Houqiang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9650 - 9659
  • [36] Incremental learning of object detection with output merging of compact expert detectors
    Zhang, Nan
    Sun, Zhigang
    Zhang, Kai
    Xiao, Li
    2021 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AUTONOMOUS SYSTEMS (ICOIAS 2021), 2021, : 1 - 7
  • [37] Learning Deep Object Detectors from 3D Models
    Peng, Xingchao
    Sun, Baochen
    Ali, Karim
    Saenko, Kate
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1278 - 1286
  • [38] Performance evaluation of deep learning object detectors for weed detection for cotton
    Rahman, Abdur
    Lu, Yuzhen
    Wang, Haifeng
    SMART AGRICULTURAL TECHNOLOGY, 2023, 3
  • [39] Faster ILOD: Incremental learning for object detectors based on faster RCNN
    Peng, Can
    Zhao, Kun
    Lovell, Brian C.
    PATTERN RECOGNITION LETTERS, 2020, 140 : 109 - 115
  • [40] Interactive Data Collection for Deep Learning Object Detectors on Humanoid Robots
    Maiettini, Elisa
    Pasquale, Giulia
    Rosasco, Lorenzo
    Natale, Lorenzo
    2017 IEEE-RAS 17TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTICS (HUMANOIDS), 2017, : 862 - 868