Detection and tracking based tubelet generation for video object detection

被引:5
|
作者
Wang, Bin [1 ,2 ]
Tang, Sheng [1 ]
Xiao, Jun-Bin [1 ,2 ]
Yan, Quan-Feng [3 ]
Zhang, Yong-Dong [1 ]
机构
[1] Chinese Acad Sci, Key Lab Intelligent Informat Proc, Inst Comp Technol, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Hunan Inst Sci & Technol, Coll Comp Sci, Yueyang 414006, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; Tubelet generation; Tubelet fusion;
D O I
10.1016/j.jvcir.2018.11.014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video object detection (VID) is a more challenging task compared with still-image object detection, which not only needs to detect objects accurately per frame but also needs to track objects for a long period of time. In order to detect objects from videos, we propose a Detection And Tracking (DAT) based tubelet generation framework. Under this framework, we first propose a detection-based tubelet generation method which can generate tubelets with more accurate bounding boxes compared with traditional tracking-based methods. On the other hand, the latter can produce a higher recall of bounding boxes than the former in general. To take advantage of their complementary attributes, we further propose a novel tubelet fusion method to combine these multi-modal information (appearance information in independent images and contextual information in videos). Our extensive experiments on the well-known ILSVRC 2016 VID dataset show that our proposed method can achieve state-of-the-art performances. (C) 2018 Elsevier Inc. All rights reserved.
引用
收藏
页码:102 / 111
页数:10
相关论文
共 50 条
  • [1] Video-based Human-Object Interaction Detection from Tubelet Tokens
    Tu, Danyang
    Sun, Wei
    Min, Xiongkuo
    Zhai, Guangtao
    Shen, Wei
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [2] Object Detection in Videos with Tubelet Proposal Networks
    Kang, Kai
    Li, Hongsheng
    Xiao, Tong
    Ouyang, Wanli
    Yan, Junjie
    Liu, Xihui
    Wang, Xiaogang
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 889 - 897
  • [3] Video Object Detection by Model-Based Tracking
    Huang, De-Kai
    Chen, Kwang-Yu
    Cheng, Shyi-Chyi
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2013, : 2384 - 2387
  • [4] TubeR: Tubelet Transformer for Video Action Detection
    Zhao, Jiaojiao
    Zhang, Yanyi
    Li, Xinyu
    Chen, Hao
    Shuai, Bing
    Xu, Mingze
    Liu, Chunhui
    Kundu, Kaustav
    Xiong, Yuanjun
    Modolo, Davide
    Marsic, Ivan
    Snoek, Cees G. M.
    Tighe, Joseph
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 13588 - 13597
  • [5] Video Object Detection with MeanShift Tracking
    Zhang, Shuai
    Liu, Wei
    Fu, Haijie
    Yue, Xiaodong
    [J]. ROUGH SETS, IJCRS 2022, 2022, 13633 : 224 - 237
  • [6] Motion-Based Multiple Object Detection and Tracking in Video
    Sadura, Piotr
    [J]. 2021 SIGNAL PROCESSING SYMPOSIUM (SPSYMPO), 2021, : 248 - 251
  • [7] Object Detection and Tracking based on Trajectory in Broadcast Tennis Video
    Archana, M.
    Geetha, M. Kalaisevi
    [J]. SECOND INTERNATIONAL SYMPOSIUM ON COMPUTER VISION AND THE INTERNET (VISIONNET'15), 2015, 58 : 225 - 232
  • [8] Video-based vehicle tracking based on moving object detection
    Yang, Min
    Pei, Ming-Tao
    Wang, Yong-Jie
    Dong, Zhen
    Wu, Yu-Wei
    [J]. Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2014, 34 (04): : 370 - 375
  • [9] A TOOL FOR FAST GROUND TRUTH GENERATION FOR OBJECT DETECTION AND TRACKING FROM VIDEO
    Comaschi, Francesco
    Stuijk, Sander
    Basten, Twan
    Corporaal, Henk
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 368 - 372
  • [10] Performance evaluation of object detection and tracking in video
    Manohar, V
    Soundararajan, P
    Raju, H
    Goldgof, D
    Kasturi, R
    Garofolo, J
    [J]. COMPUTER VISION - ACCV 2006, PT II, 2006, 3852 : 151 - 161