T-CNN: Tubelets With Convolutional Neural Networks for Object Detection From Videos

被引:272
|
作者
Kang, Kai [1 ]
Li, Hongsheng [2 ]
Yan, Junjie [5 ]
Zeng, Xingyu [5 ]
Yang, Bin [6 ]
Xiao, Tong [1 ]
Zhang, Cong [7 ]
Wang, Zhe [2 ]
Wang, Ruohui [3 ]
Wang, Xiaogang [2 ]
Ouyang, Wanli [4 ]
机构
[1] Chinese Univ Hong Kong, Elect Engn, Hong Kong, Hong Kong, Peoples R China
[2] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Dept Informat Engn, Hong Kong, Hong Kong, Peoples R China
[4] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[5] SenseTime Grp Ltd, Beijing 100000, Peoples R China
[6] Univ Toronto, Comp Sci Dept, Toronto, ON M5S 1A1, Canada
[7] Shanghai Jiao Tong Univ, Inst Image Commun & Informat Proc, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Object detection; neural networks; computer vision; TRACKING;
D O I
10.1109/TCSVT.2017.2736553
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The state-of-the-art performance for object detection has been significantly improved over the past two years. Besides the introduction of powerful deep neural networks, such as GoogleNet and VGG, novel object detection frameworks, such as R-CNN and its successors, Fast R-CNN, and Faster R-CNN, play an essential role in improving the state of the art. Despite their effectiveness on still images, those frameworks are not specifically designed for object detection from videos. Temporal and contextual information of videos are not fully investigated and utilized. In this paper, we propose a deep learning framework that incorporates temporal and contextual information from tubelets obtained in videos, which dramatically improves the baseline performance of existing still-image detection frameworks when they are applied to videos. It is called T-CNN, i.e., tubelets with convolutional neueral networks. The proposed framework won newly introduced an object-detectionfrom-video task with provided data in the ImageNet Large-Scale Visual Recognition Challenge 2015. Code is publicly available at https://githuh.com/royfavouritekk/T-CNN.
引用
收藏
页码:2896 / 2907
页数:12
相关论文
共 50 条
  • [1] Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos
    Hou, Rui
    Chen, Chen
    Shah, Mubarak
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5823 - 5832
  • [2] Object Detection from Video Tubelets with Convolutional Neural Networks
    Kang, Kai
    Ouyang, Wanli
    Li, Hongsheng
    Wang, Xiaogang
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 817 - 825
  • [3] T-CNN: Trilinear convolutional neural networks model for visual detection of plant diseases
    Wang, Dongfang
    Wang, Jun
    Li, Wenrui
    Guan, Ping
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 190
  • [4] TOD-CNN: An effective convolutional neural network for tiny object detection in sperm videos
    Zou, Shuojia
    Li, Chen
    Sun, Hongzan
    Xu, Peng
    Zhang, Jiawei
    Ma, Pingli
    Yao, Yudong
    Huang, Xinyu
    Grzegorzek, Marcin
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 146
  • [5] MOVING OBJECT DETECTION IN VIDEOS USING PRINCIPAL COMPONENT PURSUIT AND CONVOLUTIONAL NEURAL NETWORKS
    Tejada, Enrique D.
    Rodriguez, Paul A.
    [J]. 2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 793 - 797
  • [6] A Lite Asymmetric DenseNet for Effective Object Detection Based on Convolutional Neural Networks (CNN)
    Huang, Long
    Ren, Kun
    Fan, Chunqi
    Deng, Hai
    [J]. OPTOELECTRONIC IMAGING AND MULTIMEDIA TECHNOLOGY VI, 2019, 11187
  • [7] Parallel Convolutional Neural Networks for Object Detection
    Olugboja, Adedeji
    Wang, Zenghui
    Sun, Yanxia
    [J]. JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2021, 12 (04) : 279 - 286
  • [8] Object Detection Using Convolutional Neural Networks
    Galvez, Reagan L.
    Bandala, Argel A.
    Dadios, Elmer P.
    Vicerra, Ryan Rhay P.
    Maningo, Jose Martin Z.
    [J]. PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 2023 - 2027
  • [9] Cascaded Convolutional Neural Networks for Object Detection
    Guo, Yajing
    Guo, Xiaoqiang
    Jiang, Zhuqing
    Zhou, Yun
    [J]. 2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [10] Convolutional Neural Networks Based Fire Detection in Surveillance Videos
    Muhammad, Khan
    Ahmad, Jamil
    Mehmood, Irfan
    Rho, Seungmin
    Baik, Sung Wook
    [J]. IEEE ACCESS, 2018, 6 : 18174 - 18183