Multiple-Level Distillation for Video Fine-Grained Accident Detection

被引:0
|
作者
Yu, Hongyang [1 ]
Zhang, Xinfeng [2 ]
Wang, Yaowei [1 ]
Huang, Qingming [2 ]
Yin, Baocai [1 ,3 ]
机构
[1] Peng Cheng Lab, Shenzhen 518066, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100039, Peoples R China
[3] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
基金
中国博士后科学基金;
关键词
Video accident detection; fine-grained accident detection; knowledge distillation; multiple-level distillation; EVENT DETECTION;
D O I
10.1109/TCSVT.2023.3338743
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accident detection in surveillance or dashcam videos is a common task in the field of traffic accident analysis by using videos. However, as accidents occur sparsely and randomly in the real world, the data records are more scarce than the training data for standard detection tasks such as object detection or instance detection. Moreover, the limited and diverse accident data makes it more difficult to model the accident pattern for fine-grained accident detection tasks analyzing the accident in detail. Extra prior information should be introduced in the tasks such as the common vision feature which could offer relatively effective information for many vision tasks. The big model could generate the common vision feature by training on abundant data and consuming a lot of computing time and resources. Even though the accident video data is special, the big model could also extract common vision features. Thus, in this paper, we propose to apply knowledge distillation to fine-grained accident detection which analyzes the spatial temporal existence and severity for solving the issues of complex computing (distillation to the small model) and keeping good performance under limited accident data. Knowledge distillation could offer extra general vision feature information from the pre-trained big model. Common knowledge distillation guides the student network to learn the same representations from the teacher network by logit mimicking or feature imitation. However, single-level distillation could only focus on one aspect of mimicking classification logit or deep features. Multiple tasks with different focuses are required for fine-grained accident detection, such as multiple accident classification, temporal-spatial accident region detection, and accident severity estimation. Thus in this paper, multiple-level distillation is proposed for the different modules to generate the unified video feature concerning all the tasks in fine-grained accident detection analysis. The various experimental results on a fine-grained accident detection dataset which provides more detailed annotations of accidents demonstrate that our method could effectively model the video feature for multiple tasks.
引用
收藏
页码:4445 / 4457
页数:13
相关论文
共 50 条
  • [1] Fine-Grained Accident Detection: Database and Algorithm
    Yu, Hongyang
    Zhang, Xinfeng
    Wang, Yaowei
    Huang, Qingming
    Yin, Baocai
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1059 - 1069
  • [2] Fine-Grained Prototypes Distillation for Few-Shot Object Detection
    Wang, Zichen
    Yang, Bo
    Yue, Haonan
    Ma, Zhenghao
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 6, 2024, : 5859 - 5866
  • [3] Multiple Granularity Analysis for Fine-grained Action Detection
    Ni, Bingbing
    Paramathayalan, Vignesh R.
    Moulin, Pierre
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 756 - 763
  • [4] Fine-grained Audible Video Description
    Shen, Xuyang
    Li, Dong
    Zhou, Jinxing
    Qin, Zhen
    He, Bowen
    Han, Xiaodong
    Li, Aixuan
    Dai, Yuchao
    Kong, Lingpeng
    Wang, Meng
    Qiao, Yu
    Zhong, Yiran
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10585 - 10596
  • [5] Fine-Grained Scalable Video Caching
    Gong, Qiushi
    Woods, John W.
    Kar, Koushik
    Chakareski, Jacob
    [J]. 2015 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2015, : 101 - 106
  • [6] Video Pose Distillation for Few-Shot, Fine-Grained Sports Action Recognition
    Hong, James
    Fisher, Matthew
    Gharbi, Michael
    Fatahalian, Kayvon
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9234 - 9243
  • [7] WebChild 2.0: Fine-Grained Commonsense Knowledge Distillation
    Tandon, Niket
    de Melo, Gerard
    Weikum, Gerhard
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017): SYSTEM DEMONSTRATIONS, 2017, : 115 - 120
  • [8] Fine-Grained Instance-Level Sketch-Based Video Retrieval
    Xu, Peng
    Liu, Kun
    Xiang, Tao
    Hospedales, Timothy M.
    Ma, Zhanyu
    Guo, Jun
    Song, Yi-Zhe
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) : 1995 - 2007
  • [9] Fine-Grained Video Retrieval With Scene Sketches
    Zuo, Ran
    Deng, Xiaoming
    Chen, Keqi
    Zhang, Zhengming
    Lai, Yu-Kun
    Liu, Fang
    Ma, Cuixia
    Wang, Hao
    Liu, Yong-Jin
    Wang, Hongan
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3136 - 3149
  • [10] Favor: Fine-Grained Video Rate Adaptation
    He, Jian
    Qureshi, Mubashir Adnan
    Qiu, Lili
    Li, Jin
    Li, Feng
    Han, Lei
    [J]. PROCEEDINGS OF THE 9TH ACM MULTIMEDIA SYSTEMS CONFERENCE (MMSYS'18), 2018, : 64 - 75