Future Feature-Based Supervised Contrastive Learning for Streaming Perception

Authors
Wang, Tongbo [1]
Huang, Hua [2]
Affiliations
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100081, Peoples R China
[2] Beijing Normal Univ, Sch Artificial Intelligence, Beijing 100875, Peoples R China
Keywords
Streaming media; Object detection; Contrastive learning; Feature extraction; Accuracy; Task analysis; Real-time systems; Video object detection; streaming perception; supervised contrastive learning; appearance features
DOI
10.1109/TCSVT.2024.3439692
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
Streaming perception, a critical task in computer vision, involves the real-time prediction of object locations within video sequences based on prior frames. While current methods like StreamYOLO mainly rely on coordinate information, they often fall short of delivering precise predictions due to feature misalignment between input data and supervisory labels. In this paper, a novel method, Future Feature-based Supervised Contrastive Learning (FFSCL), is introduced to address this challenge by incorporating appearance features from future frames and leveraging supervised contrastive learning techniques. FFSCL establishes a robust correspondence between the appearance of an object in current and past frames and its location in the subsequent frame. This integrated method significantly improves the accuracy of object position prediction in streaming perception tasks. In addition, the FFSCL method includes a sample pair construction module (SPC) for the efficient creation of positive and negative samples based on future frame labels and a feature consistency loss (FCL) to enhance the effectiveness of supervised contrastive learning by linking appearance features from future frames with those from past frames. The efficacy of FFSCL is demonstrated through extensive experiments on two large-scale benchmark datasets, where FFSCL consistently outperforms state-of-the-art methods in streaming perception tasks. This study represents a significant advancement in the incorporation of supervised contrastive learning techniques and future frame information into the realm of streaming perception, paving the way for more accurate and efficient object prediction within video streams.
Pages: 13611-13625
Page count: 15