共 50 条
- [1] Video Object Detection with an Aligned Spatial-Temporal Memory [J]. COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 494 - 510
- [3] SPATIAL-TEMPORAL FEATURE AGGREGATION NETWORK FOR VIDEO OBJECT DETECTION [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1858 - 1862
- [5] A spatial-temporal approach for video caption detection and recognition [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04): : 961 - 971
- [6] RANDOM-SAMPLING-BASED SPATIAL-TEMPORAL FEATURE FOR CONSUMER VIDEO CONCEPT CLASSIFICATION [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1861 - 1864
- [9] End-to-End Video Object Detection with Spatial-Temporal Transformers [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1507 - 1516