Spotting Temporally Precise, Fine-Grained Events in Video

被引:10
|
作者
Hong, James [1 ]
Zhang, Haotian [1 ]
Gharbi, Michael [2 ]
Fisher, Matthew [2 ]
Fatahalian, Kayvon [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Adobe Res, San Francisco, CA USA
来源
基金
美国国家科学基金会;
关键词
Temporally precise spotting; Video understanding;
D O I
10.1007/978-3-031-19833-5_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce the task of spotting temporally precise, fine-grained events in video (detecting the precise moment in time events occur). Precise spotting requires models to reason globally about the full-time scale of actions and locally to identify subtle frame-to-frame appearance and motion differences that identify events during these actions. Surprisingly, we find that top performing solutions to prior video understanding tasks such as action detection and segmentation do not simultaneously meet both requirements. In response, we propose E2E-Spot, a compact, end-to-end model that performs well on the precise spotting task and can be trained quickly on a single GPU. We demonstrate that E2E-Spot significantly outperforms recent baselines adapted from the video action detection, segmentation, and spotting literature to the precise spotting task. Finally, we contribute new annotations and splits to several fine-grained sports action datasets to make these datasets suitable for future work on precise spotting.
引用
收藏
页码:33 / 51
页数:19
相关论文
共 50 条
  • [31] Fine-Grained Video Traffic Classification Based on QoE Values
    Lingyun Yang
    Yuning Dong
    Md. Sohel Rana
    Zaijian Wang
    Wireless Personal Communications, 2018, 103 : 1481 - 1498
  • [32] Fine-Grained Video Traffic Classification Based on QoE Values
    Yang, Lingyun
    Dong, Yuning
    Rana, Md. Sohel
    Wang, Zaijian
    WIRELESS PERSONAL COMMUNICATIONS, 2018, 103 (02) : 1481 - 1498
  • [33] Fine-Grained Length Controllable Video Captioning With Ordinal Embeddings
    Nitta, Tomoya
    Fukuzawa, Takumi
    Tamaki, Toru
    IEEE ACCESS, 2024, 12 : 189667 - 189688
  • [34] FineAction: A Fine-Grained Video Dataset for Temporal Action Localization
    Liu, Yi
    Wang, Limin
    Wang, Yali
    Ma, Xiao
    Qiao, Yu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6937 - 6950
  • [35] Improve Fine-Grained Feature Learning in Fine-Grained DataSet GAI
    Wang, Hai Peng
    Geng, Zhi Qing
    IEEE ACCESS, 2025, 13 : 12777 - 12788
  • [36] Leveraging Fine-Grained Labels to Regularize Fine-Grained Visual Classification
    Wu, Junfeng
    Yao, Li
    Liu, Bin
    Ding, Zheyuan
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION (ICCMS 2019) AND 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND APPLICATIONS (ICICA 2019), 2019, : 133 - 136
  • [37] On the Feasibility of Anomaly Detection with Fine-Grained Program Tracing Events
    Li, Hong-Wei
    Wu, Yu-Sung
    Huang, Yennun
    JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2022, 30 (02)
  • [38] On the Feasibility of Anomaly Detection with Fine-Grained Program Tracing Events
    Hong-Wei Li
    Yu-Sung Wu
    Yennun Huang
    Journal of Network and Systems Management, 2022, 30
  • [39] FINE-GRAINED MONOLITH
    Louw, Michael
    ARCHITECTURE SOUTH AFRICA, 2019, (96): : 48 - 49
  • [40] Is fine-grained viable?
    Aaldering, M
    EDN, 1997, 42 (02) : 28 - 28