Time-Frequency Mutual Learning for Moment Retrieval and Highlight Detection

被引:0
|
作者
Zhong, Yaokun [1 ]
Liang, Tianming [1 ]
Hu, Jian-Fang [1 ,2 ,3 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[2] Guangdong Prov Key Lab Informat Secur Technol, Guangzhou, Peoples R China
[3] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
关键词
video moment retrieval; frequency-domain deep learning; deep mutual learning;
D O I
10.1007/978-981-97-8620-6_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Moment Retrieval and Highlight Detection (MR/HD) aims to concurrently retrieve relevant moments and predict clip-wise saliency scores according to a given textual query. Previous MR/HD works have overlooked explicit modeling of static-dynamic visual information described by the language query, which could lead to inaccurate predictions especially when the queried event describes both static appearances and dynamic motions. In this work, we consider learning the static interaction and dynamic reasoning from the time domain and frequency domain respectively, and propose a novel Time-Frequency Mutual Learning framework (TFML) which mainly consists of a time-domain branch, a frequency-domain branch, and a time-frequency aggregation branch. The time-domain branch learns to attend to the static visual information related to the textual query. In the frequency-domain branch, we introduce the Short-Time Fourier Transform (STFT) for dynamic modeling by attending to the frequency contents within varied segments. The time-frequency aggregation branch integrates the information from these two branches. To promote the mutual complementation of time-domain and frequency-domain information, we further employ a mutual learning strategy in concise and effective two-way loop, which enables the branches to collaboratively reason and achieve time-frequency consistent prediction. Extensive experiments on QVHighlights and TVSum demonstrate the effectiveness of our proposed framework as compared with state-of-the-art methods.
引用
收藏
页码:34 / 48
页数:15
相关论文
共 50 条
  • [1] A measure of mutual information on the time-frequency plane
    Aviyente, S
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 481 - 484
  • [2] Query-Dependent Video Representation for Moment Retrieval and Highlight Detection
    Moon, WonJun
    Hyun, Sangeek
    Park, SangUk
    Park, Dongchan
    Heo, Jae-Pil
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23023 - 23033
  • [3] TIME-FREQUENCY LEARNING MACHINES FOR NONSTATIONARITY DETECTION USING SURROGATES
    Amoud, Hassan
    Honeine, Paul
    Richard, Cedric
    Borgnat, Pierre
    Flandrin, Patrick
    2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 565 - +
  • [4] Time-frequency learning machines
    Honeine, Paul
    Richard, Cedric
    Flandrin, Patrick
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (07) : 3930 - 3936
  • [5] On the time-frequency detection of chirps
    Chassande-Mottin, E
    Flandrin, P
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 1999, 6 (02) : 252 - 281
  • [6] Mutual estimates of time-frequency representations and uncertainty principles
    Albanese, Angela A.
    Mele, Claudio
    Oliaro, Alessandro
    ANNALI DI MATEMATICA PURA ED APPLICATA, 2024, : 667 - 691
  • [7] MS-DETR: Exploiting Modality Synergy for Moment Retrieval and Highlight Detection
    Chen, Luyuan
    Huang, Jing
    Kong, Ming
    Liang, Tian
    Zhu, Qiang
    Wu, Jianwu
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 416 - 429
  • [8] Temporal refinement and multi-grained matching for moment retrieval and highlight detection
    Zhu, Cunjuan
    Zhang, Yanyi
    Jia, Qi
    Wang, Weimin
    Liu, Yu
    MULTIMEDIA SYSTEMS, 2025, 31 (01)
  • [9] Seismic Damage Detection of Moment Resisting Frame Structures Using Time-Frequency Features
    Tao, Dongwang
    Ma, Qiang
    Li, Shanyou
    SHOCK AND VIBRATION, 2018, 2018
  • [10] Higher moment connectedness of cryptocurrencies: a time-frequency approach
    Kingstone Nyakurukwa
    Yudhvir Seetharam
    Journal of Economics and Finance, 2023, 47 (3) : 793 - 814