Time-Frequency Mutual Learning for Moment Retrieval and Highlight Detection

被引:0
|
作者
Zhong, Yaokun [1 ]
Liang, Tianming [1 ]
Hu, Jian-Fang [1 ,2 ,3 ]
机构
[1] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[2] Guangdong Prov Key Lab Informat Secur Technol, Guangzhou, Peoples R China
[3] Minist Educ, Key Lab Machine Intelligence & Adv Comp, Guangzhou, Peoples R China
关键词
video moment retrieval; frequency-domain deep learning; deep mutual learning;
D O I
10.1007/978-981-97-8620-6_3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Moment Retrieval and Highlight Detection (MR/HD) aims to concurrently retrieve relevant moments and predict clip-wise saliency scores according to a given textual query. Previous MR/HD works have overlooked explicit modeling of static-dynamic visual information described by the language query, which could lead to inaccurate predictions especially when the queried event describes both static appearances and dynamic motions. In this work, we consider learning the static interaction and dynamic reasoning from the time domain and frequency domain respectively, and propose a novel Time-Frequency Mutual Learning framework (TFML) which mainly consists of a time-domain branch, a frequency-domain branch, and a time-frequency aggregation branch. The time-domain branch learns to attend to the static visual information related to the textual query. In the frequency-domain branch, we introduce the Short-Time Fourier Transform (STFT) for dynamic modeling by attending to the frequency contents within varied segments. The time-frequency aggregation branch integrates the information from these two branches. To promote the mutual complementation of time-domain and frequency-domain information, we further employ a mutual learning strategy in concise and effective two-way loop, which enables the branches to collaboratively reason and achieve time-frequency consistent prediction. Extensive experiments on QVHighlights and TVSum demonstrate the effectiveness of our proposed framework as compared with state-of-the-art methods.
引用
收藏
页码:34 / 48
页数:15
相关论文
共 50 条
  • [11] Higher moment connectedness of cryptocurrencies: a time-frequency approach
    Nyakurukwa, Kingstone
    Seetharam, Yudhvir
    JOURNAL OF ECONOMICS AND FINANCE, 2023, 47 (03) : 793 - 814
  • [12] ONLINE LEARNING OF TIME-FREQUENCY PATTERNS
    Ruiz-Munoz, Jose F.
    Raich, Raviv
    Orozco-Alzate, Mauricio
    Fern, Xiaoli Z.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2811 - 2815
  • [13] Multiridge detection and time-frequency reconstruction
    Carmona, RA
    Hwang, WL
    Torrésani, B
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1999, 47 (02) : 480 - 492
  • [14] A time-frequency approach for spike detection
    Hassanpour, H
    Mesbah, M
    Boashash, B
    ICECS 2003: PROCEEDINGS OF THE 2003 10TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS, VOLS 1-3, 2003, : 56 - 59
  • [15] Multiridge detection and time-frequency reconstruction
    Princeton Univ, Princeton, United States
    IEEE Trans Signal Process, 2 (480-492):
  • [16] Time-frequency detection of gravitational waves
    Anderson, WG
    Balasubramanian, R
    PHYSICAL REVIEW D, 1999, 60 (10):
  • [17] A TIME-FREQUENCY FORMULATION OF OPTIMUM DETECTION
    FLANDRIN, P
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (09): : 1377 - 1384
  • [18] A Study on Instantaneous Time-Frequency Methods for Damage Detection of Nonlinear Moment-Resisting Frames
    Darvishan, Ehsan
    Amiri, Gholamreza Ghodrati
    Ghaderi, Pedram
    SHOCK AND VIBRATION, 2014, 2014
  • [19] Modality-Aware Heterogeneous Graph for Joint Video Moment Retrieval and Highlight Detection
    Wang, Ruomei
    Feng, Jiawei
    Zhang, Fuwei
    Luo, Xiaonan
    Luo, Yuanmao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (09) : 8896 - 8911
  • [20] Transfer learning based bridge damage detection: Leveraging time-frequency features
    Talaei, Saeid
    Zhu, Xinqun
    Li, Jianchun
    Yu, Yang
    Chan, Tommy H. T.
    STRUCTURES, 2023, 57