Self-Supervised Temporal Sensitive Hashing for Video Retrieval

被引:0
|
作者
Li, Qihua [1 ]
Tian, Xing [2 ]
Ng, Wing W. Y. [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangdong Prov Key Lab Computat Intelligence & Cyb, Guangzhou 510006, Guangdong, Peoples R China
[2] South China Normal Univ, Sch Artificial Intelligence, Guangzhou 510631, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Hash functions; Sensitivity; Perturbation methods; Long short term memory; Transformers; Training; Robustness; Self-supervise; video hashing; video retrieval; transformer; CLASSIFICATION; LSTM;
D O I
10.1109/TMM.2024.3385183
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-supervised video hashing methods retrieve large-scale video data without labels by making full use of visual and temporal information in original videos. Existing methods are not robust enough to handle small temporal differences between similar videos, because of the ignoring of future unseen samples on temporal which leads to large generalization errors. At the same time, existing self-supervised methods cannot preserve pairwise similarity information between large-scale unlabeled data efficiently and effectively. Thus, a self-supervised temporal sensitive video hashing (TSVH) is proposed in the paper for video retrieval. The TSVH uses a transformer-based autoencoder network with temporal sensitivity regularization to achieve low sensitivity of local temporal perturbations and preserve information of global temporal sequence. The pairwise similarity between video samples is effectively preserved by applying a hashing-based affinity matrix in the method. Experiments on realistic datasets show that the TSVH outperforms several state-of-the-art methods and classic methods.
引用
收藏
页码:9021 / 9035
页数:15
相关论文
共 50 条
  • [1] Self-supervised Video Hashing via Bidirectional Transformers
    Li, Shuyan
    Li, Xiu
    Lu, Jiwen
    Zhou, Jie
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13544 - 13553
  • [2] Relational Consistency Induced Self-Supervised Hashing for Image Retrieval
    Jin, Lu
    Li, Zechao
    Pan, Yonghua
    Tang, Jinhui
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 13
  • [3] Deep Contrastive Self-Supervised Hashing for Remote Sensing Image Retrieval
    Tan, Xiaoyan
    Zou, Yun
    Guo, Ziyang
    Zhou, Ke
    Yuan, Qiangqiang
    [J]. REMOTE SENSING, 2022, 14 (15)
  • [4] Sparse graph based self-supervised hashing for scalable image retrieval
    Wang, Weiwei
    Zhang, Haofeng
    Zhang, Zheng
    Liu, Li
    Shao, Ling
    [J]. INFORMATION SCIENCES, 2021, 547 : 622 - 640
  • [5] Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval
    Li, Chao
    Deng, Cheng
    Li, Ning
    Liu, Wei
    Gao, Xinbo
    Tao, Dacheng
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4242 - 4251
  • [6] Self-Supervised Graph Convolution for Video Moment Retrieval
    Hu, Xiwen
    Wang, Guolong
    Shan, Shimin
    Liu, Yu
    Li, Jiangquan
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PART X, 2023, 14263 : 407 - 419
  • [7] Self-Supervised Video Hashing With Hierarchical Binary Auto-Encoder
    Song, Jingkuan
    Zhang, Hanwang
    Li, Xiangpeng
    Gao, Lianli
    Wang, Meng
    Hong, Richang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (07) : 3210 - 3221
  • [8] Autoencoder-based self-supervised hashing for cross-modal retrieval
    Li, Yifan
    Wang, Xuan
    Cui, Lei
    Zhang, Jiajia
    Huang, Chengkai
    Luo, Xuan
    Qi, Shuhan
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (11) : 17257 - 17274
  • [9] Autoencoder-based self-supervised hashing for cross-modal retrieval
    Yifan Li
    Xuan Wang
    Lei Cui
    Jiajia Zhang
    Chengkai Huang
    Xuan Luo
    Shuhan Qi
    [J]. Multimedia Tools and Applications, 2021, 80 : 17257 - 17274
  • [10] Self-supervised Video Representation Learning with Cascade Positive Retrieval
    Wu, Cheng-En
    Lai, Farley
    Hu, Yu Hen
    Kadav, Asim
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4079 - 4088