Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval

被引:9
|
作者
Li, Pandeng [1 ]
Xie, Hongtao [1 ]
Ge, Jiannan [1 ]
Zhang, Lei [2 ]
Min, Shaobo [3 ]
Zhang, Yongdong [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Kuaishou Technol, Beijing, Peoples R China
[3] Tencent Data Platform, Shenzhen, Peoples R China
来源
关键词
Unsupervised video retrieval; Dual-stream hashing;
D O I
10.1007/978-3-031-19781-9_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised video hashing usually optimizes binary codes by learning to reconstruct input videos. Such reconstruction constraint spends much effort on frame-level temporal context changes without focusing on video-level global semantics that are more useful for retrieval. Hence, we address this problem by decomposing video information into reconstruction-dependent and semantic-dependent information, which disentangles the semantic extraction from reconstruction constraint. Specifically, we first design a simple dual-stream structure, including a temporal layer and a hash layer. Then, with the help of semantic similarity knowledge obtained from self-supervision, the hash layer learns to capture information for semantic retrieval, while the temporal layer learns to capture the information for reconstruction. In this way, the model naturally preserves the disentangled semantics into binary codes. Validated by comprehensive experiments, our method consistently out-performs the state-of-the-arts on three video benchmarks.
引用
收藏
页码:181 / 197
页数:17
相关论文
共 50 条
  • [1] DSCEH: Dual-Stream Correlation-Enhanced Deep Hashing for Image Retrieval
    Yang, Yulin
    Chen, Huizhen
    Liu, Rongkai
    Liu, Shuning
    Zhan, Yu
    Hu, Chao
    Shi, Ronghua
    [J]. MATHEMATICS, 2024, 12 (14)
  • [2] Discriminative dual-stream deep hashing for large-scale image retrieval
    Ding, Yujuan
    Wong, Wai Keung
    Lai, Zhihui
    Zhang, Zheng
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [3] Dual-stream Co-enhanced Network for Unsupervised Video Object Segmentation
    Zhu, Hongliang
    Yin, Hui
    Liu, Yanting
    Chen, Ning
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (04): : 938 - 958
  • [4] Neighborhood Preserving Hashing for Scalable Video Retrieval
    Li, Shuyan
    Chen, Zhixiang
    Lu, Jiwen
    Li, Xiu
    Zhou, Jie
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8211 - 8220
  • [5] Dual-Stream Recurrent Neural Network for Video Captioning
    Xu, Ning
    Liu, An-An
    Wong, Yongkang
    Zhang, Yongdong
    Nie, Weizhi
    Su, Yuting
    Kankanhalli, Mohan
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2482 - 2493
  • [6] Video Retrieval with Similarity-Preserving Deep Temporal Hashing
    Shen, Ling
    Hong, Richang
    Zhang, Haoran
    Tian, Xinmei
    Wang, Meng
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (04)
  • [7] SDGNN: Symmetry-Preserving Dual-Stream Graph Neural Networks
    Chen, Jiufang
    Yuan, Ye
    Luo, Xin
    [J]. IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2024, 11 (07) : 1717 - 1719
  • [8] Compressed Video Action Recognition With Dual-Stream and Dual-Modal Transformer
    Mou, Yuting
    Jiang, Xinghao
    Xu, Ke
    Sun, Tanfeng
    Wang, Zepeng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3299 - 3312
  • [9] SDGNN: Symmetry-Preserving Dual-Stream Graph Neural Networks
    Jiufang Chen
    Ye Yuan
    Xin Luo
    [J]. IEEE/CAA Journal of Automatica Sinica, 2024, 11 (07) : 1717 - 1719
  • [10] Unsupervised Rank-Preserving Hashing for Large-Scale Image Retrieval
    Kararnan, Svebor
    Lin, Xudong
    Hu, Xuefeng
    Chang, Shih-Fu
    [J]. ICMR'19: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2019, : 192 - 196