Cross-media similarity metric learning with unified deep networks

被引:0
|
作者
Jinwei Qi
Xin Huang
Yuxin Peng
机构
[1] Peking University,Institute of Computer Science and Technology
来源
关键词
Cross-media retrieval; Representation learning; Metric learning;
D O I
暂无
中图分类号
学科分类号
摘要
As a highlighting research topic in the multimedia area, cross-media retrieval aims to capture the complex correlations among multiple media types. Learning better shared representation and distance metric for multimedia data is important to boost the cross-media retrieval. Motivated by the strong ability of deep neural network in feature representation and comparison functions learning, we propose the Unified Network for Cross-media Similarity Metric (UNCSM) to associate cross-media shared representation learning with distance metric in a unified framework. First, we design a two-pathway deep network pretrained with contrastive loss, and employ double triplet similarity loss for fine-tuning to learn the shared representation for each media type by modeling the relative semantic similarity. Second, the metric network is designed for effectively calculating the cross-media similarity of the shared representation, by modeling the pairwise similar and dissimilar constraints. Compared to the existing methods which mostly ignore the dissimilar constraints and only use sample distance metric as Euclidean distance separately, our UNCSM approach unifies the representation learning and distance metric to preserve the relative similarity as well as embrace more complex similarity functions for further improving the cross-media retrieval accuracy. The experimental results show that our UNCSM approach outperforms 8 state-of-the-art methods on 4 widely-used cross-media datasets.
引用
收藏
页码:25109 / 25127
页数:18
相关论文
共 50 条
  • [1] Cross-media similarity metric learning with unified deep networks
    Qi, Jinwei
    Huang, Xin
    Peng, Yuxin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (23) : 25109 - 25127
  • [2] Internet cross-media retrieval based on deep learning
    Jiang, Bin
    Yang, Jiachen
    Lv, Zhihan
    Tian, Kun
    Meng, Qinggang
    Yan, Yan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 356 - 366
  • [3] Cross-media retrieval with collective deep semantic learning
    Bin Zhang
    Lei Zhu
    Jiande Sun
    Huaxiang Zhang
    Multimedia Tools and Applications, 2018, 77 : 22247 - 22266
  • [4] Cross-media retrieval with collective deep semantic learning
    Zhang, Bin
    Zhu, Lei
    Sun, Jiande
    Zhang, Huaxiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (17) : 22247 - 22266
  • [5] Cross-media Deep Fine-grained Correlation Learning
    Zhuo Y.-K.
    Qi J.-W.
    Peng Y.-X.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (04): : 884 - 895
  • [6] A Unified Semantic Model for Cross-Media Events Analysis in Online Social Networks
    Fang, Mingzhe
    Li, Yang
    Hui, Ying
    Mao, Shuang
    Shi, Peng
    IEEE ACCESS, 2019, 7 : 32166 - 32182
  • [7] Learning in cross-media environment
    Bonometti S.
    Bonometti, Stefano, 1600, IGI Global (12): : 48 - 57
  • [8] Relative image similarity learning with contextual information for Internet cross-media retrieval
    Shuqiang Jiang
    Xinhang Song
    Qingming Huang
    Multimedia Systems, 2014, 20 : 645 - 657
  • [9] Cross-Media Semantic Correlation Learning Based on Deep Hash Network and Semantic Expansion for Social Network Cross-Media Search
    Liang, Meiyu
    Du, Junping
    Yang, Congxian
    Xue, Zhe
    Li, Haisheng
    Kou, Feifei
    Geng, Yue
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (09) : 3634 - 3648
  • [10] Relative image similarity learning with contextual information for Internet cross-media retrieval
    Jiang, Shuqiang
    Song, Xinhang
    Huang, Qingming
    MULTIMEDIA SYSTEMS, 2014, 20 (06) : 645 - 657