Accumulated reconstruction error vector (AREV): a semantic representation for cross-media retrieval

被引:0
|
作者
Kai Liu
Shikui Wei
Yao Zhao
Zhenfeng Zhu
Yunchao Wei
Changsheng Xu
机构
[1] Beijing Jiaotong University,Institute of Information Science
[2] Chinese Academy of Sciences,Institute of Automation
[3] Beijing Key Laboratory of Advanced Information Science and Network Technology,undefined
来源
关键词
Cross-media; Accumulated reconstruction error vector; Retrieval; Consistency; Dictionary learning;
D O I
暂无
中图分类号
学科分类号
摘要
Cross-media retrieval aims to automatically perform the content-based search procedure among various media types (e.g., image, video and text), in which media representation plays an important role for providing the heterogeneous similarity measure. In this work, a novel semantic representation of cross-media, called accumulated reconstruction error vector (AREV), is proposed, which includes category-specific dictionary learning, media sample reconstruction, and accumulative reconstruction error concatenation. Instead of directly learning the correlation relationship among heterogeneous items in the same semantic groups, the AREV projects individually their original feature descriptions into a shared semantic space, in which each component is semantic consistent for various media types due to the consistency in category information. Experiments on the commonly used datasets, i.e. Wikipedia dataset and NUS-Wide dataset, show the good performance in terms of effectiveness and efficiency.
引用
收藏
页码:561 / 576
页数:15
相关论文
共 50 条
  • [41] A Benchmark Dataset and Learning High-Level Semantic Embeddings of Multimedia for Cross-media Retrieval
    Rehman, Sadaqat Ur
    Tu, Shanshan
    Huang, Yongfeng
    Rehman, Obaid Ur
    IEEE ACCESS, 2018, 6 : 67176 - 67188
  • [42] CSRNCVA: A MODEL OF CROSS-MEDIA SEMANTIC RETRIEVAL BASED ON NEURAL COMPUTING OF VISUAL AND AUDITORY SENSATIONS
    Liu, Y.
    Cai, K.
    Liu, C.
    Zheng, F.
    NEURAL NETWORK WORLD, 2018, 28 (04) : 305 - 323
  • [43] CROSS-MODALITY CORRELATION PROPAGATION FOR CROSS-MEDIA RETRIEVAL
    Zhai, Xiaohua
    Peng, Yuxin
    Xiao, Jianguo
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2337 - 2340
  • [44] Toward cross-language and cross-media image retrieval
    Alvarez, C
    Oumohmed, AI
    Mignotte, M
    Nie, JY
    MULTILINGUAL INFORMATION ACCESS FOR TEXT, SPEECH AND IMAGES, 2005, 3491 : 676 - 687
  • [45] Internet cross-media retrieval based on deep learning
    Jiang, Bin
    Yang, Jiachen
    Lv, Zhihan
    Tian, Kun
    Meng, Qinggang
    Yan, Yan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2017, 48 : 356 - 366
  • [46] Bagging-based cross-media retrieval algorithm
    Xu, Gongwen
    Zhang, Yu
    Yin, Mingshan
    Hong, Wenzhong
    Zou, Ran
    Wang, Shanshan
    SOFT COMPUTING, 2023, 27 (05) : 2615 - 2623
  • [47] Complementary information retrieval for cross-media news content
    Ma, Qiang
    Nadamoto, Akiyo
    Tanaka, Katsumi
    INFORMATION SYSTEMS, 2006, 31 (07) : 659 - 678
  • [48] Cross-media retrieval based on synthesis reasoning model
    College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao, 2009, 9 (1307-1314):
  • [49] Cross-media retrieval based on linear discriminant analysis
    Qi, Yudan
    Zhang, Huaxiang
    Zhang, Bin
    Wang, Li
    Zheng, Shunxin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (17) : 24249 - 24268
  • [50] Finding the best picture: Cross-media retrieval of content
    Deschacht, Koen
    Moens, Marie-Francine
    ADVANCES IN INFORMATION RETRIEVAL, 2008, 4956 : 539 - 546