Adversarial task-specific learning

被引:0
|
作者
Fu, Xin [1 ,2 ]
Zhao, Yao [1 ,2 ]
Liu, Ting [1 ,2 ]
Wei, Yunchao [3 ]
Li, Jianan [4 ]
Wei, Shikui [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Beijing Key Lab Adv Informat Sci & Network Techno, Beijing 100044, Peoples R China
[3] Univ Illinois, Beckman Inst, Urbana, IL USA
[4] Beijing Inst Technol, Sch Opt Engn, Beijing 100081, Peoples R China
基金
美国国家科学基金会;
关键词
Cross-modal retrieval; Adversarial learning; Subspace learning;
D O I
10.1016/j.neucom.2019.06.079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate a principle way to learn a common feature space for data of different modalities (e.g. image and text), so that the similarity between different modal items can be directly measured for benefiting cross-modal retrieval task. To effectively keep semantic/distribution consistent for common feature embeddings, we propose a new Adversarial Task-Specific Learning (ATSL) approach to learn distinct embeddings for different retrieval tasks, i.e. images retrieve texts (I2T) or texts retrieve images (T2I). In particular, the proposed ATSL is with the following advantages: (a) semantic attributes are leveraged to encourage the learned common feature embeddings of couples to be semantic consistent; (b) adversarial learning is applied to relieve the inconsistent distribution of common feature embeddings for different modalities; (c) triplet optimization is employed to guarantee that similar items from different modalities are with smaller distances in the learned common space compared with the dissimilar ones; (d) task-specific learning produces better optimized common feature embeddings for different retrieval tasks. Our ATSL is embedded in a deep neural network, which can be learned in an end-to-end manner. We conduct extensive experiments on two popular benchmark datasets, e.g. Flickr30K and MS COCO. We achieve R@1 accuracy of 57.1% and 38.4% for I2T and 56.5% and 38.6% T2I on MS COCO and Flickr30K respectively, which are the new state-of-the-arts. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:118 / 128
页数:11
相关论文
共 50 条
  • [31] TSMV: TASK-SPECIFIC MULTI-VIEW FEATURE LEARNING
    Zhang, Chengyue
    Han, Yahong
    8TH INTERNATIONAL CONFERENCE ON INTERNET MULTIMEDIA COMPUTING AND SERVICE (ICIMCS2016), 2016, : 39 - 42
  • [32] Reusing the Task-specific Classifier as a Discriminator: Discriminator-free Adversarial Domain Adaptation
    Chen, Lin
    Chen, Huaian
    Wei, Zhixiang
    Jin, Xin
    Tan, Xiao
    Jin, Yi
    Chen, Enhong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7171 - 7180
  • [33] Learning task-specific discriminative representations for multiple object tracking
    Wu, Han
    Nie, Jiahao
    Zhu, Ziming
    He, Zhiwei
    Gao, Mingyu
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (10): : 7761 - 7777
  • [34] Learning Task-Specific Representation for Novel Words in Sequence Labeling
    Peng, Minlong
    Zhang, Qi
    Xing, Xiaoyu
    Gui, Tao
    Fu, Jinlan
    Huang, Xuanjing
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 5146 - 5152
  • [35] Task-specific transfer of perceptual learning across sensory modalities
    McGovern, David P.
    Astle, Andrew T.
    Clavin, Sarah L.
    Newell, Fiona N.
    CURRENT BIOLOGY, 2016, 26 (01) : R20 - R21
  • [36] TSP: Learning Task-Specific Pivots for Unsupervised Domain Adaptation
    Cui, Xia
    Coenen, Frans
    Bollegala, Danushka
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2017, PT II, 2017, 10535 : 754 - 771
  • [37] Graph Few-shot Learning with Task-specific Structures
    Wang, Song
    Chen, Chen
    Li, Jundong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [38] Reactive, Task-specific Object Manipulation by Metric Reinforcement Learning
    Hangl, Simon
    Ugur, Emre
    Szedmak, Sandor
    Piater, Justus
    Ude, Ales
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2015, : 557 - 564
  • [39] Adversarial Data Augmentation for Task-Specific Knowledge Distillation of Pre-trained Transformers
    Zhng, Minjia
    Naresh, Niranjan Uma
    He, Yuxiong
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 11685 - 11693
  • [40] Learning Task-specific Activation Functions using Genetic Programming
    Basirat, Mina
    Roth, Peter M.
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 533 - 540