Enhancing Semi-Supervised Learning with Cross-Modal Knowledge

被引:3
|
作者
Zhu, Hui [1 ,2 ,3 ]
Lu, Yongchun [3 ]
Wang, Hongbin [3 ]
Zhou, Xunyi [3 ]
Ma, Qin [4 ]
Liu, Yanhong [3 ]
Jiang, Ning [3 ]
Wei, Xin [3 ]
Zeng, Linchengxi [3 ]
Zhao, Xiaofang [1 ,5 ,6 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Mashang Consumer Finance Co Ltd, Chongqing, Peoples R China
[4] China Agr Univ, Beijing, Peoples R China
[5] Inst Intelligent Comp Technol, Suzhou, Peoples R China
[6] Chinese Acad Sci, Beijing, Peoples R China
关键词
Semi-supervised learning; Cross-modal knowledge; Word embedding; Semantic hierarchy structure; Curriculum learning;
D O I
10.1145/3503161.3548026
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Semi-supervised learning (SSL), which leverages a small number of labeled data that rely on expert knowledge and a large number of easily accessible unlabeled data, has made rapid progress recently. However, the information comes from a single modality and the corresponding labels are in form of one-hot in pre-existing SSL approaches, which can easily lead to deficiency supervision, omission of information and unsatisfactory results, especially when more categories and less labeled samples are covered. In this paper, we propose a novel method to further enhance SSL by introducing semantic modal knowledge, which contains the word embeddings of class labels and the semantic hierarchy structure among classes. The former helps retain more potential information and almost quantitatively reflects the similarities and differences between categories. The later encourages the model to construct the classification edge from simple to complex, and thus improves the generalization ability of the model. Comprehensive experiments and ablation studies are conducted on commonly-used datasets to demonstrate the effectiveness of our method.
引用
下载
收藏
页码:4456 / 4465
页数:10
相关论文
共 50 条
  • [21] Semantic Consistency Cross-Modal Retrieval With Semi-Supervised Graph Regularization
    Xu, Gongwen
    Li, Xiaomei
    Zhang, Zhijun
    IEEE ACCESS, 2020, 8 : 14278 - 14288
  • [22] Semi-supervised cross-modal image generation with generative adversarial networks
    Li, Dan
    Du, Changde
    He, Huiguang
    PATTERN RECOGNITION, 2020, 100
  • [23] Semi-supervised constrained graph convolutional network for cross-modal retrieval
    Zhang, Lei
    Chen, Leiting
    Ou, Weihua
    Zhou, Chuan
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 101
  • [24] Semi-supervised semantic factorization hashing for fast cross-modal retrieval
    Jiale Wang
    Guohui Li
    Peng Pan
    Xiaosong Zhao
    Multimedia Tools and Applications, 2017, 76 : 20197 - 20215
  • [25] Semi-supervised cross-modal common representation learning with vector-valued manifold regularization
    Zhang, Hong
    Wang, Ting
    Dai, Gang
    PATTERN RECOGNITION LETTERS, 2020, 130 : 335 - 344
  • [26] Semi-supervised Multi-modal Emotion Recognition with Cross-Modal Distribution Matching
    Liang, Jingjun
    Li, Ruichen
    Jin, Qin
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2852 - 2861
  • [27] Semi-supervised cross-modal representation learning with GAN-based Asymmetric Transfer Network
    Zhang, Lei
    Chen, Leiting
    Ou, Weihua
    Zhou, Chuan
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 73
  • [28] SEMI-SUPERVISED SEMANTIC-PRESERVING HASHING FOR EFFICIENT CROSS-MODAL RETRIEVAL
    Wang, Xingzhi
    Liu, Xin
    Hu, Zhikai
    Wang, Nannan
    Fan, Wentao
    Du, Ji-Xiang
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1006 - 1011
  • [29] Semi-supervised cross-modal hashing with multi-view graph representation
    Shen, Xiao
    Zhang, Haofeng
    Li, Lunbo
    Yang, Wankou
    Liu, Li
    INFORMATION SCIENCES, 2022, 604 : 45 - 60
  • [30] Semi-supervised Cross-Modal Hashing Based on Label Prediction and Distance Preserving
    Zhang, Xu
    Tian, Xin
    Yang, Bing
    Zhang, Zuyu
    Li, Yan
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 324 - 330