Enhancing Semi-Supervised Learning with Cross-Modal Knowledge

被引:3
|
作者
Zhu, Hui [1 ,2 ,3 ]
Lu, Yongchun [3 ]
Wang, Hongbin [3 ]
Zhou, Xunyi [3 ]
Ma, Qin [4 ]
Liu, Yanhong [3 ]
Jiang, Ning [3 ]
Wei, Xin [3 ]
Zeng, Linchengxi [3 ]
Zhao, Xiaofang [1 ,5 ,6 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Mashang Consumer Finance Co Ltd, Chongqing, Peoples R China
[4] China Agr Univ, Beijing, Peoples R China
[5] Inst Intelligent Comp Technol, Suzhou, Peoples R China
[6] Chinese Acad Sci, Beijing, Peoples R China
关键词
Semi-supervised learning; Cross-modal knowledge; Word embedding; Semantic hierarchy structure; Curriculum learning;
D O I
10.1145/3503161.3548026
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Semi-supervised learning (SSL), which leverages a small number of labeled data that rely on expert knowledge and a large number of easily accessible unlabeled data, has made rapid progress recently. However, the information comes from a single modality and the corresponding labels are in form of one-hot in pre-existing SSL approaches, which can easily lead to deficiency supervision, omission of information and unsatisfactory results, especially when more categories and less labeled samples are covered. In this paper, we propose a novel method to further enhance SSL by introducing semantic modal knowledge, which contains the word embeddings of class labels and the semantic hierarchy structure among classes. The former helps retain more potential information and almost quantitatively reflects the similarities and differences between categories. The later encourages the model to construct the classification edge from simple to complex, and thus improves the generalization ability of the model. Comprehensive experiments and ablation studies are conducted on commonly-used datasets to demonstrate the effectiveness of our method.
引用
收藏
页码:4456 / 4465
页数:10
相关论文
共 50 条
  • [1] Semi-Supervised Knowledge Distillation for Cross-Modal Hashing
    Su, Mingyue
    Gu, Guanghua
    Ren, Xianlong
    Fu, Hao
    Zhao, Yao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 662 - 675
  • [2] Semi-supervised cross-modal learning for cross modal retrieval and image annotation
    Fuhao Zou
    Xingqiang Bai
    Chaoyang Luan
    Kai Li
    Yunfei Wang
    Hefei Ling
    [J]. World Wide Web, 2019, 22 : 825 - 841
  • [3] Semi-supervised cross-modal learning for cross modal retrieval and image annotation
    Zou, Fuhao
    Bai, Xingqiang
    Luan, Chaoyang
    Li, Kai
    Wang, Yunfei
    Ling, Hefei
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (02): : 825 - 841
  • [4] A semi-supervised cross-modal memory bank for cross-modal retrieval
    Huang, Yingying
    Hu, Bingliang
    Zhang, Yipeng
    Gao, Chi
    Wang, Quan
    [J]. NEUROCOMPUTING, 2024, 579
  • [5] Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval
    Zhang, Liang
    Ma, Bingpeng
    He, Jianfeng
    Li, Guorong
    Huang, Qingming
    Tian, Qi
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3406 - 3412
  • [6] Combining cross-modal knowledge transfer and semi-supervised learning for speech emotion recognition
    Zhang, Sheng
    Chen, Min
    Chen, Jincai
    Li, Yuan-Fang
    Wu, Yiling
    Li, Minglei
    Zhu, Chuanbo
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [7] Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval
    Zhang, Liang
    Ma, Bingpeng
    Li, Guorong
    Huang, Qingming
    Tian, Qi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (01) : 128 - 141
  • [8] Semi-Supervised Semi-Paired Cross-Modal Hashing
    Zhang, Xuening
    Liu, Xingbo
    Nie, Xiushan
    Kang, Xiao
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 6517 - 6529
  • [9] Semi-Supervised Cross-Modal Retrieval With Label Prediction
    Mandal, Devraj
    Rao, Pramod
    Biswas, Soma
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (09) : 2345 - 2353
  • [10] Semi-supervised Deep Quantization for Cross-modal Search
    Wang, Xin
    Zhu, Wenwu
    Liu, Chenghao
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1730 - 1739