Enhancing Semi-Supervised Learning with Cross-Modal Knowledge

Cited by: 3
Authors
Zhu, Hui [1, 2, 3]
Lu, Yongchun [3]
Wang, Hongbin [3]
Zhou, Xunyi [3]
Ma, Qin [4]
Liu, Yanhong [3]
Jiang, Ning [3]
Wei, Xin [3]
Zeng, Linchengxi [3]
Zhao, Xiaofang [1, 5, 6]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Mashang Consumer Finance Co Ltd, Chongqing, Peoples R China
[4] China Agr Univ, Beijing, Peoples R China
[5] Inst Intelligent Comp Technol, Suzhou, Peoples R China
[6] Chinese Acad Sci, Beijing, Peoples R China
Keywords
Semi-supervised learning; Cross-modal knowledge; Word embedding; Semantic hierarchy structure; Curriculum learning;
DOI
10.1145/3503161.3548026
Chinese Library Classification number
TP39 [Computer applications]
Subject classification code
081203; 0835
Abstract
Semi-supervised learning (SSL), which leverages a small amount of labeled data that relies on expert knowledge and a large amount of easily accessible unlabeled data, has made rapid progress recently. However, in existing SSL approaches the information comes from a single modality and the corresponding labels are one-hot, which can easily lead to deficient supervision, omission of information, and unsatisfactory results, especially when more categories and fewer labeled samples are involved. In this paper, we propose a novel method to further enhance SSL by introducing semantic modal knowledge, which comprises the word embeddings of class labels and the semantic hierarchy structure among classes. The former helps retain more potential information and almost quantitatively reflects the similarities and differences between categories. The latter encourages the model to construct the classification boundary from simple to complex, and thus improves the generalization ability of the model. Comprehensive experiments and ablation studies are conducted on commonly used datasets to demonstrate the effectiveness of our method.
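As a rough illustration of how class-label word embeddings can carry more information than one-hot labels, the sketch below turns cosine similarities between label embeddings into a soft target distribution. It is not the authors' implementation: the three-dimensional embeddings, the `soft_label` helper, and the `temperature` value are all hypothetical choices made for this example.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def soft_label(true_idx, embeddings, temperature=0.1):
    """Softmax over similarities between the true class and every class.

    Hypothetical helper: a one-hot label would put all mass on true_idx,
    whereas this spreads some mass onto semantically close classes.
    """
    sims = [cosine(embeddings[true_idx], e) for e in embeddings]
    exps = [math.exp(s / temperature) for s in sims]
    total = sum(exps)
    return [x / total for x in exps]

# Toy label embeddings (assumed values, not taken from a real embedding model).
class_names = ["cat", "dog", "truck"]
embeddings = [
    [0.9, 0.1, 0.0],  # cat
    [0.8, 0.3, 0.0],  # dog: close to cat
    [0.0, 0.1, 0.9],  # truck: far from both
]

target = soft_label(0, embeddings)
for name, p in zip(class_names, target):
    print(f"{name}: {p:.4f}")
```

With these toy vectors the target for "cat" keeps most of its mass on "cat", assigns a noticeable share to "dog", and leaves almost none for "truck", which is the kind of inter-class similarity signal a one-hot label discards.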
Pages: 4456-4465
Page count: 10