Multi-modal kernel ridge regression for social image classification

被引:12
|
作者
Zhang, Xiaoming [1 ]
Chao, Wenhan [2 ]
Li, Zhoujun [3 ]
Liu, Chunyang [4 ]
Li, Rui [4 ]
机构
[1] Beihang Univ, Sch Cyber Sci & Technol, Beijing, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, SKLSDE, Beijing 100191, Peoples R China
[3] Beihang Univ, Beijing Key Lab Network Technol, Beijing, Peoples R China
[4] Natl Comp Network Emergency Response Tech Team Co, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Image classification; Multi-modal learning; Kernel ridge regression; Feature fusion;
D O I
10.1016/j.asoc.2018.02.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is growing interest in social image classification because of its importance in web-based image application. Though there are many approaches on image classification, it is still a great problem to integrate multi-modal contents of social images simultaneously for classification, since the textual content and visual content are represented in two heterogeneous feature spaces. In this study, a multi-modal learning algorithm is proposed to fuse the multiple features through their correlation seamlessly. Specifically, two classification modules based on the kernel ridge regression (KRR) are learned for the two types of features, and they are integrated via a joint model. With the joint model, the classification based on visual features can be reinforced by the classification based on textual features, and vice verse. Then, an efficient optimization method is proposed to resolving the object function. The query image can be classified based on both of the textual features and visual features by combing the results of the two classifiers. Two methods are proposed to combine the classification results to obtain the final result. To evaluate the approach, extensive experiments are conducted on the real-world datasets, and the result demonstrates the superiority of our approach. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:117 / 125
页数:9
相关论文
共 50 条
  • [41] Multi-Modal Retinal Image Classification With Modality-Specific Attention Network
    He, Xingxin
    Deng, Ying
    Fang, Leyuan
    Peng, Qinghua
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (06) : 1591 - 1602
  • [42] Densely Convolutional Networks for Breast Cancer Classification with Multi-Modal Image Fusion
    Hamdy, Eman
    Badawy, Osama
    Zaghloul, Mohamed
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2022, 19 (3A) : 463 - 469
  • [43] Multi-modal Remote Sensing Image Classification for Low Sample Size Data
    He, Qi
    Lee, Yao
    Huang, Dongmei
    He, Shengqi
    Song, Wei
    Du, Yanling
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [44] An Unsupervised Domain Adaptation Method for Multi-Modal Remote Sensing Image Classification
    Liu, Wei
    Qin, Rongjun
    Su, Fulin
    Hu, Kun
    [J]. 2018 26TH INTERNATIONAL CONFERENCE ON GEOINFORMATICS (GEOINFORMATICS 2018), 2018,
  • [45] PolSAR Image Classification Based on Multi-Modal Contrastive Fully Convolutional Network
    Hua, Wenqiang
    Wang, Yi
    Yang, Sijia
    Jin, Xiaomin
    [J]. REMOTE SENSING, 2024, 16 (02)
  • [46] Multiple Kernel Learning Based Classification of Parkinson's Disease With Multi-Modal Transcranial Sonography
    Shi, Jun
    Yan, Minjun
    Dong, Yun
    Zheng, Xiao
    Zhang, Qi
    An, Hedi
    [J]. 2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 61 - 64
  • [47] Supervised kernel-based multi-modal Bhattacharya distance learning for imbalanced data classification
    Mojahed, Atena Jalali
    Moattar, Mohammad Hossein
    Ghaffari, Hamidreza
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2024,
  • [48] Boosted Multi-Modal Supervised Latent Dirichlet Allocation for Social Event Classification
    Qian, Shengsheng
    Zhang, Tianzhu
    Xu, Changsheng
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1999 - 2004
  • [49] Cross-Modal Retrieval Augmentation for Multi-Modal Classification
    Gur, Shir
    Neverova, Natalia
    Stauffer, Chris
    Lim, Ser-Nam
    Kiela, Douwe
    Reiter, Austin
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 111 - 123
  • [50] Sentiment Classification Algorithm Based on Multi-Modal Social Media Text Information
    Xuanyuan, Minzheng
    Xiao, Le
    Duan, Mengshi
    [J]. IEEE ACCESS, 2021, 9 : 33410 - 33418