Multi-modal Learning for Social Image Classification

被引:0
|
作者
Liu, Chunyang [1 ]
Zhang, Xu [1 ]
Li, Xiong [1 ]
Li, Rui [2 ]
Zhang, Xiaoming [3 ]
Chao, Wenhan [3 ]
机构
[1] Natl Comp Network Emergency Response Tech Team Ch, Beijing, Peoples R China
[2] Coordinat Ctr China, Natl Comp Network Emergency Response Tech Team, Beijing, Peoples R China
[3] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
There is growing interest in social image classification because of its importance in web-based image application. Though there are many approaches on image classification, it is a great problem to integrate multi-modal content of social images simultaneously for social image classification, since the textual content and visual content are represented in two heterogeneous feature spaces. In this study, we proposed a multi-modal learning algorithm to fuse the multiple features through their correlation seamlessly. Specifically, we learn two linear classification modules for the two types of feature, and then they are integrated by the l2 normalization via a joint model. With the joint model, the classification based on visual feature can be reinforced by the classification based on textual feature, and vice verse. Then, the test image can be classified based on both the textual feature and visual feature by combing the results of the two classifiers. The evaluate the approach, we conduct some experiments on real-world datasets, and the result shows the superiority of our proposed algorithm against the baselines.
引用
收藏
页码:1174 / 1179
页数:6
相关论文
共 50 条
  • [1] Efficient multi-modal hypergraph learning for social image classification with complex label correlations
    Wang, Leiquan
    Zhao, Zhicheng
    Su, Fei
    [J]. NEUROCOMPUTING, 2016, 171 : 242 - 251
  • [2] Multi-modal kernel ridge regression for social image classification
    Zhang, Xiaoming
    Chao, Wenhan
    Li, Zhoujun
    Liu, Chunyang
    Li, Rui
    [J]. APPLIED SOFT COMPUTING, 2018, 67 : 117 - 125
  • [3] Multi-modal self-paced learning for image classification
    Xu, Wei
    Liu, Wei
    Huang, Xiaolin
    Yang, Jie
    Qiu, Song
    [J]. NEUROCOMPUTING, 2018, 309 : 134 - 144
  • [4] Multi-Modal Curriculum Learning for Semi-Supervised Image Classification
    Gong, Chen
    Tao, Dacheng
    Maybank, Stephen J.
    Liu, Wei
    Kang, Guoliang
    Yang, Jie
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3249 - 3260
  • [5] A Multi-modal SPM Model for Image Classification
    Zheng, Peng
    Zhao, Zhong-Qiu
    Gao, Jun
    [J]. INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 525 - 535
  • [6] Deep learning supported breast cancer classification with multi-modal image fusion
    Hamdy, Eman
    Zaghloul, Mohamed Saad
    Badawy, Osama
    [J]. 2021 22ND INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2021, : 319 - 325
  • [7] Image and Encoded Text Fusion for Multi-Modal Classification
    Gallo, I.
    Calefati, A.
    Nawaz, S.
    Janjua, M. K.
    [J]. 2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018, : 203 - 209
  • [8] Enhancing Image Classification Models with Multi-modal Biomarkers
    Caban, Jesus J.
    Liao, David
    Yao, Jianhua
    Mollura, Daniel J.
    Gochuico, Bernadette
    Yoo, Terry
    [J]. MEDICAL IMAGING 2011: COMPUTER-AIDED DIAGNOSIS, 2011, 7963
  • [9] Multi-modal Broad Learning System for Medical Image and Text-based Classification
    Zhou, Yanhong
    Du, Jie
    Guan, Kai
    Wang, Tianfu
    [J]. 2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 3439 - 3442
  • [10] Classify social image by integrating multi-modal content
    Zhang, Xiaoming
    Zhang, Xu
    Li, Xiong
    Li, Zhoujun
    Wang, Senzhang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (06) : 7469 - 7485