Multi-modal kernel ridge regression for social image classification

被引:12
|
作者
Zhang, Xiaoming [1 ]
Chao, Wenhan [2 ]
Li, Zhoujun [3 ]
Liu, Chunyang [4 ]
Li, Rui [4 ]
机构
[1] Beihang Univ, Sch Cyber Sci & Technol, Beijing, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, SKLSDE, Beijing 100191, Peoples R China
[3] Beihang Univ, Beijing Key Lab Network Technol, Beijing, Peoples R China
[4] Natl Comp Network Emergency Response Tech Team Co, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Image classification; Multi-modal learning; Kernel ridge regression; Feature fusion;
D O I
10.1016/j.asoc.2018.02.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is growing interest in social image classification because of its importance in web-based image application. Though there are many approaches on image classification, it is still a great problem to integrate multi-modal contents of social images simultaneously for classification, since the textual content and visual content are represented in two heterogeneous feature spaces. In this study, a multi-modal learning algorithm is proposed to fuse the multiple features through their correlation seamlessly. Specifically, two classification modules based on the kernel ridge regression (KRR) are learned for the two types of features, and they are integrated via a joint model. With the joint model, the classification based on visual features can be reinforced by the classification based on textual features, and vice verse. Then, an efficient optimization method is proposed to resolving the object function. The query image can be classified based on both of the textual features and visual features by combing the results of the two classifiers. Two methods are proposed to combine the classification results to obtain the final result. To evaluate the approach, extensive experiments are conducted on the real-world datasets, and the result demonstrates the superiority of our approach. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:117 / 125
页数:9
相关论文
共 50 条
  • [31] Multi-modal, Multi-task and Multi-label for Music Genre Classification and Emotion Regression
    Pandeya, Yagya Raj
    You, Jie
    Bhattarai, Bhuwan
    Lee, Joonwhoan
    [J]. 12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1042 - 1045
  • [32] Multi-Modal Pedestrian Detection with Large Misalignment Based on Modal-Wise Regression and Multi-Modal IoU
    Wanchaitanawong, Napat
    Tanaka, Masayuki
    Shibata, Takashi
    Okutomi, Masatoshi
    [J]. PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [33] Semantically Multi-modal Image Synthesis
    Zhu, Zhen
    Xu, Zhiliang
    You, Ansheng
    Bai, Xiang
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5466 - 5475
  • [34] Multi-modal component subspace-similarity-based multi-kernel SVM for schizophrenia classification
    Gao, Shuang
    Calhoun, Vince D.
    Sui, Jing
    [J]. MEDICAL IMAGING 2020: COMPUTER-AIDED DIAGNOSIS, 2020, 11314
  • [35] Multi-modal semantic image segmentation
    Pemasiri, Akila
    Kien Nguyen
    Sridharan, Sridha
    Fookes, Clinton
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2021, 202
  • [36] Multi-modal multi-layered topic classification model for social event analysis
    Y. H. Chen
    C. Y. Yin
    Y. J. Lin
    W. L. Zuo
    [J]. Multimedia Tools and Applications, 2018, 77 : 23291 - 23315
  • [37] Multi-modal multi-layered topic classification model for social event analysis
    Chen, Y. H.
    Yin, C. Y.
    Lin, Y. J.
    Zuo, W. L.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (18) : 23291 - 23315
  • [38] MEDICAL IMAGE DENOISING USING KERNEL RIDGE REGRESSION
    Dinh Hoan Trinh
    Luong, Marie
    Rocchisani, Jean-Marie
    Canh Duong Pham
    Dibos, Francoise
    [J]. 2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011, : 1597 - 1600
  • [39] MMST: A Multi-Modal Ground-Based Cloud Image Classification Method
    Wei, Liang
    Zhu, Tingting
    Guo, Yiren
    Ni, Chao
    [J]. SENSORS, 2023, 23 (09)
  • [40] Deep learning supported breast cancer classification with multi-modal image fusion
    Hamdy, Eman
    Zaghloul, Mohamed Saad
    Badawy, Osama
    [J]. 2021 22ND INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2021, : 319 - 325