Exploiting the Relationship Between Visual and Textual Features in Social Networks for Image Classification with Zero-Shot Deep Learning

Cited by: 4
Authors
Lucas, Luis [1]
Tomas, David [1]
Garcia-Rodriguez, Jose [1]
Affiliations
[1] Univ Alicante, Inst Informat Res, Alicante, Spain
Keywords
Multimodal classification; CLIP; Zero-shot classification; Unsupervised machine learning; Social media
DOI
10.1007/978-3-030-87869-6_35
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
One of the main issues in unsupervised machine learning is the cost of processing and extracting useful information from large datasets. In this work, we propose a classifier ensemble based on the transferable learning capabilities of the CLIP neural network architecture in multimodal environments (image and text) from social media. For this purpose, we used the InstaNY100K dataset and proposed a validation approach based on sampling techniques. Our experiments, which classify images according to the labels of the Places dataset, first consider only the visual part and then add the associated texts as support. The results show that trained neural networks such as CLIP can be successfully applied to image classification with little fine-tuning, and that considering the texts associated with the images can help improve accuracy, depending on the goal. These results point to what seems to be a promising research direction.
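The abstract does not say how the visual and textual predictions are combined, so the following is only a minimal sketch of the general idea, not the authors' ensemble. It assumes that an image, its caption, and each candidate label have already been mapped into a shared embedding space (as a CLIP-style model would do). The toy 3-dimensional vectors, the `temperature` value, the weighted-average rule, and the `text_weight` parameter are all hypothetical choices made for illustration.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def softmax(scores, temperature=0.1):
    """Turn similarity scores into probabilities (temperature is an assumed value)."""
    scaled = [s / temperature for s in scores]
    top = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def zero_shot_probs(query_emb, label_embs):
    """Zero-shot scoring: compare one embedding against every label embedding."""
    return softmax([cosine(query_emb, e) for e in label_embs])

def classify(image_emb, label_embs, labels, caption_emb=None, text_weight=0.5):
    """Classify from the image alone, or blend in the caption as support.

    The blend is a simple weighted average of the two probability vectors;
    this combination rule is an assumption, not taken from the paper.
    """
    probs = zero_shot_probs(image_emb, label_embs)
    if caption_emb is not None:
        text_probs = zero_shot_probs(caption_emb, label_embs)
        probs = [(1 - text_weight) * p + text_weight * t
                 for p, t in zip(probs, text_probs)]
    best = max(range(len(labels)), key=lambda i: probs[i])
    return labels[best], probs

# Hypothetical toy embeddings standing in for real model outputs.
labels = ["beach", "restaurant", "street"]
label_embs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
image_emb = [0.6, 0.5, 0.1]    # visually ambiguous, leaning towards "beach"
caption_emb = [0.1, 0.9, 0.1]  # caption strongly suggests "restaurant"

visual_only, visual_probs = classify(image_emb, label_embs, labels)
with_text, blended_probs = classify(image_emb, label_embs, labels, caption_emb)
print(visual_only, with_text)
```

In this toy setup the image alone is ambiguous, and the caption shifts the prediction to a different label. Whether the associated text helps or hurts on real data depends on the goal, as the abstract notes.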
Pages: 369-378
Page count: 10