Deep Visual Semantic Embedding with Text Data Augmentation and Word Embedding Initialization

被引:4
|
作者
He, Hai [1 ]
Yang, Haibo [2 ]
机构
[1] Chongqing City Management Coll, Sch Big Data & Informat Ind, Chongqing 401331, Peoples R China
[2] Chongqing Med Univ, Informat Ctr, Chongqing 400016, Peoples R China
关键词
SENTIMENT CLASSIFICATION;
D O I
10.1155/2021/6654071
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Language and vision are the two most essential parts of human intelligence for interpreting the real world around us. How to make connections between language and vision is the key point in current research. Multimodality methods like visual semantic embedding have been widely studied recently, which unify images and corresponding texts into the same feature space. Inspired by the recent development of text data augmentation and a simple but powerful technique proposed called EDA (easy data augmentation), we can expand the information with given data using EDA to improve the performance of models. In this paper, we take advantage of the text data augmentation technique and word embedding initialization for multimodality retrieval. We utilize EDA for text data augmentation, word embedding initialization for text encoder based on recurrent neural networks, and minimizing the gap between the two spaces by triplet ranking loss with hard negative mining. On two Flickr-based datasets, we achieve the same recall with only 60% of the training dataset as the normal training with full available data. Experiment results show the improvement of our proposed model; and, on all datasets in this paper (Flickr8k, Flickr30k, and MS-COCO), our model performs better on image annotation and image retrieval tasks; the experiments also demonstrate that text data augmentation is more suitable for smaller datasets, while word embedding initialization is suitable for larger ones.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Analysing the Semantic Change Based on Word Embedding
    Liao, Xuanyi
    Cheng, Guang
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 213 - 223
  • [22] Impact of word embedding models on text analytics in deep learning environment: a review
    Deepak Suresh Asudani
    Naresh Kumar Nagwani
    Pradeep Singh
    Artificial Intelligence Review, 2023, 56 : 10345 - 10425
  • [23] Impact of word embedding models on text analytics in deep learning environment: a review
    Asudani, Deepak Suresh
    Nagwani, Naresh Kumar
    Singh, Pradeep
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (09) : 10345 - 10425
  • [24] The Evaluation of Word Embedding Models and Deep Learning Algorithms for Turkish Text Classification
    Kilimci, Zeynep Hilal
    Akyokus, Selim
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 548 - 553
  • [25] In Defense of Word Embedding for Generic Text Representation
    Lev, Guy
    Klein, Benjamin
    Wolf, Lior
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2015, 2015, 9103 : 35 - 50
  • [26] Text document summarization using word embedding
    Mohd, Mudasir
    Jan, Rafiya
    Shah, Muzaffar
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 143 (143)
  • [27] A Weighted Word Embedding Model for Text Classification
    Ren, Haopeng
    Zeng, ZeQuan
    Cai, Yi
    Du, Qing
    Li, Qing
    Xie, Haoran
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 : 419 - 434
  • [28] Jointly smoothing word embedding and text representation
    Najar, Fatma
    Bouguila, Nizar
    2021 IEEE 22ND INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2021), 2021, : 282 - 289
  • [29] Study on the Chinese Word Semantic Relation Classification with Word Embedding
    Shijia, E.
    Jia, Shengbin
    Xiang, Yang
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 849 - 855
  • [30] Multi-Task Visual Semantic Embedding Network for Image-Text Retrieval
    Qin, Xue-Yang
    Li, Li-Shuang
    Tang, Jing-Yao
    Hao, Fei
    Ge, Mei-Ling
    Pang, Guang-Yao
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2024, 39 (04) : 811 - 826