Learning Semantic Text Features for Web Text-Aided Image Classification

被引:12
|
作者
Wang, Dongzhe [1 ]
Mao, Kezhi [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
Semantic matching neural network; semantic filter; image classification; text representation; web image search; LOW-RANK; SELECTION; SCENE;
D O I
10.1109/TMM.2019.2920620
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The good generalization performance of conventional pattern classifiers often relies on the size of training data labeled by costly human labor. These days, publicly available web resources grow explosively, and this allows us to easily obtain abundant and cheap web data. Yet, web data are usually not as cooperative as human labeled data. In this paper, we explore the use of web text data to aid image classification. Without requiring the previous collection of auxiliary data from the web, we directly retrieve the web text information with the aid of the powerful reverse image search engine. We develop a novel textual modeling method named semantic matching neural network (SMNN) that is capable of learning semantic features from the associated text of web images. The SMNN text features have improved reliability and applicability, compared to the text features obtained from other methods. The SMNN text features and convolutional neural network (CNN) visual features are merged into a shared representation, which learns to capture the correlations between the two modalities. Experimental results on benchmark UIUC-Sports, Scene-15, Caltech-256, and Pascal VOC-2012 data sets show that the visual and text modalities of data from different sources are remarkably complementary and the fusion of them achieves substantial performance improvement.
引用
收藏
页码:2985 / 2996
页数:12
相关论文
共 50 条
  • [41] Learning semantic alignment from image for text-guided image inpainting
    Xie, Yucheng
    Lin, Zehang
    Yang, Zhenguo
    Deng, Huan
    Wu, Xingcai
    Mao, Xudong
    Li, Qing
    Liu, Wenyin
    VISUAL COMPUTER, 2022, 38 (9-10): : 3149 - 3161
  • [42] Extracting Features from Text Flows based on Semantic Similarity for Text Classification: an Approach Inspired by Audio Analysis
    Vasconcelos, Larissa Lucena
    Campelo, Claudio E. C.
    Journal of the Brazilian Computer Society, 2024, 30 (01) : 297 - 314
  • [43] Learning transferable features in meta -learning for few -shot text classification
    Xu, Jincheng
    Du, Qingfeng
    PATTERN RECOGNITION LETTERS, 2020, 135 (135) : 271 - 278
  • [44] Web Image Re-Ranking by Utilizing Text and Visual Features
    Bajpai, Aruna
    2014 CONFERENCE ON IT IN BUSINESS, INDUSTRY AND GOVERNMENT (CSIBIG), 2014,
  • [45] Data Augmentation With Semantic Enrichment for Deep Learning Invoice Text Classification
    Chi, Wei Wen
    Tang, Tiong Yew
    Salleh, Narishah Mohamed
    Mukred, Muaadh
    Alsalman, Hussain
    Zohaib, Muhammad
    IEEE ACCESS, 2024, 12 : 57326 - 57344
  • [46] Learning Refined Features for Open-World Text Classification
    Li, Zeting
    Cai, Yi
    Tan, Xingwei
    Han, Guoqiang
    Ren, Haopeng
    Wu, Xin
    Li, Wen
    WEB AND BIG DATA, APWEB-WAIM 2021, PT I, 2021, 12858 : 367 - 381
  • [47] Semantic concept space based progressive transductive learning for text classification
    Zhang, Xiaobin
    Yin, Yingshun
    Gao, Lili
    Zheng, Jing
    Niu, Yanzhan
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 324 - 328
  • [48] Semantic Interactive Learning for Text Classification: A Constructive Approach for Contextual Interactions
    Kiefer, Sebastian
    Hoffmann, Mareike
    Schmid, Ute
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2022, 4 (04): : 994 - 1010
  • [49] Semantic Text Encoding for Text Classification using Convolutional Neural Networks
    Gallo, Ignazio
    Nawaz, Shah
    Calefati, Alessandro
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2017), VOL 5, 2017, : 16 - 21
  • [50] Learning to Embed Semantic Similarity for Joint Image-Text Retrieval
    Malali, Noam
    Keller, Yosi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 10252 - 10260