Short text classification based on feature extension using information in images

Authors
Zhao S. [1, 2]
Jiang Q. [1]
Affiliations
[1] College of Electronic and Information Engineering, Tongji University, Shanghai
[2] School of Software Engineering, Tongji University, Shanghai
Keywords
Feature extension; Image caption; Sentence similarity; Short text classification
DOI
10.23940/ijpe.19.02.p31.667675
Abstract
With the rapid development and widespread use of the Internet, people increasingly share their lives and opinions on social networks, which produces a large volume of short texts. Short texts are brief, have sparse features, and lack contextual information, so conventional methods struggle to classify them accurately. To improve classification accuracy, this paper proposes a novel short text classification method based on feature extension that incorporates information from images. Specifically, we first generate a sentence that describes the image using image captioning, and then combine the generated sentence with the text as the input to the classifier. We also introduce a similarity module that measures the correlation between the image and the short text to decide whether the two sentences should be combined. Simulation results show that the proposed model significantly outperforms state-of-the-art methods in classification accuracy. © 2019 Totem Publisher, Inc. All rights reserved.
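The gating step described in the abstract (combine the generated caption with the short text only when the two are sufficiently related) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' implementation: the caption is taken as an already-generated string, the similarity measure is a plain bag-of-words cosine standing in for the paper's similarity module, and the names `cosine_similarity`, `extend_short_text`, and the `threshold` value of 0.2 are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Bag-of-words cosine similarity (an assumed stand-in for the
    paper's similarity module, which the abstract does not specify)."""
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    norm = norm_a * norm_b
    return dot / norm if norm else 0.0


def extend_short_text(short_text, caption, threshold=0.2):
    """Append the image caption to the short text only if the two are
    similar enough; otherwise return the short text unchanged.
    The threshold is a hypothetical value chosen for illustration."""
    if cosine_similarity(short_text, caption) >= threshold:
        return f"{short_text} {caption}"
    return short_text


if __name__ == "__main__":
    # Related caption: the extended text is passed on to the classifier.
    print(extend_short_text("my dog at the beach",
                            "a dog running on the beach"))
    # Unrelated caption: the short text is used on its own.
    print(extend_short_text("great match last night",
                            "a plate of food on a table"))
```

In a full pipeline, the string returned by `extend_short_text` would be the classifier input, and the caption would come from an image captioning model rather than being supplied by hand.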
Pages: 667-675
Number of pages: 8