Impact of convolutional neural network and FastText embedding on text classification

被引:44
|
作者
Umer, Muhammad [1 ]
Imtiaz, Zainab [2 ]
Ahmad, Muhammad [3 ]
Nappi, Michele [4 ]
Medaglia, Carlo [5 ]
Choi, Gyu Sang [6 ]
Mehmood, Arif [1 ]
机构
[1] Islamia Univ Bahawalpur, Dept Comp Sci & Informat Technol, Bahawalpur 63100, Pakistan
[2] Khwaja Fareed Univ Engn & Informat Technol KFUEIT, Dept Comp Sci, Rahim Yar Khan, Pakistan
[3] Khwaja Fareed Univ Engn & Informat Technol KFUEIT, Dept Comp Engn, Rahim Yar Khan, Pakistan
[4] Univ Salerno, Dept Comp Sci, Fisciano, Italy
[5] Link Campus Univ Rome, Res Dept, Via Casale San Pio V 44, I-00165 Rome, Italy
[6] Yeungnam Univ, Dept Informat & Commun Engn, Gyongsan 38541, South Korea
基金
新加坡国家研究基金会;
关键词
Convolutional Neural Network (CNN); FastText; Text mining; Deep learning; Natural language processing; SENTIMENT;
D O I
10.1007/s11042-022-13459-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient word representation techniques (word embeddings) with modern machine learning models have shown reasonable improvement on automatic text classification tasks. However, the effectiveness of such techniques has not been evaluated yet in terms of insufficient word vector representation for training. Convolutional Neural Network has achieved significant results in pattern recognition, image analysis, and text classification. This study investigates the application of the CNN model on text classification problems by experimentation and analysis. We trained our classification model with a prominent word embedding generation model, Fast Text on publically available datasets, six benchmark datasets including Ag News, Amazon Full and Polarity, Yahoo Question Answer, Yelp Full, and Polarity. Furthermore, the proposed model has been tested on the Twitter US airlines non-benchmark dataset as well. The analysis indicates that using Fast Text as word embedding is a very promising approach.
引用
收藏
页码:5569 / 5585
页数:17
相关论文
共 50 条
  • [1] Impact of convolutional neural network and FastText embedding on text classification
    Muhammad Umer
    Zainab Imtiaz
    Muhammad Ahmad
    Michele Nappi
    Carlo Medaglia
    Gyu Sang Choi
    Arif Mehmood
    Multimedia Tools and Applications, 2023, 82 : 5569 - 5585
  • [2] Convolutional Neural Network with Contextualized Word Embedding for Text Classification
    Fan, Gaoyang
    Zhu, Cui
    Zhu, Wenjun
    2019 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2019, 11321
  • [3] Text classification problems via BERT embedding method and graph convolutional neural network
    Loc Tran
    Lam Pham
    Tuan Tran
    An Mai
    2021 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC 2021), 2021, : 260 - 264
  • [4] Transformable Convolutional Neural Network for Text Classification
    Xiao, Liqiang
    Zhang, Honglun
    Chen, Wenqing
    Wang, Yongkun
    Jin, Yaohui
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4496 - 4502
  • [5] Fault Text Classification Based on Convolutional Neural Network
    Wang, Lixia
    Zhang, Botao
    2020 IEEE 7TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND APPLICATIONS (ICIEA 2020), 2020, : 937 - 941
  • [6] Application of Improved Convolutional Neural Network in Text Classification
    Ronghui, Liu
    Xinhong, Wei
    IAENG International Journal of Computer Science, 2022, 49 (03)
  • [7] Application of Convexified Convolutional Neural Network in Text Classification
    Bian, Yuanchong
    Li, Chang
    Wang, Bincheng
    Zhang, Xingjian
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS AND COMPUTER ENGINEERING (ICCECE), 2021, : 296 - 300
  • [8] Semantic expansion using word embedding clustering and convolutional neural network for improving short text classification
    Wang, Peng
    Xu, Bo
    Xu, Jiaming
    Tian, Guanhua
    Liu, Cheng-Lin
    Hao, Hongwei
    NEUROCOMPUTING, 2016, 174 : 806 - 814
  • [9] Text Classification with Topic-based Word Embedding and Convolutional Neural Networks
    Xu, Haotian
    Dong, Ming
    Zhu, Dongxiao
    Kotov, Alexander
    Carcone, April Idalski
    Naar-King, Sylvie
    PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2016, : 88 - 97
  • [10] Dynamic Embedding Projection-Gated Convolutional Neural Networks for Text Classification
    Tan, Zhipeng
    Chen, Jing
    Kang, Qi
    Zhou, MengChu
    Abusorrah, Abdullah
    Sedraoui, Khaled
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (03) : 973 - 982