Impact of convolutional neural network and FastText embedding on text classification

被引:44
|
作者
Umer, Muhammad [1 ]
Imtiaz, Zainab [2 ]
Ahmad, Muhammad [3 ]
Nappi, Michele [4 ]
Medaglia, Carlo [5 ]
Choi, Gyu Sang [6 ]
Mehmood, Arif [1 ]
机构
[1] Islamia Univ Bahawalpur, Dept Comp Sci & Informat Technol, Bahawalpur 63100, Pakistan
[2] Khwaja Fareed Univ Engn & Informat Technol KFUEIT, Dept Comp Sci, Rahim Yar Khan, Pakistan
[3] Khwaja Fareed Univ Engn & Informat Technol KFUEIT, Dept Comp Engn, Rahim Yar Khan, Pakistan
[4] Univ Salerno, Dept Comp Sci, Fisciano, Italy
[5] Link Campus Univ Rome, Res Dept, Via Casale San Pio V 44, I-00165 Rome, Italy
[6] Yeungnam Univ, Dept Informat & Commun Engn, Gyongsan 38541, South Korea
基金
新加坡国家研究基金会;
关键词
Convolutional Neural Network (CNN); FastText; Text mining; Deep learning; Natural language processing; SENTIMENT;
D O I
10.1007/s11042-022-13459-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient word representation techniques (word embeddings) with modern machine learning models have shown reasonable improvement on automatic text classification tasks. However, the effectiveness of such techniques has not been evaluated yet in terms of insufficient word vector representation for training. Convolutional Neural Network has achieved significant results in pattern recognition, image analysis, and text classification. This study investigates the application of the CNN model on text classification problems by experimentation and analysis. We trained our classification model with a prominent word embedding generation model, Fast Text on publically available datasets, six benchmark datasets including Ag News, Amazon Full and Polarity, Yahoo Question Answer, Yelp Full, and Polarity. Furthermore, the proposed model has been tested on the Twitter US airlines non-benchmark dataset as well. The analysis indicates that using Fast Text as word embedding is a very promising approach.
引用
收藏
页码:5569 / 5585
页数:17
相关论文
共 50 条
  • [21] Heterogeneous graph convolutional neural network for short text classification
    Huang B.
    Li P.
    Fang Z.
    Lei L.
    Wang C.
    International Journal of Intelligent Systems Technologies and Applications, 2023, 21 (04) : 344 - 365
  • [22] Locality and Sparsity Preserving Embedding Convolutional Neural Network for Image Classification
    Xia, Yu
    Zhan, Yongzhao
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 447 - 458
  • [23] Deep Topological Embedding with Convolutional Neural Networks for Complex Network Classification
    Scabini, Leonardo
    Ribas, Lucas
    Ribeiro, Eraldo
    Bruno, Odemir
    NETWORK SCIENCE (NETSCI-X 2022), 2022, 13197 : 54 - 66
  • [24] Triplet Embedding Convolutional Recurrent Neural Network for Long Text Semantic Analysis
    Liu, Jingxuan
    Zhu, Ming
    Ouyang, Huajiang
    Sun, Guozi
    Li, Huakang
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2022, 2022, 13724 : 607 - 615
  • [25] Text Classification Model Based on fastText
    Yao, Tengjun
    Zhai, Zhengang
    Gao, Bingtao
    PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020, : 154 - 157
  • [26] Deep learning classification of biomedical text using convolutional neural network
    Dollah R.
    Sheng C.Y.
    Zakaria N.
    Othman M.S.
    Rasib A.W.
    International Journal of Advanced Computer Science and Applications, 2019, 10 (08): : 512 - 517
  • [27] Arabic Text Classification Using Convolutional Neural Network and Genetic Algorithms
    Alsaleh, Deem
    Larabi-Marie-Sainte, Souad
    IEEE ACCESS, 2021, 9 (09): : 91670 - 91685
  • [28] Deep Learning Classification of Biomedical Text using Convolutional Neural Network
    Dollah, Rozilawati
    Sheng, Chew Yi
    Zakaria, Norhawaniah
    Othman, Mohd Shahizan
    Rasib, Abd Wahid
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (08) : 512 - 517
  • [29] Text Feature Extraction and Classification Based on Convolutional Neural Network (CNN)
    Zhang, Taohong
    Li, Cunfang
    Cao, Nuan
    Ma, Rui
    Zhang, ShaoHua
    Ma, Nan
    DATA SCIENCE, PT 1, 2017, 727 : 472 - 485
  • [30] A Combined-Convolutional Neural Network for Chinese News Text Classification
    Zhang Y.
    Liu K.-F.
    Zhang Q.-X.
    Wang Y.-G.
    Gao K.-L.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (06): : 1059 - 1067