Impact of convolutional neural network and FastText embedding on text classification

被引:44
|
作者
Umer, Muhammad [1 ]
Imtiaz, Zainab [2 ]
Ahmad, Muhammad [3 ]
Nappi, Michele [4 ]
Medaglia, Carlo [5 ]
Choi, Gyu Sang [6 ]
Mehmood, Arif [1 ]
机构
[1] Islamia Univ Bahawalpur, Dept Comp Sci & Informat Technol, Bahawalpur 63100, Pakistan
[2] Khwaja Fareed Univ Engn & Informat Technol KFUEIT, Dept Comp Sci, Rahim Yar Khan, Pakistan
[3] Khwaja Fareed Univ Engn & Informat Technol KFUEIT, Dept Comp Engn, Rahim Yar Khan, Pakistan
[4] Univ Salerno, Dept Comp Sci, Fisciano, Italy
[5] Link Campus Univ Rome, Res Dept, Via Casale San Pio V 44, I-00165 Rome, Italy
[6] Yeungnam Univ, Dept Informat & Commun Engn, Gyongsan 38541, South Korea
基金
新加坡国家研究基金会;
关键词
Convolutional Neural Network (CNN); FastText; Text mining; Deep learning; Natural language processing; SENTIMENT;
D O I
10.1007/s11042-022-13459-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Efficient word representation techniques (word embeddings) with modern machine learning models have shown reasonable improvement on automatic text classification tasks. However, the effectiveness of such techniques has not been evaluated yet in terms of insufficient word vector representation for training. Convolutional Neural Network has achieved significant results in pattern recognition, image analysis, and text classification. This study investigates the application of the CNN model on text classification problems by experimentation and analysis. We trained our classification model with a prominent word embedding generation model, Fast Text on publically available datasets, six benchmark datasets including Ag News, Amazon Full and Polarity, Yahoo Question Answer, Yelp Full, and Polarity. Furthermore, the proposed model has been tested on the Twitter US airlines non-benchmark dataset as well. The analysis indicates that using Fast Text as word embedding is a very promising approach.
引用
收藏
页码:5569 / 5585
页数:17
相关论文
共 50 条
  • [31] A morpheme sequence and convolutional neural network based Kazakh text classification
    Parhat, Sardar
    Ting, Gao
    Ablimit, Mijit
    Hamdulla, Askar
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1903 - 1906
  • [32] Variable Convolution and Pooling Convolutional Neural Network for Text Sentiment Classification
    Dong M.
    Li Y.
    Tang X.
    Xu J.
    Bi S.
    Cai Y.
    IEEE Access, 2020, 8 : 16174 - 16186
  • [33] Deep Convolutional Neural Network for Knowledge-Infused Text Classification
    Malik, Sonika
    Jain, Sarika
    NEW GENERATION COMPUTING, 2024, 42 (01) : 157 - 176
  • [34] Binary and Multiclass Text Classification by Means of Separable Convolutional Neural Network
    Solovyeva, Elena
    Abdullah, Ali
    INVENTIONS, 2021, 6 (04)
  • [35] Semantic Template-based Convolutional Neural Network for Text Classification
    Chang, Yung-Chun
    Ng, Siu Hin
    Chen, Jung-Peng
    Liang, Yu-Chi
    Hsu, Wen-Lian
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (11)
  • [36] Improving convolutional neural network for text classification by recursive data pruning
    Li, Qi
    Li, Pengfei
    Mao, Kezhi
    Lo, Edmond Yat-Man
    NEUROCOMPUTING, 2020, 414 : 143 - 152
  • [37] Deep Convolutional Neural Network for Knowledge-Infused Text Classification
    Sonika Malik
    Sarika Jain
    New Generation Computing, 2024, 42 : 157 - 176
  • [38] Variable Convolution and Pooling Convolutional Neural Network for Text Sentiment Classification
    Dong, Min
    Li, Yongfa
    Tang, Xue
    Xu, Jingyun
    Bi, Sheng
    Cai, Yi
    IEEE ACCESS, 2020, 8 : 16174 - 16186
  • [39] Adaptive Multiscale Superpixel Embedding Convolutional Neural Network for Land Use Classification
    Zhang, Huaizhong
    Altham, Callum
    Trovati, Marcello
    Zhang, Ce
    Rolland, Iain
    Lawal, Lanre
    Wegbu, Dozien
    Ajienka, Nemitari
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 7631 - 7642
  • [40] Speech-Act Classification Using Convolutional Neural Network and Word Embedding
    Bae, Kyoungman
    Ko, Youngjoong
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2018, 27 (06)