Impact of convolutional neural network and FastText embedding on text classification

被引:0
|
作者
Muhammad Umer
Zainab Imtiaz
Muhammad Ahmad
Michele Nappi
Carlo Medaglia
Gyu Sang Choi
Arif Mehmood
机构
[1] The Islamia University of Bahawalpur,Department of Computer Science & Information Technology
[2] Khwaja Fareed University of Engineering and Information Technology (KFUEIT),Department of Computer Science
[3] Khwaja Fareed University of Engineering and Information Technology (KFUEIT),Department of Computer Engineering
[4] University of Salerno,Department of Computer Science
[5] Link Campus University of Rome,Research Department
[6] Yeungnam University,Department of Information and Communication Engineering
来源
关键词
Convolutional Neural Network (CNN); FastText; Text mining; Deep learning; Natural language processing;
D O I
暂无
中图分类号
学科分类号
摘要
Efficient word representation techniques (word embeddings) with modern machine learning models have shown reasonable improvement on automatic text classification tasks. However, the effectiveness of such techniques has not been evaluated yet in terms of insufficient word vector representation for training. Convolutional Neural Network has achieved significant results in pattern recognition, image analysis, and text classification. This study investigates the application of the CNN model on text classification problems by experimentation and analysis. We trained our classification model with a prominent word embedding generation model, Fast Text on publically available datasets, six benchmark datasets including Ag News, Amazon Full and Polarity, Yahoo Question Answer, Yelp Full, and Polarity. Furthermore, the proposed model has been tested on the Twitter US airlines non-benchmark dataset as well. The analysis indicates that using Fast Text as word embedding is a very promising approach.
引用
收藏
页码:5569 / 5585
页数:16
相关论文
共 50 条
  • [41] The Impact of Preprocessing on Classification Performance in Convolutional Neural Networks for Turkish Text
    Salur, Mehmet Umut
    Aydin, Ilhan
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [42] Impact analysis of convolutional neural network in classification of satellite imagery
    Gupta, Mohan Vishal
    Dwivedi, Rakesh Kumar
    Kumar, Anil
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2023, 44 (06): : 1151 - 1166
  • [43] The Impact of Convolutional Neural Network Parameters in the Binary Classification of Mammograms
    Dicu, Madalina
    Diosan, Laura
    Andreica, Anca
    Chira, Camelia
    Cordos, Alin
    2022 24TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, SYNASC, 2022, : 181 - 188
  • [44] Impact of Convolutional Neural Network Input Parameters on Classification Performance
    Maitra, Sanjit
    Ojha, Rahul Kumar
    Ghosh, Kuntal
    2018 4TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [45] An Innovative Word Encoding Method For Text Classification Using Convolutional Neural Network
    Helmy, Amr Adel
    Omar, Yasser M. K.
    Hodhod, Rania
    2018 14TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2018, : 42 - 47
  • [46] Research on News Text Classification Based on Deep Learning Convolutional Neural Network
    Zhu, Yunlong
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [47] Modified Convolutional Neural Network Filter Gate for Social Media Text Classification
    Suhaimi, Nur Suhailayani
    Othman, Zalinda
    Yaakub, Mohd Ridzwan
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (05): : 617 - 627
  • [48] Text Classification Based on Word2vec and Convolutional Neural Network
    Li, Lin
    Xiao, Linlong
    Jin, Wenzhen
    Zhu, Hong
    Yang, Guocai
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V, 2018, 11305 : 450 - 460
  • [49] SiNoptiC: swarm intelligence optimisation of convolutional neural network architectures for text classification
    Ferjani, Imen
    Hidri, Minyar Sassi
    Frihida, Ali
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2022, 68 (01) : 82 - 100
  • [50] Covariance Matrix Adaptation Evolution Strategy for Convolutional Neural Network in Text Classification
    Toledano-Lopez, Orlando Grabiel
    Madera, Julio
    Gonzalez, Hector
    Simon Cuevas, Alfredo
    PROGRESS IN ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, 2021, 13055 : 69 - 78