A Comparative Text Classification Study with Deep Learning-Based Algorithms

被引:5
|
作者
Koksal, Omer [1 ]
Akgul, Ozlem [2 ]
机构
[1] ASELSAN, Artificial Intelligence & Informat Technol Dept, Ankara, Turkey
[2] Middle East Tech Univ, Elect & Elect Engn Dept, Ankara, Turkey
关键词
text classification; deep learning; convolutional neural network; recurrent neural network; LSTM; GRU;
D O I
10.1109/ICEEE55327.2022.9772587
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As a well-known Natural Language Processing (NLP) task, text classification can be defined as the process of categorizing documents depending on their content. In this process, selecting classification algorithms and tuning classification parameters are crucial for efficient classification. In recent years, many deep learning algorithms have been used successfully in text classification tasks. This paper performed a comparative study utilizing and optimizing several deep learning-based algorithms. We have implemented deep neural networks (DNN), convolutional neural networks (CNN), long shortest-term memory (LSTM), and gated recurrent units (GRU). In addition, we performed extensive experiments by tuning hyperparameters to improve classification accuracy. In addition, we implemented word embeddings techniques to acquire feature vectors of text data. Then we compared our word embeddings results with traditional TF-IDF vectorization results of DNN and CNN. In our experiments, we used an open-source Turkish News benchmarking dataset to compare our results with previous studies in the literature. Our experimental results revealed significant improvements in classification performance using word embeddings with deep learning-based algorithms and tuning hyperparameters. Furthermore, our work outperformed previous results on the selected dataset.
引用
收藏
页码:387 / 391
页数:5
相关论文
共 50 条
  • [1] Comparative Study of Deep Learning-Based Sentiment Classification
    Seo, Seungwan
    Kim, Czangyeob
    Kim, Haedong
    Mo, Kyounghyun
    Kang, Pilsung
    [J]. IEEE ACCESS, 2020, 8 : 6861 - 6875
  • [2] Analytics of machine learning-based algorithms for text classification
    Hassan, Sayar Ul
    Ahamed, Jameel
    Ahmad, Khaleel
    [J]. Sustainable Operations and Computers, 2022, 3 : 238 - 248
  • [3] Deep Learning-based Text Classification: A Comprehensive Review
    Minaee, Shervin
    Kalchbrenner, Nal
    Cambria, Erik
    Nikzad, Narjes
    Chenaghlu, Meysam
    Gao, Jianfeng
    [J]. ACM COMPUTING SURVEYS, 2022, 54 (03)
  • [4] Road object detection: a comparative study of deep learning-based algorithms
    Mahaur, Bharat
    Singh, Navjot
    Mishra, K. K.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (10) : 14247 - 14282
  • [5] Road object detection: a comparative study of deep learning-based algorithms
    Bharat Mahaur
    Navjot Singh
    K. K. Mishra
    [J]. Multimedia Tools and Applications, 2022, 81 : 14247 - 14282
  • [6] Improving automated Turkish text classification with learning-based algorithms
    Koksal, Omer
    Yilmaz, Eyup Halit
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (11):
  • [7] Phishing Webpage Classification via Deep Learning-Based Algorithms: An Empirical Study
    Nguyet Quang Do
    Selamat, Ali
    Krejcar, Ondrej
    Yokoi, Takeru
    Fujita, Hamido
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (19):
  • [8] A Deep Learning-Based Text Classification of Adverse Nursing Events
    Lu, Wenjing
    Jiang, Wei
    Zhang, Na
    Xue, Feng
    [J]. JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021
  • [9] Deep Learning-Based Sentiment Classification: A Comparative Survey
    Mabrouk, Alhassan
    Diaz Redondo, Rebeca P.
    Kayed, Mohammed
    [J]. IEEE ACCESS, 2020, 8 : 85616 - 85638
  • [10] A comparative study on various pre-processing techniques and deep learning algorithms for text classification
    Bhuvaneshwari, P.
    Rao, A. Nagaraja
    [J]. International Journal of Cloud Computing, 2022, 11 (01): : 61 - 78