Turkish Text Classification with Machine Learning and Transfer Learning

被引:4
|
作者
Aydogan, Murat [1 ]
Karci, Ali [2 ]
机构
[1] Bingo Univ, Genc Meslek Yuksekokulu, Bingol, Turkey
[2] Inonu Univ, Bilgisayar Muhendisligi Bolumu, Malatya, Turkey
关键词
Turkish text classification; machine learning; word embedding; transfer learning;
D O I
10.1109/idap.2019.8875919
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of text classification is one of the most fundamental topics of study in the field of natural language processing, but when reviewing the literature, it is seen that there is an inadequate number of studies for the issue of Turkish text classification. Two different Turkish datasets were created for this aim. Word vectors were created on the first dataset of unlabeled texts. These word vectors were transferred to the second dataset created with data collected from various news sites by transfer learning. Text classification was applied with the machine learning algorithms on this dataset. The effects of transfer learning and transferring of word vectors on the accuracy rate and the performance of machine learning methods were analyzed in detail. When studying the experimental results, it was determined that Support Vector Machine model was performed more successful and It was seen that the accuracy rate was improved with transfer learning.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] The Effect of Transfer Learning on Turkish Text Classification
    Sahin, Gurkan
    Diri, Banu
    [J]. 29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [2] Active Learning for Turkish Text Classification
    Sapci, Ali Osman Berk
    Tastan, Oznur
    Yeniterzi, Reyyan
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [3] Machine Learning-Based Text Classification Comparison: Turkish Language Context
    Alzoubi, Yehia Ibrahim
    Topcu, Ahmet E.
    Erkaya, Ahmed Enis
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (16):
  • [4] Transfer Learning beyond Text Classification
    Yang, Qiang
    [J]. ADVANCES IN MACHINE LEARNING, PROCEEDINGS, 2009, 5828 : 10 - 22
  • [5] Machine Learning Based Text Summarization for Turkish News
    Kartal, Yavuz Selim
    Kutlu, Mucahid
    [J]. 2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [6] Sentiment Analysis in Turkish Text with Machine Learning Algorithms
    Rumelli, Merve
    Akkus, Deniz
    Kart, Ozge
    Isik, Zerrin
    [J]. 2019 INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS CONFERENCE (ASYU), 2019, : 123 - 127
  • [7] A Review of Machine Learning Algorithms for Text Classification
    Li, Ruiguang
    Liu, Ming
    Xu, Dawei
    Gao, Jiaqi
    Wu, Fudong
    Zhu, Liehuang
    [J]. CYBER SECURITY, CNCERT 2021, 2022, 1506 : 226 - 234
  • [8] Application of machine learning method in text classification
    Sui, Zhenhuan
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 120 - 120
  • [9] Machine learning for Asian language text classification
    Peng, Fuchun
    Huang, Xiangji
    [J]. JOURNAL OF DOCUMENTATION, 2007, 63 (03) : 378 - 397
  • [10] Machine Learning Approach for Text Classification in Cybercrime
    Kumari, Swati
    Saquib, Zia
    Pawar, Sanjay
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,