A Comparison of fastText Implementations Using Arabic Text Classification

被引:1
|
作者
Alghamdi, Nuha [1 ,2 ]
Assiri, Fatmah [2 ]
机构
[1] King Abdulaziz Univ, Jeddah, Saudi Arabia
[2] Univ Jeddah, Jeddah, Saudi Arabia
关键词
Word embeddings; NLP; Arabic classification; Machine learning; fastText;
D O I
10.1007/978-3-030-29513-4_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The quality of word representation is crucial to obtain good results in many natural language processing tasks. Recently, many word representation models (word embeddings), such as fastText, have been developed. In this research, we compared the algorithms for the fastText implementation, Facebook's official implementation, and Gensim's implementation using the same pre-trained fastText model. Using multiclass classification, we evaluated these embeddings. According to the results, the Facebook implementation performed better than Gensim's implementation, with an average accuracy of 78.22% and 56.73%, respectively, for sentence embeddings and an average accuracy of 79.43% and 57.95%, respectively, for word embeddings.
引用
收藏
页码:306 / 311
页数:6
相关论文
共 50 条
  • [1] Text Classification Model Based on fastText
    Yao, Tengjun
    Zhai, Zhengang
    Gao, Bingtao
    [J]. PROCEEDINGS OF 2020 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS), 2020, : 154 - 157
  • [2] A Comparison of Text-Classification Techniques Applied to Arabic Text
    Kanaan, Ghassan
    Al-Shalabi, Riyad
    Ghwanmeh, Sameh
    Al-Ma'adeed, Hamda
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (09): : 1836 - 1844
  • [3] Text classification framework for short text based on TFIDF-FastText
    Shrutika Chawla
    Ravreet Kaur
    Preeti Aggarwal
    [J]. Multimedia Tools and Applications, 2023, 82 : 40167 - 40180
  • [4] Arabic text classification using Polynomial Networks
    Al-Tahrawi, Mayy M.
    Al-Khatib, Sumaya N.
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2015, 27 (04) : 437 - 449
  • [5] Text classification framework for short text based on TFIDF-FastText
    Chawla, Shrutika
    Kaur, Ravreet
    Aggarwal, Preeti
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (26) : 40167 - 40180
  • [6] Text Classification of Flu-related Tweets Using FastText with Sentiment and Keyword Features
    Alessa, Ali
    Faezipour, Miad
    Alhassan, Zakhriya
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2018, : 366 - 367
  • [7] Arabic text classification using deep learning models
    Elnagar, Ashraf
    Al-Debsi, Ridhwan
    Einea, Omar
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (01)
  • [8] Arabic dialects classification using text mining techniques
    AL-Walaie, Mona Abdullah
    Khan, Muhammad Badruddin
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER AND APPLICATIONS (ICCA), 2017, : 325 - 329
  • [9] Arabic Text Classification Using Linear Discriminant Analysis
    Al-Anzi, Fawaz S.
    AbuZeina, Dia
    [J]. 2017 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2017,
  • [10] Arabic Text Mining Using Rule Based Classification
    Thabtah, Fadi
    Gharaibeh, Omar
    Al-Zubaidy, Rashid
    [J]. JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2012, 11 (01)