Comparative study of word embeddings models and their usage in Arabic language applications

被引:0
|
作者
Suleiman, Dima [1 ,2 ]
Awajan, Arafat [1 ]
机构
[1] Princess Sumaya Univ Technol, King Hussein Fac Comp Sci, Dept Comp Sci, Amman, Jordan
[2] Univ Jordan, Dept Informat Technol, Amman, Jordan
关键词
word embeddings; deep learning; sentiment analysis; word2vec; Glove; semantic similarity; CBOW; Skip-grant;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Word embeddings is the representation of the text using vectors such that the words that have similar syntax and semantic will have similar vector representation. Representing words using vectors is very crucial for most of natural language processing applications. In natural language, when using neural network for processing, the words vectors will be fed as input to the network. In this paper, a comparative study of several word embeddings models is conducted including Glove and the two approaches of word2vec model called CBOW and Skip-gram. Furthermore, this study surveying most of the state-of-art of using word embeddings in Arabic language applications such as sentiment analysis, semantic similarity, short answer grading, information retrieval, paraphrase identification, plagiarism detection and Textual Entailment.
引用
收藏
页码:64 / 70
页数:7
相关论文
共 50 条
  • [1] Pretrained Transformer Language Models Versus Pretrained Word Embeddings for the Detection of Accurate Health Information on Arabic Social Media: Comparative Study
    Albalawi, Yahya
    Nikolov, Nikola S.
    Buckley, Jim
    [J]. JMIR FORMATIVE RESEARCH, 2022, 6 (06)
  • [2] A Comparative Study of Pre-trained Word Embeddings for Arabic Sentiment Analysis
    Zouidine, Mohamed
    Khalil, Mohammed
    [J]. 2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 1243 - 1248
  • [3] A Comparative Study of Word Embedding Models for Arabic Text Processing
    Assiri, Fatmah
    Alghamdi, Nuha
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (09): : 399 - 403
  • [4] COMPARATIVE STUDY OF ARABIC AND FRENCH STATISTICAL LANGUAGE MODELS
    Meftouh, Karima
    Smaili, Kamel
    Laskri, Mohamed Tayeb
    [J]. ICAART 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, 2009, : 156 - +
  • [5] A Comparative Study of Word Embedding Models for Arabic Text Processing
    Assiri, Fatmah
    Alghamdi, Nuha
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (08): : 399 - 403
  • [6] Dissecting word embeddings and language models in natural language processing
    Verma, Vivek Kumar
    Pandey, Mrigank
    Jain, Tarun
    Tiwari, Pradeep Kumar
    [J]. JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2021, 24 (05): : 1509 - 1515
  • [7] Word Embeddings for Arabic Sentiment Analysis
    Altowayan, A. Aziz
    Tao, Lixin
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3820 - 3825
  • [8] The Impact of Arabic Diacritization on Word Embeddings
    Abbache, Mohamed
    Abbache, Ahmed
    Xu, Jingwen
    Meziane, Farid
    Wen, Xianbin
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (06)
  • [9] Methodical Evaluation of Arabic Word Embeddings
    Elrazzaz, Mohammed
    Elbassuoni, Shady
    Shaban, Khaled
    Helwe, Chadi
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 2, 2017, : 454 - 458
  • [10] Hybrid Word/Part-of-Arabic-Word Language Models For Arabic Text Document Recognition
    BenZeghiba, Mohamed Faouzi
    Louradour, Jerome
    Kermorvant, Christopher
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 671 - 675