Comparative study of word embeddings models and their usage in Arabic language applications

被引:0
|
作者
Suleiman, Dima [1 ,2 ]
Awajan, Arafat [1 ]
机构
[1] Princess Sumaya Univ Technol, King Hussein Fac Comp Sci, Dept Comp Sci, Amman, Jordan
[2] Univ Jordan, Dept Informat Technol, Amman, Jordan
关键词
word embeddings; deep learning; sentiment analysis; word2vec; Glove; semantic similarity; CBOW; Skip-grant;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Word embeddings is the representation of the text using vectors such that the words that have similar syntax and semantic will have similar vector representation. Representing words using vectors is very crucial for most of natural language processing applications. In natural language, when using neural network for processing, the words vectors will be fed as input to the network. In this paper, a comparative study of several word embeddings models is conducted including Glove and the two approaches of word2vec model called CBOW and Skip-gram. Furthermore, this study surveying most of the state-of-art of using word embeddings in Arabic language applications such as sentiment analysis, semantic similarity, short answer grading, information retrieval, paraphrase identification, plagiarism detection and Textual Entailment.
引用
收藏
页码:64 / 70
页数:7
相关论文
共 50 条
  • [21] Arabic Text Classification Based on Word and Document Embeddings
    El Mahdaouy, Abdelkader
    Gaussier, Eric
    El Alaoui, Said Ouatik
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 32 - 41
  • [22] AltibbiVec: A Word Embedding Model for Medical and Health Applications in the Arabic Language
    Habib, Maria
    Faris, Mohammad
    Alomari, Alaa
    Faris, Hossam
    [J]. IEEE ACCESS, 2021, 9 : 133875 - 133888
  • [23] Creating Welsh Language Word Embeddings
    Corcoran, Padraig
    Palmer, Geraint
    Arman, Laura
    Knight, Dawn
    Spasic, Irena
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (15):
  • [24] Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning
    Aljuhani, Khulood O.
    Alyoubi, Khaled H.
    Alotaibi, Fahd S.
    [J]. TEHNICKI GLASNIK-TECHNICAL JOURNAL, 2022, 16 (03): : 394 - 400
  • [25] The Word, a Study of the Foundations of Language and Speech-Usage.
    Zipf, George Kingsley
    [J]. MODERN LANGUAGE JOURNAL, 1938, 23 (03): : 232 - 233
  • [26] Improving interpretability of word embeddings by generating definition and usage
    Zhang, Haitong
    Du, Yongping
    Sun, Jiaxin
    Li, Qingxiao
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 160 (160)
  • [27] Integrating Word Embeddings into IBM Word Alignment Models
    Anh-Cuong Le
    Tuan-Phong Nguyen
    Quoc-Long Tran
    [J]. PROCEEDINGS OF 2018 10TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2018, : 79 - 84
  • [28] Word Recognition in Arabic as a Foreign Language
    Hansen, Gunna Funder
    [J]. MODERN LANGUAGE JOURNAL, 2010, 94 (04): : 567 - 581
  • [29] Arabic Sentiment Analysis Based on Word Embeddings and Deep Learning
    Elhassan, Nasrin
    Varone, Giuseppe
    Ahmed, Rami
    Gogate, Mandar
    Dashtipour, Kia
    Almoamari, Hani
    El-Affendi, Mohammed A.
    Al-Tamimi, Bassam Naji
    Albalwy, Faisal
    Hussain, Amir
    [J]. COMPUTERS, 2023, 12 (06)
  • [30] A comprehensive review on Arabic word sense disambiguation for natural language processing applications
    Kaddoura, Sanaa
    Ahmed, Rowanda D.
    Hemanth, Jude D.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 12 (04)