A Systematic Literature Review on Word Embeddings

Cited by: 19
Authors
Gutierrez, Luis [1]
Keith, Brian [1]
Affiliations
[1] Univ Catolica Norte, Dept Comp & Syst Engn, Av Angamos 0610, Antofagasta, Chile
Keywords
Bayesian networks; Sentiment analysis; Literature review; Opinion mining;
DOI
10.1007/978-3-030-01171-0_12
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This article presents a systematic literature review of word embeddings in natural language processing and text processing. We searched three sources and classified 140 articles that either propose word embeddings or apply them. Word embeddings have been widely adopted, with satisfactory results both in natural language processing tasks and in other domains. We report the hegemony of word embeddings based on neural models (i.e., variants of word2vec) over those generated by matrix factorization. Finally, despite the good performance of word embeddings, we identify some drawbacks and their proposed solutions, such as the lack of interpretability of the real values that make up the embedded vectors.
Pages: 132-141
Page count: 10
Related papers
50 in total
  • [1] Profiling of Intertextuality in Latin Literature Using Word Embeddings
    Burns, Patrick J.
    Brofos, James A.
    Li, Kyle
    Chaudhuri, Pramit
    Dexter, Joseph P.
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021: 4900 - 4907
  • [2] Word Segmentation Task for Southeast Asian Abugida Scripts: A Systematic Literature Review
    Wicaksono, Baskoro Adi
    Hantono, Bimo Sunarfri
    Adji, Teguh Bharata
    [J]. Proceedings - 2024 2nd International Conference on Technology Innovation and Its Applications, ICTIIA 2024, 2024
  • [3] Cross-cultural electronic word-of-mouth: a systematic literature review
    Kusawat, Poompak
    Teerakapibal, Surat
    [J]. SPANISH JOURNAL OF MARKETING-ESIC, 2024, 28 (02) : 126 - 143
  • [4] Thesaurus-based word embeddings for automated biomedical literature classification
    Dimitrios A. Koutsomitropoulos
    Andreas D. Andriopoulos
    [J]. Neural Computing and Applications, 2022, 34 : 937 - 950
  • [5] Thesaurus-based word embeddings for automated biomedical literature classification
    Koutsomitropoulos, Dimitrios A.
    Andriopoulos, Andreas D.
    [J]. NEURAL COMPUTING & APPLICATIONS, 2022, 34 (02) : 937 - 950
  • [6] Combining word embeddings to extract chemical and drug entities in biomedical literature
    Lopez-Ubeda, Pilar
    Diaz-Galiano, Manuel Carlos
    Urena-Lopez, L. Alfonso
    Martin-Valdivia, M. Teresa
    [J]. BMC BIOINFORMATICS, 2021, 22 (SUPPL 1)
  • [7] Combining word embeddings to extract chemical and drug entities in biomedical literature
    Pilar López-Úbeda
    Manuel Carlos Díaz-Galiano
    L. Alfonso Ureña-López
    M. Teresa Martín-Valdivia
    [J]. BMC Bioinformatics, 22
  • [8] Socialized Word Embeddings
    Zeng, Ziqian
    Yin, Yichun
    Song, Yangqiu
    Zhang, Ming
    [J]. PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017: 3915 - 3921
  • [9] Urdu Word Embeddings
    Haider, Samar
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018: 964 - 968
  • [10] Dynamic Word Embeddings
    Bamler, Robert
    Mandt, Stephan
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017