Evaluation of Sentence Embedding Models for Natural Language Understanding Problems in Russian

被引:1
|
作者
Popov, Dmitry [1 ]
Pugachev, Alexander [1 ]
Svyatokum, Polina [1 ]
Svitanko, Elizaveta [1 ]
Artemova, Ekaterina [1 ]
机构
[1] Natl Res Univ Higher Sch Econ, Moscow, Russia
关键词
Multiple choice question answering; Next sentence prediction; Paraphrase identification; Sentence embedding;
D O I
10.1007/978-3-030-37334-4_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate the performance of sentence embeddings models on several tasks for the Russian language. In our comparison, we include such tasks as multiple choice question answering, next sentence prediction, and paraphrase identification. We employ FastText embeddings as a baseline and compare it to ELMo and BERT embeddings. We conduct two series of experiments, using both unsupervised (i.e., based on similarity measure only) and supervised approaches for the tasks. Finally, we present datasets for multiple choice question answering and next sentence prediction in Russian.
引用
收藏
页码:205 / 217
页数:13
相关论文
共 50 条
  • [1] Analysis of sentence embedding models using prediction tasks in natural language processing
    Adi, Y.
    Kermany, E.
    Belinkov, Y.
    Lavi, O.
    Goldberg, Y.
    [J]. IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2017, 61 (4-5)
  • [3] Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation
    Poliak, Adam
    Haldar, Aparajita
    Rudinger, Rachel
    Hu, J. Edward
    Pavlick, Ellie
    White, Aaron Steven
    Van Durme, Benjamin
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 67 - 81
  • [4] Character-based Embedding Models and Reranking Strategies for Understanding Natural Language Meal Descriptions
    Korpusik, Mandy
    Collins, Zachary
    Glass, James
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3320 - 3324
  • [5] Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language Understanding
    Ghaddar, Abbas
    Wu, Yimeng
    Bagga, Sunyam
    Rashid, Ahmad
    Bibi, Khalil
    Rezagholizadeh, Mehdi
    Xing, Chao
    Wang, Yasheng
    Xinyu, Duan
    Wang, Zhefeng
    Huai, Baoxing
    Jiang, Xin
    Liu, Qun
    Langlais, Philippe
    [J]. arXiv, 2022,
  • [6] MODELS OF NATURAL-LANGUAGE UNDERSTANDING
    BATES, M
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (22) : 9977 - 9982
  • [7] RussianSuperGLUE: A Russian Language Understanding Evaluation Benchmark
    Shavrina, Tatiana
    Fenogenova, Alena
    Emelyanov, Anton
    Shevelev, Denis
    Artemova, Ekaterina
    Malykh, Valentin
    Mikhailov, Vladislav
    Tikhonova, Maria
    Chertok, Andrey
    Evlampiev, Andrey
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4717 - 4726
  • [8] Enhancing performance of transformer-based models in natural language understanding through word importance embedding
    Hong, Seung-Kyu
    Jang, Jae-Seok
    Kwon, Hyuk-Yoon
    [J]. KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [9] Learning Directional Sentence-Pair Embedding for Natural Language Reasoning (Student Abstract)
    Jiang, Yuchen
    Xiao, Zhenxin
    Chang, Kai-Wei
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13825 - 13826
  • [10] Fertility models for statistical natural language understanding
    Della Pietra, S
    Epstein, M
    Roukos, S
    Ward, T
    [J]. 35TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 8TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 1997, : 168 - 173