Identifying lexical compatibilities of words by vectors of specialized words

Cited by: 0
Authors
Baimuratov, O. A. [1]
Ayazbayev, D. A. [1]
Affiliation
[1] Suleyman Demirel Univ, Kaskelen, Kazakhstan
Keywords
vectors of words; Skip-gram model; lexical compatibilities of words
DOI
10.26577/JMMCS.2020.v107.i3.07
Chinese Library Classification (CLC) number
O1 [Mathematics];
Discipline classification code
0701 ; 070101 ;
Abstract
In the court system, a secretary fills in protocols. Protocols filled in with mistakes can lead to misunderstandings between people, so it is important to write them properly. In this work, lexical compatibilities of words were computed in order to identify mistakes. To do this, the Skip-gram model was applied. In the Skip-gram model, words are represented by vectors. The vectors of words with similar meanings and of lexically compatible words should point in approximately the same direction. Therefore, to calculate the lexical compatibility of two words, the cosine of the angle between the two corresponding vectors was computed. The cosine for highly lexically compatible words should be approximately equal to 1, and for lexically incompatible words it should be approximately -1. To test their system, the authors used the text of an article of the Constitution of the Republic of Kazakhstan. Specifically, words unrelated to the meaning of the article were inserted, and the system had to identify the inserted words. The system showed high accuracy for some words but low accuracy for others. In the authors' opinion, this happened because the inserted words, even though unrelated in meaning, could still be lexically compatible with their neighbors. For example, the word "computer" can be used in other contexts with the Kazakh word meaning "old". This research was carried out within the framework of the Ministry of Education and Science of the Republic of Kazakhstan grant project "Developing and implementing the innovative competency-based model of multilingual IT specialist in the course of national education system modernization".
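The cosine measure described in the abstract can be illustrated with a minimal sketch. The record does not give the authors' implementation, so everything here is an assumption made for illustration: the two-dimensional toy vectors, the function names `cosine_similarity` and `flag_incompatible`, the rule of averaging a word's cosine with its immediate neighbors, and the threshold of 0. In practice the vectors would come from a trained Skip-gram model.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def flag_incompatible(tokens, vectors, threshold=0.0):
    """Return tokens whose mean cosine with adjacent tokens is below threshold.

    Hypothetical decision rule, not taken from the paper.
    """
    flagged = []
    for i, word in enumerate(tokens):
        # Collect the left and right neighbors that exist.
        neighbors = [tokens[j] for j in (i - 1, i + 1) if 0 <= j < len(tokens)]
        if not neighbors:
            continue
        scores = [cosine_similarity(vectors[word], vectors[n]) for n in neighbors]
        if sum(scores) / len(scores) < threshold:
            flagged.append(word)
    return flagged


# Toy vectors invented for this example (not real Skip-gram output).
toy_vectors = {
    "court": [1.0, 0.2],
    "secretary": [0.8, 0.1],
    "protocol": [0.9, 0.3],
    "computer": [-0.7, 0.6],
}
toy_tokens = ["court", "secretary", "computer", "protocol", "court"]
flagged_words = flag_incompatible(toy_tokens, toy_vectors)
```

With these toy values the inserted word "computer" points away from its neighbors, so its mean cosine is negative and it is the only word flagged. A word that is unrelated in meaning but still compatible with its neighbors would score above the threshold and be missed, which matches the failure case the authors describe.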
Pages: 67-73
Number of pages: 7