Similarity between the Association Measures: a Case Study of Noun Phrases

Times cited: 0
Authors
Khokhlova, Maria [1 ]
Affiliations
[1] St Petersburg State Univ, Dept Math Linguist, Universitetskaya Emb 11, St Petersburg 199034, Russia
Keywords
collocability; collocations; corpora; statistics; statistical measures; gold standard
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Collocation extraction has gained much attention in natural language processing, and its results are important in various areas of applied linguistics. The research compares over a dozen association measures on a subset of the Russian Web corpus. The paper studies automatically extracted Adj-Noun collocations. The aim of the experiments is twofold: first, to examine the differences between the statistical measures, and second, to find the most effective one for the Russian data. The former involves calculating Spearman's rank correlation coefficient; the latter involves evaluating the extracted lists against a Russian dictionary, i.e. identifying collocations that were both automatically extracted and manually collected. The results are not so straightforward: one can distinguish groups of measures that demonstrate relative interchangeability. In addition, the produced bigrams can be considered collocations by experts and thus may enrich dictionaries.
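The two evaluation steps named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the choice of PMI and t-score as the two measures, the bigram labels, the frequency counts, and the "gold" dictionary set are all hypothetical, and precision at k is only one plausible way to score a ranked list against a dictionary.

```python
import math

def pmi(f_xy, f_x, f_y, n):
    """Pointwise mutual information of a bigram (log base 2)."""
    return math.log2(f_xy * n / (f_x * f_y))

def t_score(f_xy, f_x, f_y, n):
    """t-score: observed minus expected frequency, scaled by sqrt(observed)."""
    expected = f_x * f_y / n
    return (f_xy - expected) / math.sqrt(f_xy)

def average_ranks(values):
    """Rank values in ascending order; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def spearman(a, b):
    """Spearman's rho, computed as the Pearson correlation of the ranks."""
    ra, rb = average_ranks(a), average_ranks(b)
    n = len(ra)
    mean_a, mean_b = sum(ra) / n, sum(rb) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(ra, rb))
    var_a = sum((x - mean_a) ** 2 for x in ra)
    var_b = sum((y - mean_b) ** 2 for y in rb)
    return cov / math.sqrt(var_a * var_b)

def precision_at_k(scores, gold, k):
    """Share of the k highest-scored bigrams that appear in the gold set."""
    top = sorted(scores, key=scores.get, reverse=True)[:k]
    return sum(1 for bigram in top if bigram in gold) / k

# Hypothetical counts: bigram -> (f_xy, f_adj, f_noun); N is the corpus size.
N = 1_000_000
counts = {
    "adj1 noun1": (120, 900, 1500),
    "adj2 noun2": (15, 40, 60),
    "adj3 noun3": (300, 20000, 30000),
    "adj4 noun4": (45, 5000, 700),
}
gold = {"adj1 noun1", "adj2 noun2"}  # hypothetical dictionary entries

pmi_scores = {b: pmi(*c, N) for b, c in counts.items()}
t_scores = {b: t_score(*c, N) for b, c in counts.items()}

bigrams = list(counts)
rho = spearman([pmi_scores[b] for b in bigrams], [t_scores[b] for b in bigrams])
print("Spearman rho between PMI and t-score rankings:", round(rho, 3))
print("PMI precision@2:", precision_at_k(pmi_scores, gold, 2))
print("t-score precision@2:", precision_at_k(t_scores, gold, 2))
```

A rho close to 1 would suggest that two measures rank the candidates almost identically, which is one way to read the "relative interchangeability" mentioned in the abstract.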
Pages: 21-27
Number of pages: 7