Similarity between the Association Measures: a Case Study of Noun Phrases

被引:0
|
作者
Khokhlova, Maria [1 ]
机构
[1] St Petersburg State Univ, Dept Math Linguist, Univ Skaya Emb 11, St Petersburg 199034, Russia
关键词
collocability; collocations; corpora; statistics; statistical measures; gold standard;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Collocation extraction has gained much attention in natural language processing, its results are important in various areas of applied linguistics. The research focuses on a comparison between over a dozen of association measures based on a subset of the Russian Web corpus. The paper studies the automatically extracted Adj-Noun collocations. The aim of the experiments is two-fold. First, to examine the difference between statistical measures and second to find the most effective one for the Russian data. The former assumes the calculation of the Spearman's rank correlation coefficient and the latter implies the evaluation of the extracted lists against a Russian dictionary, i.e. identifying automatically extracted and manually collected collocations. The results are not such straightforward, one can distinguish between groups of measures that demonstrate a relative interchangeability. Also the produced bigrams can be considered as collocations by experts and thus may enrich dictionaries.
引用
收藏
页码:21 / 27
页数:7
相关论文
共 50 条
  • [11] The interplay between classifier choice and animacy in Mandarin-Chinese noun phrases: an ERP study
    Frankowsky, Maximilian
    Ke, Dan
    Zwitserlood, Pienie
    Michel, Rene
    Boelte, Jens
    LANGUAGE COGNITION AND NEUROSCIENCE, 2022, 37 (07) : 866 - 882
  • [12] Comparison of Methods to Assess Similarity between Phrases
    Angles, Renzo
    Araya, Valeria
    Concha, Jesus
    Paredes, Rodrigo
    PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827 : 255 - 263
  • [13] ANALYSIS ON THE RELATION BETWEEN STATISTICAL SIMILARITY MEASURES AND AGRICULTURAL PARAMETERS: A CASE STUDY
    Chesnokova, Olga
    Erten, Esra
    Hajnsek, Irena
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 6313 - 6316
  • [14] RHYTHMIC STRUCTURE IN ITALIAN NOUN PHRASES - A STUDY ON VOWEL DURATIONS
    FARNETANI, E
    KORI, S
    PHONETICA, 1990, 47 (1-2) : 50 - 65
  • [15] Positions for oblique case-marked arguments in Hungarian noun phrases
    Farkas, Judit
    Alberti, Gabor
    JEZIKOSLOVLJE, 2016, 17 (1-2): : 295 - 319
  • [16] The Role of Discourse Prominence in Antecedent Search: The Case of Genitive Noun Phrases
    Kennison, Shelia M.
    DISCOURS-REVUE DE LINGUISTIQUE PSYCHOLINGUISTIQUE ET INFORMATIQUE, 2016, (18):
  • [17] A Syntactic Study on Attributive Noun Phrases of the Combined Type and the Cohesive Type
    Man Zaijiang
    YUYAN KEXUE-LINGUISTIC SCIENCES, 2016, 15 (06): : 588 - 598
  • [19] A Further Study on Attributive Noun Phrases " NP de VP" in Chinese
    Wu Zaosheng
    Guo Yiding
    YUYAN KEXUE-LINGUISTIC SCIENCES, 2018, 17 (04): : 385 - 397
  • [20] An MEG study of temporal characteristics of semantic integration in Japanese noun phrases
    Kiguchi, Hirohisa
    Asakura, Nobuhiko
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (06) : 1656 - 1663