Testing semantic similarity measures for extracting synonyms from a corpus

被引:0
|
作者
Ferret, Olivier [1 ]
机构
[1] CEA, LIST, Vis & Content Engn Lab, F-92265 Fontenay Aux Roses, France
关键词
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
The definition of lexical semantic similarity measures has been the subject of lots of works for many years. In this article, we focus more specifically on distributional semantic similarity measures. Although several evaluations of this kind of measures were already achieved for determining if they actually catch semantic relatedness, it is still difficult to determine if a measure that performs well in an evaluation framework can be applied more widely with the same success. In the work we present here, we first select a semantic similarity measure by testing a large set of such measures against the WordNet-based Synonymy Test, an extended TOEFL test proposed in (Freitag et al., 2005), and we show that its accuracy is comparable to the accuracy of the best state of the art measures while it has less demanding requirements. Then, we apply this measure for extracting automatically synonyms from a corpus and we evaluate the relevance of this process against two reference resources, WordNet and the Moby thesaurus. Finally, we compare our results in details to those of (Curran and Moens, 2002).
引用
收藏
页码:3338 / 3343
页数:6
相关论文
共 50 条
  • [1] A French Corpus for Semantic Similarity
    Cardon, Remi
    Grabar, Natalia
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6889 - 6894
  • [2] A Semantic Framework for Extracting Taxonomic Relations from Text Corpus
    Phuoc Thi Hong Doan
    Arch-int, Ngamnij
    Arch-int, Somjit
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (03) : 325 - 337
  • [3] Comparing measures of semantic similarity
    Ljubesic, Nikola
    Boras, Damir
    Bakaric, Nikola
    Njavro, Jasmina
    [J]. PROCEEDINGS OF THE ITI 2008 30TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY INTERFACES, 2008, : 675 - 681
  • [4] On semantic properties of interestingness measures for extracting rules from data
    Maddouri, Mondher
    Gammoudi, Jamil
    [J]. ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, PT 1, 2007, 4431 : 148 - +
  • [5] Clustering for semantic purposes Exploration of semantic similarity in a technical corpus
    Bertels, Ann
    Speelman, Dirk
    [J]. TERMINOLOGY, 2014, 20 (02): : 279 - 303
  • [6] The semantic measures library: Assessing semantic similarity from knowledge representation analysis
    Harispe, Sébastien
    Ranwez, Sylvie
    Janaqi, Stefan
    Montmain, Jacky
    [J]. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8455 LNCS : 254 - 257
  • [7] Extracting equivalents from aligned parallel texts: Comparison of measures of similarity
    Ribeiro, A
    Lopes, GP
    Mexia, J
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, 2000, 1952 : 339 - 349
  • [8] Extracting Named Entities and Synonyms from Wikipedia
    Bohn, Christian
    Norvag, Kjetil
    [J]. 2010 24TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2010, : 1300 - 1307
  • [9] Influence of Token Similarity Measures for Semantic Textual Similarity
    Sowmya, V.
    Vardhan, Vishnu B.
    Raju, Bhadri M. S. V. S.
    [J]. 2016 IEEE 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC), 2016, : 41 - 44
  • [10] Similarity of Sentences With Contradiction Using Semantic Similarity Measures
    Prasad, M. Krishna Siva
    Sharma, Poonam
    [J]. COMPUTER JOURNAL, 2022, 65 (03): : 701 - 717