Comparison of Sentence Similarity Measures for Russian Paraphrase Identification

被引:0
|
作者
Pronoza, Ekaterina [1 ]
Yagunova, Elena [1 ]
机构
[1] St Petersburg State Univ, St Petersburg, Russia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we analyze and compare different types of sentence similarity measures applied to the problem of sentential paraphrase identification. We work with Russian, and all the experiments are conducted on the Russian paraphrase corpus we have collected from the news headlines (and are collecting at the moment). Apart from the similarity measures, we also analyze the corpus itself. As a result of the research we disprove the supposition that it is more difficult to distinguish between precise and loose paraphrases than between loose paraphrases and non-paraphrases. We also come up with the recommendations for the application of different similarity measures to identifying paraphrases derived from the news texts.
引用
收藏
页码:74 / 82
页数:9
相关论文
共 50 条
  • [31] Integrating Transformer and Paraphrase Rules for Sentence Simplification
    Zhao, Sanqiang
    Meng, Rui
    He, Daqing
    Saptono, Andi
    Parmanto, Bambang
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 3164 - 3173
  • [32] Novel approach for constructing conversational agents using sentence similarity measures
    O'Shea, Karen
    Bandar, Zuhair
    Crockett, Keeley
    WORLD CONGRESS ON ENGINEERING 2008, VOLS I-II, 2008, : 321 - 326
  • [33] Comparative Analysis of Similarity Measures for Sentence Level Semantic Measurement of Text
    Saad, Sazianti Mohd
    Kamarudin, Siti Sakira
    2013 IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE 2013), 2013, : 90 - +
  • [34] A Comparison of Similarity Measures for Text Documents
    Hariharan, Shanmugasundaram
    Srinivasan, Rengaramanujam
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2008, 7 (01) : 1 - 8
  • [35] Using similarity measures for histogram comparison
    Van der Weken, D
    Nachtegael, M
    Kerre, E
    FUZZY SETS AND SYSTEMS - IFSA 2003, PROCEEDINGS, 2003, 2715 : 396 - 403
  • [36] A COMPARISON OF SIMILARITY MEASURES OF FUZZY VALUES
    CHEN, SM
    YEH, MS
    HSIAO, PY
    FUZZY SETS AND SYSTEMS, 1995, 72 (01) : 79 - 89
  • [37] Hyperspectrum comparison using similarity measures
    Lopez-Molina, C.
    Marco-Detchart, C.
    Bustince, H.
    Fernandez, J.
    Lopez-Maestresalas, A.
    Ayala-Martini, D.
    2017 JOINT 17TH WORLD CONGRESS OF INTERNATIONAL FUZZY SYSTEMS ASSOCIATION AND 9TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (IFSA-SCIS), 2017,
  • [38] A COMPARISON OF SOME MEASURES FOR THE DETERMINATION OF INTERMOLECULAR STRUCTURAL SIMILARITY MEASURES OF INTERMOLECULAR STRUCTURAL SIMILARITY
    WILLETT, P
    WINTERMAN, V
    QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1986, 5 (01): : 18 - 25
  • [39] Paraphrase Identification by Using Clause-Based Similarity Features and Machine Translation Metrics
    Thenmozhi, D.
    Aravindan, Chandrabose
    COMPUTER JOURNAL, 2016, 59 (09): : 1289 - 1302
  • [40] On Paraphrase Identification Corpora
    Rus, Vasile
    Banjade, Rajendra
    Lintean, Mihai
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2422 - 2429