Comparison of Sentence Similarity Measures for Russian Paraphrase Identification

被引:0
|
作者
Pronoza, Ekaterina [1 ]
Yagunova, Elena [1 ]
机构
[1] St Petersburg State Univ, St Petersburg, Russia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we analyze and compare different types of sentence similarity measures applied to the problem of sentential paraphrase identification. We work with Russian, and all the experiments are conducted on the Russian paraphrase corpus we have collected from the news headlines (and are collecting at the moment). Apart from the similarity measures, we also analyze the corpus itself. As a result of the research we disprove the supposition that it is more difficult to distinguish between precise and loose paraphrases than between loose paraphrases and non-paraphrases. We also come up with the recommendations for the application of different similarity measures to identifying paraphrases derived from the news texts.
引用
收藏
页码:74 / 82
页数:9
相关论文
共 50 条
  • [1] SIMILARITY MEASURES BASED ON SENTENCE SEMANTIC STRUCTURE FOR RECOGNIZING PARAPHRASE AND ENTAILMENT
    Liu, Xiao-Ying
    Ren, Chuan-Lun
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 1601 - 1607
  • [2] Semantically-informed distance and similarity measures for paraphrase plagiarism identification
    Alvarez-Carmona, Miguel A.
    Franco-Salvador, Marc
    Villatoro-Tello, Esau
    Montes-y-Gomez, Manuel
    Rosso, Paolo
    Villasenor-Pineda, Luis
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 34 (05) : 2983 - 2990
  • [3] Relevance of Similarity Measures Usage for Paraphrase Detection
    Vrbanec, Tedo
    Mestrovic, Ana
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (KDIR), VOL 1:, 2021, : 129 - 138
  • [4] The evaluation of sentence similarity measures
    Achananuparp, Palakorn
    Hu, Xiaohua
    Shen, Xiajiong
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2008, 5182 : 305 - +
  • [5] Using Fuzzy Set Similarity in Sentence Similarity Measures
    Cross, Valerie
    Mokrenko, Valeria
    Crockett, Keeley
    Adel, Naeemeh
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [6] RuPAWS: A Russian Adversarial Dataset for Paraphrase Identification
    Martynov, Nikita
    Krotova, Irina
    Logacheva, Varvara
    Panchenko, Alexander
    Kozlova, Olga
    Semenov, Nikita
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 5683 - 5691
  • [7] Constructing a Turkish Corpus for Paraphrase Identification and Semantic Similarity
    Eyecioglu, Asli
    Keller, Bill
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 588 - 599
  • [9] Attribute Value-Range Detection in Identification of Paraphrase Sentence Pairs
    Kumova, Senem
    Karaoglan, Bahar
    Kisla, Tarik
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1393 - 1396
  • [10] Paraphrase Identification Between Two Sentence Using Support Vector Machine
    Saputro, Wahyu Faqih
    Djamal, Esmeralda C.
    Ilyas, Ridwan
    PROCEEDING OF 2019 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI), 2019, : 406 - 411