Detecting Paraphrases for Portuguese using Word and Sentence Embeddings

被引:3
|
作者
Souza, Marlo [1 ]
Sanches, Leandro M. P. [1 ]
机构
[1] Univ Fed Bahia, Salvador, BA, Brazil
来源
LINGUAMATICA | 2018年 / 10卷 / 02期
关键词
Paraphrase Identification; Semantic Textual Similarity; Sentence Embeddings;
D O I
10.21814/lm.10.2.286
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Paraphrase detection/identification is the task of determining whether two or more sentences of arbitrary length possess the same meaning. Methods to solve this task have many potential applications in Natural Language Processing systems. This work investigates the combination of different methods of sentence representation in a vector space model of language and linear classifiers to the problem of paraphrase identification for the Portuguese language. The results obtained in this work are inferior to those obtained for the related task of recognizing textual entailment in the ASSIN evaluation for the Portuguese language, but we point out that in this work we investigate the application of sentence embeddings to the problem of paraphrase detection, as such other features usually explored in systems for this task may be trivially incorporated into our method to improve performance.
引用
收藏
页码:31 / 44
页数:14
相关论文
共 50 条
  • [1] Detecting ongoing events using contextual word and sentence embeddings
    Maisonnave, Mariano
    Delbianco, Fernando
    Tohme, Fernando
    Maguitman, Ana
    Milios, Evangelos
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 209
  • [2] Retrofitting Contextualized Word Embeddings with Paraphrases
    Shi, Weijia
    Chen, Muhao
    Zhou, Pei
    Chang, Kai-Wei
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1198 - 1203
  • [3] Single document summarization using word and sentence embeddings
    Ayana
    PROCEEDINGS OF THE 2015 JOINT INTERNATIONAL MECHANICAL, ELECTRONIC AND INFORMATION TECHNOLOGY CONFERENCE (JIMET 2015), 2015, 10 : 523 - 526
  • [4] Exploring fake news identification using word and sentence embeddings
    Priyanga, V. T.
    Sanjanasri, J. P.
    Menon, Vijay Krishna
    Gopalakrishnan, E. A.
    Soman, K. P.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (05) : 5441 - 5448
  • [5] Learning Word and Sentence Embeddings Using a Generative Convolutional Network
    Vargas-Ocampo, Edgar
    Roman-Rangel, Edgar
    Hermosillo-Valadez, Jorge
    PATTERN RECOGNITION, 2018, 10880 : 135 - 144
  • [6] Developing a sentence level fairness metric using word embeddings
    Ahmed Izzidien
    Stephen Fitz
    Peter Romero
    Bao S. Loe
    David Stillwell
    International Journal of Digital Humanities, 2023, 5 (2-3) : 95 - 130
  • [7] Enhancing Sentence Simplification in Portuguese: Leveraging Paraphrases, Context, and Linguistic Features
    Scalercio, Arthur
    Finatto, Maria Jose
    Paes, Aline
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 15076 - 15091
  • [8] Improving Implicit Stance Classification in Tweets Using Word and Sentence Embeddings
    Schaefer, Robin
    Stede, Manfred
    ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2019, 2019, 11793 : 299 - 307
  • [9] Carrier Sentence Selection with Word and Context Embeddings
    Yeung, Chak Yan
    Lee, John
    Tsou, Benjamin
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 439 - 444
  • [10] Using Paraphrases to Study Properties of Contextual Embeddings
    Burdick, Laura
    Kummerfeld, Jonathan K.
    Mihalcea, Rada
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4558 - 4568