Statistical and Semantic Features to Measure Sentence Similarity in Portuguese

被引:6
|
作者
Pinheiro, Anderson [1 ]
Ferreira, Rafael [1 ]
Ferreira, Maverick Andre D. [1 ]
Rolim, Vitor B. [1 ]
Tenorio, Joao Vitor S. [2 ]
机构
[1] Univ Fed Rural Pernambuco, Dept Estat & Informat, Recife, PE, Brazil
[2] Univ Fed Minas Gerais, Dept Ciencia Comp, Belo Horizonte, MG, Brazil
关键词
Sentence Similarity; Natural Language Processing; Text Mining;
D O I
10.1109/BRACIS.2017.40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A sentence similarity measure is an important field for different applications of text mining. In recent literature, it is possible to find several similarity measures between sentences in English; however, it lacks measures for Portuguese. In addition, one of the main issues to assess sentence similarity is to identify word meaning. In this context, this work aims to present a new approach to measure the similarity between sentences written in Portuguese using statistical and deep learning features to overcome the meaning problems. The results showed that our method obtained better results when compared to the measures proposed in ASSIN 2016 competition.
引用
收藏
页码:342 / 347
页数:6
相关论文
共 50 条
  • [1] FAST: A Fuzzy Semantic Sentence Similarity Measure
    Chandran, David
    Crockett, Keeley
    Mclean, David
    Bandar, Zuhair
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ - IEEE 2013), 2013,
  • [2] Eigenvalue Based Features For Semantic Sentence Similarity
    Vardasbi, Ali
    Faili, Heshaam
    Asadpour, Masoud
    [J]. 2017 19TH CSI INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2017, : 184 - 189
  • [3] Combining Common Words And Semantic Features For Sentence Similarity
    Prasad, M. KrishnaSiva
    Sharma, Poonam
    [J]. 2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [4] A novel sentence similarity measure for semantic-based expert systems
    Lee, Ming Che
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 6392 - 6399
  • [5] Semantic similarity measure of polish nouns based on linguistic features
    Piasecki, Maciej
    Broda, Bartosz
    [J]. BUSINESS INFORMATION SYSTEMS, PROCEEDINGS, 2007, 4439 : 381 - +
  • [6] Sentence Similarity Using Syntactic and Semantic Features for Multi-document Summarization
    Anjaneyulu, M.
    Sarma, S. S. V. N.
    Reddy, P. Vijaya Pal
    Chander, K. Prem
    Nagaprasad, S.
    [J]. INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, VOL 2, 2019, 56 : 471 - 485
  • [7] Sentence Similarity Measurement with Convolutional Neural Networks Using Semantic and Syntactic Features
    Zhang, Shiru
    Liang, Zhiyao
    Lin, Jian
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 63 (02): : 943 - 957
  • [8] Sequential Sentence Embeddings for Semantic Similarity
    Carta, Antonio
    Bacciu, Davide
    [J]. 2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019), 2019, : 1354 - 1361
  • [9] Word and Sentence Embedding Tools to Measure Semantic Similarity of Gene Ontology Terms by Their Definitions
    Duong, Dat
    Ahmad, Wasi Uddin
    Eskin, Eleazar
    Chang, Kai-Wei
    Li, Jingyi Jessica
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2019, 26 (01) : 38 - 52
  • [10] Semantic similarity, predictability, and models of sentence processing
    Roland, Douglas
    Yun, Hongoak
    Koenig, Jean-Pierre
    Mauner, Gail
    [J]. COGNITION, 2012, 122 (03) : 267 - 279