A hybrid lexicon-based and neural approach for explainable polarity detection

被引：4

作者：

Polignano, Marco ^{[1
]}

Basile, Valerio ^{[2
]}

Basile, Pierpaolo ^{[1
]}

Gabrieli, Giuliano ^{[3
]}

Vassallo, Marco ^{[3
]}

Bosco, Cristina ^{[2
]}

机构：

[1] Univ Bari Aldo Mom, Via E Orabona 4, I-70125 Bari, Apulia, Italy

[2] Univ Turin, Via Giuseppe Verdi 8, I-10124 Turin, Piemonte, Italy

[3] CREA Res Ctr Agr Policies & Bioecon, Rome, Italy

来源：

INFORMATION PROCESSING & MANAGEMENT | 2022年 / 59卷 / 05期

关键词：

Sentiment analysis; Polarity detection; Lexicon; WMAL; BERT; Explanation; Deep learning; Machine learning;

D O I：

10.1016/j.ipm.2022.103058

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this work, we propose BERT-WMAL, a hybrid model that brings together information coming from data through the recent transformer deep learning model and those obtained from a polarized lexicon. The result is a model for sentence polarity that manages to have performances comparable with those at the state-of-the-art, but with the advantage of being able to provide the end-user with an explanation regarding the most important terms involved with the provided prediction. The model has been evaluated on three polarity detection Italian dataset, i.e., SENTIPOLC, AGRITREND and ABSITA. While the first contains 7,410 tweets released for training and 2,000 for testing, the second and the third respectively include 1,000 tweets without splitting, and 2,365 reviews for training, 1,171 for testing. The use of lexicon-based information proves to be effective in terms of the F1 measure since it shows an improvement of F1 score on all the observed dataset: from 0.664 to 0.669 (i.e, 0.772%) on AGRITREND, from 0.728 to 0.734 (i.e., 0.854%) on SENTIPOLC and from 0.904 to 0.921 (i.e, 1.873%) on ABSITA. The usefulness of this model not only depends on its effectiveness in terms of the F1 measure, but also on its ability to generate predictions that are more explainable and especially convincing for the end-users. We evaluated this aspect through a user study involving four native Italian speakers, each evaluating 64 sentences with associated explanations. The results demonstrate the validity of this approach based on a combination of weights of attention extracted from the deep learning model and the linguistic knowledge stored in the WMAL lexicon. These considerations allow us to regard the approach provided in this paper as a promising starting point for further works in this research area.

引用

页数：20

共 50 条

[1] A polarity calculation approach for lexicon-based Turkish sentiment analysis
Yurtalan, Gokhan
Koyuncu, Murat
Turhan, Cigdem
[J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1325 - 1339
[2] A lexicon-based approach for hate speech detection
School of Information Science and Engineering, Central South University, Changsha, China
不详
[J]. Int. J. Multimedia Ubiquitous Eng., 4 (215-230):
[3] Towards the Lexicon-Based Sentiment Analysis of Polish Texts: Polarity Lexicon
Haniewicz, Konstanty
Rutkowski, Wojciech
Adamczyk, Magdalena
Kaczmarek, Monika
[J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2013, 8083 : 286 - 295
[4] A semantic approach in the lexicon-based feature selection for emotion detection
Gonzalez-Guerra, Harold
Simon-Cuevas, Alfredo
Perea-Ortega, Jose M.
Olivas, Jose A.
[J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2021, (67): : 115 - 126
[5] A Lexicon-Based Graph Neural Network for Chinese NER
Gui, Tao
Zou, Yicheng
Zhang, Qi
Peng, Minlong
Fu, Jinlan
Wei, Zhongyu
Huang, Xuanjing
[J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1040 - 1050
[6] Using Hybrid-Stemming Approach to Enhance Lexicon-based Sentiment Analysis in Arabic
Awwad, Hunaida
Alpkocak, Adil
[J]. 2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 229 - 235
[7] Effective lexicon-based approach for Urdu sentiment analysis
Neelam Mukhtar
Mohammad Abid Khan
[J]. Artificial Intelligence Review, 2020, 53 : 2521 - 2548
[8] Mining Comparative Opinions in Portuguese: A Lexicon-based Approach
Kansaon, Daniel
Brandão, Michele A.
Reis, Julio C. S.
Benevenuto, Fabrício
[J]. Journal of the Brazilian Computer Society, 2024, 30 (01) : 347 - 362
[9] Sentiment strength detection with a context-dependent lexicon-based convolutional neural network
Huang, Minghui
Xie, Haoran
Rao, Yanghui
Feng, Jingrong
Wang, Fu Lee
[J]. INFORMATION SCIENCES, 2020, 520 : 389 - 399
[10] A Lexicon-based Collaborative Filtering Approach for Recommendation Systems
Deac-Petrusel, Mara
[J]. ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 203 - 210

← 1 2 3 4 5 →