A hybrid lexicon-based and neural approach for explainable polarity detection

Cited by: 4
Authors
Polignano, Marco [1]
Basile, Valerio [2]
Basile, Pierpaolo [1]
Gabrieli, Giuliano [3]
Vassallo, Marco [3]
Bosco, Cristina [2]
Affiliations
[1] Univ Bari Aldo Moro, Via E Orabona 4, I-70125 Bari, Apulia, Italy
[2] Univ Turin, Via Giuseppe Verdi 8, I-10124 Turin, Piemonte, Italy
[3] CREA Res Ctr Agr Policies & Bioecon, Rome, Italy
Keywords
Sentiment analysis; Polarity detection; Lexicon; WMAL; BERT; Explanation; Deep learning; Machine learning;
DOI
10.1016/j.ipm.2022.103058
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
In this work, we propose BERT-WMAL, a hybrid model that combines information learned from data by a recent transformer-based deep learning model with information obtained from a polarized lexicon. The result is a sentence polarity model whose performance is comparable with the state of the art, with the added advantage that it can give the end-user an explanation of the terms that matter most for a given prediction. The model has been evaluated on three Italian polarity detection datasets: SENTIPOLC, AGRITREND and ABSITA. The first contains 7,410 tweets released for training and 2,000 for testing; the second includes 1,000 tweets with no predefined split; the third includes 2,365 reviews for training and 1,171 for testing. The lexicon-based information proves effective in terms of the F1 measure, improving the F1 score on all the observed datasets: from 0.664 to 0.669 (i.e., 0.772%) on AGRITREND, from 0.728 to 0.734 (i.e., 0.854%) on SENTIPOLC and from 0.904 to 0.921 (i.e., 1.873%) on ABSITA. The usefulness of this model depends not only on its effectiveness in terms of the F1 measure, but also on its ability to generate predictions that are more explainable and, above all, convincing for end-users. We evaluated this aspect through a user study involving four native Italian speakers, each evaluating 64 sentences with associated explanations. The results demonstrate the validity of this approach, which combines attention weights extracted from the deep learning model with the linguistic knowledge stored in the WMAL lexicon. These considerations allow us to regard the approach presented in this paper as a promising starting point for further work in this research area.
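The percentages quoted in the abstract read as relative improvements of the F1 score over the baseline. The sketch below shows that calculation under this assumption; the function name `relative_improvement` and the dictionary layout are illustrative and not taken from the paper. Because the abstract's F1 scores are rounded to three decimals, the values computed here need not match the quoted percentages exactly.

```python
def relative_improvement(baseline: float, improved: float) -> float:
    """Return the relative change from baseline to improved, in percent."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (improved - baseline) / baseline * 100.0


# F1 scores as quoted in the abstract (rounded to three decimals):
# dataset name -> (F1 without lexicon information, F1 with it).
f1_scores = {
    "AGRITREND": (0.664, 0.669),
    "SENTIPOLC": (0.728, 0.734),
    "ABSITA": (0.904, 0.921),
}

# Relative F1 gain per dataset, assuming the percentages are relative changes.
gains = {
    name: relative_improvement(before, after)
    for name, (before, after) in f1_scores.items()
}

for name, gain in gains.items():
    print(f"{name}: {gain:.3f}%")
```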
Pages: 20