SentPT: A customized solution for multi-genre sentiment analysis of Portuguese-language texts

Cited by: 1
Authors
Goularte, Fabio Bif [1]
Martins, Bruno Emanuel da Graca [1]
Carvalho, Paula Cristina Quaresma da Fonseca [1]
Won, Miguel [1]
Affiliations
[1] Univ Tecn Lisboa, INESC ID, Inst Super Tecn, Lisbon, Portugal
Keywords
Sentiment analysis; Sentiment polarity; Affective knowledge; Transfer learning; Deep learning
DOI
10.1016/j.eswa.2023.123075
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Sentiment analysis is a data-driven task, and the resources currently available mostly cover only a couple of text genres in specific contexts. Notably, sentiment analysis advancements have primarily centered on high-resource languages, whereas numerous languages and their speakers are overlooked. This paper introduces SentPT, a novel polarity classifier designed for sentiment analysis of Portuguese-language texts spanning various genres. The aim is to address the gap in multi-genre sentiment analysis by offering a customized solution. To this end, we curate a comprehensive dataset covering different contexts, such as news, literary texts, opinions, comments, social media, and more, followed by preprocessing for consistency. Our proposed classifier adopts a Transfer Learning approach, fine-tuning a BERT model, and is evaluated against a diverse set of texts, including product reviews, literary works, news articles, and game comments. The evaluation employs traditional metrics like precision, recall, and F1-score, with SentPT demonstrating the best overall performance. Our classifier proves effective for formal and informal texts, outperforming existing systems.
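The precision, recall, and F1-score named in the abstract can be illustrated with a minimal sketch. It is not the authors' evaluation code: the three-way label set (negative, neutral, positive), the toy gold and predicted labels, the helper names `per_class_prf` and `macro_f1`, and the choice of macro-averaging are all assumptions made for illustration.

```python
from collections import Counter

def per_class_prf(gold, pred, labels):
    """Return {label: (precision, recall, f1)} computed from paired label lists."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1          # correct prediction for class g
        else:
            fp[p] += 1          # p was predicted but is wrong
            fn[g] += 1          # g was the true class but was missed
    scores = {}
    for label in labels:
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        scores[label] = (precision, recall, f1)
    return scores

def macro_f1(scores):
    """Unweighted mean of per-class F1 (assumed averaging scheme)."""
    return sum(f1 for _, _, f1 in scores.values()) / len(scores)

# Hypothetical label set and toy data, not taken from the paper.
LABELS = ["negative", "neutral", "positive"]
gold = ["positive", "negative", "neutral", "positive", "negative", "neutral"]
pred = ["positive", "negative", "positive", "positive", "neutral", "neutral"]

scores = per_class_prf(gold, pred, LABELS)
for label in LABELS:
    p, r, f = scores[label]
    print(f"{label}: precision={p:.3f} recall={r:.3f} f1={f:.3f}")
print(f"macro-F1={macro_f1(scores):.3f}")
```

Reporting per-class scores alongside an average is useful in a multi-genre setting, since a classifier may handle one polarity class well in formal text and poorly in informal text.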
Pages: 9