Building and evaluating resources for sentiment analysis in the Greek language

被引:0
|
作者
Adam Tsakalidis
Symeon Papadopoulos
Rania Voskaki
Kyriaki Ioannidou
Christina Boididou
Alexandra I. Cristea
Maria Liakata
Yiannis Kompatsiaris
机构
[1] University of Warwick,Department of Computer Science
[2] CERTH,Information Technologies Institute
[3] Centre for the Greek Language,Laboratory of Translation and Natural Language Processing
[4] Aristotle University of Thessaloniki,Department of Computer Science
[5] The Alan Turing Institute,undefined
[6] University of Durham,undefined
来源
关键词
Sentiment lexicon; Greek language; Word embeddings; Sentiment analysis; Natural language processing; Opinion mining; Emotion analysis; Sarcasm detection;
D O I
暂无
中图分类号
学科分类号
摘要
Sentiment lexicons and word embeddings constitute well-established sources of information for sentiment analysis in online social media. Although their effectiveness has been demonstrated in state-of-the-art sentiment analysis and related tasks in the English language, such publicly available resources are much less developed and evaluated for the Greek language. In this paper, we tackle the problems arising when analyzing text in such an under-resourced language. We present and make publicly available a rich set of such resources, ranging from a manually annotated lexicon, to semi-supervised word embedding vectors and annotated datasets for different tasks. Our experiments using different algorithms and parameters on our resources show promising results over standard baselines; on average, we achieve a 24.9% relative improvement in F-score on the cross-domain sentiment analysis task when training the same algorithms with our resources, compared to training them on more traditional feature sources, such as n-grams. Importantly, while our resources were built with the primary focus on the cross-domain sentiment analysis task, they also show promising results in related tasks, such as emotion analysis and sarcasm detection.
引用
收藏
页码:1021 / 1044
页数:23
相关论文
共 50 条
  • [1] Building and evaluating resources for sentiment analysis in the Greek language
    Tsakalidis, Adam
    Papadopoulos, Symeon
    Voskaki, Rania
    Ioannidou, Kyriaki
    Boididou, Christina
    Cristea, Alexandra I.
    Liakata, Maria
    Kompatsiaris, Yiannis
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2018, 52 (04) : 1021 - 1044
  • [2] Sentiment Analysis for the Greek Language
    Spatiotis, Nikolaos
    Mporas, Iosif
    Paraskevas, Michael
    Perikos, Isidoros
    [J]. 20TH PAN-HELLENIC CONFERENCE ON INFORMATICS (PCI 2016), 2016,
  • [3] Is there a language of sentiment? An analysis of lexical resources for sentiment analysis
    Ann Devitt
    Khurshid Ahmad
    [J]. Language Resources and Evaluation, 2013, 47 : 475 - 511
  • [4] Is there a language of sentiment? An analysis of lexical resources for sentiment analysis
    Devitt, Ann
    Ahmad, Khurshid
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2013, 47 (02) : 475 - 511
  • [5] Exploring Resources for Sentiment Analysis in Portuguese Language
    de Freitas, Larissa A.
    Vieira, Renata
    [J]. 2015 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2015), 2015, : 152 - 156
  • [6] A new approach applying Sentiment Analysis in Greek Language
    Rokkou, Georgia
    Spatiotis, Nikolaos
    Triantafyllou, Vasilis
    Paraskevas, Michael
    [J]. 25TH PAN-HELLENIC CONFERENCE ON INFORMATICS WITH INTERNATIONAL PARTICIPATION (PCI2021), 2021, : 235 - 241
  • [7] Examining the Impact of Discretization Technique on Sentiment Analysis for the Greek Language
    Spatiotis, Nikolaos
    Perikos, Isidoros
    Mporas, Iosif
    Paraskevas, Michael
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2019, : 365 - 370
  • [8] Examining the Impact of Feature Selection on Sentiment Analysis for the Greek Language
    Spatiotis, Nikolaos
    Paraskevas, Michael
    Perikos, Isidoros
    Mporas, Iosif
    [J]. SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 353 - 361
  • [9] Dataset of sentiment tagged language resources for Bosnian language
    Jahic, Sead
    Vicic, Jernej
    [J]. DATA IN BRIEF, 2024, 53
  • [10] Building Large Arabic Multi-domain Resources for Sentiment Analysis
    ElSahar, Hady
    El-Beltagy, Samhaa R.
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT II, 2015, 9042 : 23 - 34