Sentiment analysis in Turkish: Supervised, semi-supervised, and unsupervised techniques

被引:6
|
作者
Aydin, Cem Rifki [1 ]
Gungor, Tunga [1 ]
机构
[1] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
关键词
Sentiment analysis; Opinion mining; Machine learning; Text classification; Morphological analysis; CLASSIFIERS;
D O I
10.1017/S1351324920000200
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although many studies on sentiment analysis have been carried out for widely spoken languages, this topic is still immature for Turkish. Most of the works in this language focus on supervised models, which necessitate comprehensive annotated corpora. There are a few unsupervised methods, and they utilize sentiment lexicons either built by translating from English lexicons or created based on corpora. This results in improper word polarities as the language and domain characteristics are ignored. In this paper, we develop unsupervised (domain-independent) and semi-supervised (domain-specific) methods for Turkish, which are based on a set of antonym word pairs as seeds. We make a comprehensive analysis of supervised methods under several feature weighting schemes. We then form ensemble of supervised classifiers and also combine the unsupervised and supervised methods. Since Turkish is an agglutinative language, we perform morphological analysis and use different word forms. The methods developed were tested on two datasets having different styles in Turkish and also on datasets in English to show the portability of the approaches across languages. We observed that the combination of the unsupervised and supervised approaches outperforms the other methods, and we obtained a significant improvement over the state-of-the-art results for both Turkish and English.
引用
收藏
页码:455 / 483
页数:29
相关论文
共 50 条
  • [31] Semi-supervised discriminant analysis
    Cai, Deng
    He, Xiaofei
    Han, Jiawei
    [J]. 2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 222 - 228
  • [32] A Semi-supervised Learning Approach for Microblog Sentiment Classification
    Yu, Zhiwei
    Wong, Raymond K.
    Chi, Chi-Hung
    Chen, Fang
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART CITY/SOCIALCOM/SUSTAINCOM (SMARTCITY), 2015, : 339 - 344
  • [33] Semi-supervised Component Analysis
    Watanabe, Kenji
    Wada, Toshikazu
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 3011 - 3016
  • [34] Semi-supervised target-oriented sentiment classification
    Xu, Weidi
    Tan, Ying
    [J]. NEUROCOMPUTING, 2019, 337 : 120 - 128
  • [35] Semi-supervised sentiment clustering on natural language texts
    Frigau, Luca
    Romano, Maurizio
    Ortu, Marco
    Contu, Giulia
    [J]. STATISTICAL METHODS AND APPLICATIONS, 2023, 32 (04): : 1239 - 1257
  • [36] Semi-supervised sentiment clustering on natural language texts
    Luca Frigau
    Maurizio Romano
    Marco Ortu
    Giulia Contu
    [J]. Statistical Methods & Applications, 2023, 32 : 1239 - 1257
  • [37] Multimodal, Semi-supervised and Unsupervised web content credibility analysis Frameworks
    Saini, Naman
    Singhal, Mukul
    Tanwar, Mukul
    Meel, Priyanka
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 948 - 955
  • [38] Leveraging Emotional Consistency for Semi-supervised Sentiment Classification
    Minh Luan Nguyen
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT I, 2016, 9651 : 369 - 381
  • [39] Unsupervised and semi-supervised Lagrangian support vector machines
    Zhao, Kun
    Tian, Ying-Jie
    Deng, Nai-Yang
    [J]. COMPUTATIONAL SCIENCE - ICCS 2007, PT 3, PROCEEDINGS, 2007, 4489 : 882 - 889
  • [40] Unsupervised identification of points of interest for semi-supervised learning
    Frigui, H
    [J]. FUZZ-IEEE 2005: Proceedings of the IEEE International Conference on Fuzzy Systems: BIGGEST LITTLE CONFERENCE IN THE WORLD, 2005, : 91 - 96