An Annotated Corpus for Turkish Sentiment Analysis at Sentence Level

被引:3
|
作者
Omurca, Sevinc Ihan [1 ]
Ekinci, Ekin
Turkmen, Hazal
机构
[1] Fac Engn, Dept Comp Engn, TR-41380 Kocaeli, Turkey
关键词
Aspect based sentiment analysis; Turkish Language; text mining; morphological analysis; annotation; !text type='JSON']JSON[!/text] data;
D O I
10.1109/IDAP.2017.8090212
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid growth of unstructured data accessible via web, managing these data and finding undiscovered information in huge dataset become a necessary task. Consequently text mining, which can be defined as gleaning important information from natural language text, has emerged. In this study, in order to facilitate information management for aspect based sentiment analysis studies, a Turkish sentiment corpus, which is comprised of user reviews and is annotated semi-automatically, is constructed. In the constructed corpus, the root form of the words, the usage (aspect/multiaspect/seedsentiment/absent) of these words, Part of Speech (POS) tags and their polarities are defined. Turkish hotel review dataset which contains 1000 reviews and 5364 sentences for this study was crawled from a web source. The system takes reviews, aspect and seedsentiment lists and returns JSON data structures of the annotated corpus. In this paper, both we provide a ready to use dataset for developing aspect based sentiment analysis applications and we make this dataset easy to use for Java applications by creating JSON data.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Exploiting Linguistic Features for Effective Sentence-Level Sentiment Analysis in Urdu Language
    Amna Altaf
    Muhammad Waqas Anwar
    Muhammad Hasan Jamal
    Usama Ijaz Bajwa
    [J]. Multimedia Tools and Applications, 2023, 82 : 41813 - 41839
  • [42] Sentence-Level Sentiment Analysis Using Feature Vectors from Word Embeddings
    Hayashi, Toshitaka
    Fujita, Hamido
    [J]. NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_18), 2018, 303 : 749 - 758
  • [43] Coordinate Relationship Extraction On Sentence Level In Chinese Corpus
    Sun, Rong
    Liu, Zongtian
    Zhou, Wen
    [J]. 2013 NINTH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2013, : 127 - 135
  • [44] An annotated corpus for the analysis of VP ellipsis
    Bos, Johan
    Spenader, Jennifer
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2011, 45 (04) : 463 - 494
  • [45] An annotated corpus for the analysis of VP ellipsis
    Johan Bos
    Jennifer Spenader
    [J]. Language Resources and Evaluation, 2011, 45 : 463 - 494
  • [46] Sentiment Analysis based on Specific Dictionary and Sentence Analysis
    Wang, Xinyue
    Ding, Cheng
    Zheng, Wenxi
    Wu, Min
    [J]. PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON ECONOMICS AND MANAGEMENT, EDUCATION, HUMANITIES AND SOCIAL SCIENCES (EMEHSS 2017), 2017, 86 : 6 - 10
  • [47] SFU ReviewSP-NEG: a Spanish corpus annotated with negation for sentiment analysis. A typology of negation patterns
    Salud María Jiménez-Zafra
    Mariona Taulé
    M. Teresa Martín-Valdivia
    L. Alfonso Ureña-López
    M. Antónia Martí
    [J]. Language Resources and Evaluation, 2018, 52 : 533 - 569
  • [48] SFU ReviewSP-NEG: a Spanish corpus annotated with negation for sentiment analysis. A typology of negation patterns
    Maria Jimenez-Zafra, Salud
    Taule, Mariona
    Teresa Martin-Valdivia, M.
    Alfonso Urena-Lopez, L.
    Antonia Marti, M.
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2018, 52 (02) : 533 - 569
  • [49] Annotation of a Corpus of Tweets for Sentiment Analysis
    dos Santos, Allisfrank
    Barros Junior, Jorge Daniel
    Camargo, Heloisa de Arruda
    [J]. COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 294 - 302
  • [50] Sentiment Analysis on (Bengali Horoscope) Corpus
    Ghosal, Tirthankar
    Das, Sajal K.
    Bhattacharjee, Saprativa
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,