AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi Tweets

Cited by: 81
Authors
Al-Twairesh, Nora [1]
Al-Khalifa, Hend [1]
Al-Salman, AbdulMalik [1]
Al-Ohali, Yousef [1]
Affiliation
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
Keywords
Sentiment Analysis; Arabic NLP; Corpus Sentiment Annotation; Arabic tweets; Saudi Dialect; RESOURCES;
DOI
10.1016/j.procs.2017.10.094
Chinese Library Classification
TP39 [Computer applications];
Discipline classification codes
081203 ; 0835 ;
Abstract
Arabic Sentiment Analysis is an active research area these days. However, the Arabic language still lacks sufficient language resources to enable the tasks of sentiment analysis. In this paper, we present the details of collecting and constructing a large dataset of Arabic tweets. The techniques used in cleaning and pre-processing the collected dataset are explained. A corpus of Arabic tweets annotated for sentiment analysis was extracted from this dataset. The corpus consists mainly of tweets written in Modern Standard Arabic and the Saudi dialect. The corpus was manually annotated for sentiment. The annotation process is explained in detail and the challenges during the annotation are highlighted. The corpus contains 17,573 tweets labelled with four labels for sentiment: positive, negative, neutral and mixed. Baseline experiments were conducted to provide benchmark results for future work. (c) 2017 The Authors. Published by Elsevier B.V.
Pages: 63-72
Page count: 10