A System for Extracting Sentiment from Large-Scale Arabic Social Data

被引:2
|
作者
Wang, Hao [1 ]
Bommireddipalli, Vijay R. [1 ]
Hanafy, Ayman [2 ]
Bahgat, Mohamed [2 ]
Noeman, Sara [2 ]
Emam, Ossama S. [2 ]
机构
[1] IBM Corp, Silicon Valley Lab, San Jose, CA 95120 USA
[2] IBM Corp, Cairo Human Language Technol Grp, Cairo, Egypt
关键词
Arabic; Sentiment Analysis; Social Data; Big Data;
D O I
10.1109/ACLing.2015.17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media data in Arabic language is becoming more and more abundant. It is a consensus that valuable information lies in social media data. Mining this data and making the process easier are gaining momentum in the industries. This paper describes an enterprise system we developed for extracting sentiment from large volumes of social data in Arabic dialects. First, we give an overview of the Big Data system for information extraction from multilingual social data from a variety of sources. Then, we focus on the Arabic sentiment analysis capability that was built on top of the system including normalizing written Arabic dialects, building sentiment lexicons, sentiment classification, and performance evaluation. Lastly, we demonstrate the value of enriching sentiment results with user profiles in understanding sentiments of a specific user group.
引用
收藏
页码:71 / 77
页数:7
相关论文
共 50 条
  • [1] Extracting insights from social media with large-scale matrix approximations
    Sindhwani, V.
    Ghoting, A.
    Ting, E.
    Lawrence, R.
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2011, 55 (05)
  • [2] AraSenTi: Large-Scale Twitter-Specific Arabic Sentiment Lexicons
    Al-Twairesh, Nora
    Al-Khalifa, Hend
    Al-Salman, AbdulMalik
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 697 - 705
  • [3] Automatic expandable large-scale sentiment lexicon of Modern Standard Arabic and Colloquial
    Ibrahim, Hossam S.
    Abdou, Sherif M.
    Gheith, Mervat
    2015 FIRST INTERNATIONAL CONFERENCE ON ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2015): ADVANCES IN ARABIC COMPUTATIONAL LINGUISTICS, 2015, : 94 - 99
  • [4] Temporal Sentiment Tracking and Analysis on Large-scale Social Events
    Hazimeh, Hussein
    Harissa, Mohammad
    Mugellini, Elena
    Abou Khaled, Omar
    2019 8TH INTERNATIONAL CONFERENCE ON SOFTWARE AND COMPUTER APPLICATIONS (ICSCA 2019), 2019, : 17 - 21
  • [5] Twitter Sentiment Analysis for Large-Scale Data: An Unsupervised Approach
    Pandarachalil, Rafeeque
    Sendhilkumar, Selvaraju
    Mahalakshmi, G. S.
    COGNITIVE COMPUTATION, 2015, 7 (02) : 254 - 262
  • [6] Twitter Sentiment Analysis for Large-Scale Data: An Unsupervised Approach
    Rafeeque Pandarachalil
    Selvaraju Sendhilkumar
    G. S. Mahalakshmi
    Cognitive Computation, 2015, 7 : 254 - 262
  • [7] Extracting large-scale knowledge bases from the web
    Kumar, R
    Raghavan, P
    Rajagopalan, S
    Tomkins, A
    PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, 1999, : 639 - 650
  • [8] Extracting Domain-specific Concepts from Large-scale Linked Open Data
    Kume, Satoshi
    Kozaki, Kouji
    PROCEEDINGS OF THE 10TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE GRAPHS (IJCKG 2021), 2021, : 28 - 37
  • [9] Extracting Human Activity Areas from Large-Scale Spatial Data with Varying Densities
    Shen, Xiaoqi
    Shi, Wenzhong
    Liu, Zhewei
    Zhang, Anshu
    Wang, Lukang
    Zeng, Fanxin
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2022, 11 (07)
  • [10] A strategy for extracting and analyzing large-scale quantitative epistatic interaction data
    Sean R Collins
    Maya Schuldiner
    Nevan J Krogan
    Jonathan S Weissman
    Genome Biology, 7