A System for Extracting Sentiment from Large-Scale Arabic Social Data

被引:2
|
作者
Wang, Hao [1 ]
Bommireddipalli, Vijay R. [1 ]
Hanafy, Ayman [2 ]
Bahgat, Mohamed [2 ]
Noeman, Sara [2 ]
Emam, Ossama S. [2 ]
机构
[1] IBM Corp, Silicon Valley Lab, San Jose, CA 95120 USA
[2] IBM Corp, Cairo Human Language Technol Grp, Cairo, Egypt
关键词
Arabic; Sentiment Analysis; Social Data; Big Data;
D O I
10.1109/ACLing.2015.17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media data in Arabic language is becoming more and more abundant. It is a consensus that valuable information lies in social media data. Mining this data and making the process easier are gaining momentum in the industries. This paper describes an enterprise system we developed for extracting sentiment from large volumes of social data in Arabic dialects. First, we give an overview of the Big Data system for information extraction from multilingual social data from a variety of sources. Then, we focus on the Arabic sentiment analysis capability that was built on top of the system including normalizing written Arabic dialects, building sentiment lexicons, sentiment classification, and performance evaluation. Lastly, we demonstrate the value of enriching sentiment results with user profiles in understanding sentiments of a specific user group.
引用
收藏
页码:71 / 77
页数:7
相关论文
共 50 条
  • [31] Logo information recognition in large-scale social media data
    Fanglin Wang
    Shuhan Qi
    Ge Gao
    Sicheng Zhao
    Xiangyu Wang
    Multimedia Systems, 2016, 22 : 63 - 73
  • [32] EVOLVE: HPC and Cloud Enhanced Testbed for Extracting Value from Large-scale Diverse Data Invited Paper
    Chazapis, Antony
    Acquaviva, Jean-Thomas
    Bilas, Angelos
    Gardikis, Georgios
    Kozanitis, Christos
    Louloudakis, Stelios
    Huy-Nam Nguyen
    Pinto, Christian
    Scharl, Arno
    Soudris, Dimitrios
    PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS 2021 (CF 2021), 2021, : 200 - 205
  • [33] Extracting socio-cultural networks of the Sudan from open-source, large-scale text data
    Diesner, Jana
    Carley, Kathleen M.
    Tambayong, Laurent
    COMPUTATIONAL AND MATHEMATICAL ORGANIZATION THEORY, 2012, 18 (03) : 328 - 339
  • [34] SocialGate: Managing Large-Scale Social Data on Home Gateways
    Koll, David
    Lechler, Dieter
    Fu, Xiaoming
    2017 IEEE 25TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP), 2017,
  • [35] Distributed Large-Scale Data Collection in Online Social Networks
    Efstathiades, Hariton
    Antoniades, Demetris
    Pallis, George
    Dikaiakos, Marios D.
    2016 IEEE 2ND INTERNATIONAL CONFERENCE ON COLLABORATION AND INTERNET COMPUTING (IEEE CIC), 2016, : 373 - 380
  • [36] Logo information recognition in large-scale social media data
    Wang, Fanglin
    Qi, Shuhan
    Gao, Ge
    Zhao, Sicheng
    Wang, Xiangyu
    MULTIMEDIA SYSTEMS, 2016, 22 (01) : 63 - 73
  • [37] Extracting socio-cultural networks of the Sudan from open-source, large-scale text data
    Jana Diesner
    Kathleen M. Carley
    Laurent Tambayong
    Computational and Mathematical Organization Theory, 2012, 18 : 328 - 339
  • [38] ACOUSTICS, CONTENT AND GEO-INFORMATION BASED SENTIMENT PREDICTION FROM LARGE-SCALE NETWORKED VOICE DATA
    Ren, Zhu
    Jia, Jia
    Guo, Quan
    Zhang, Kuo
    Cai, Lianhong
    2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2014,
  • [39] Extracting Emergent Semantics from Large-Scale User-Generated Content
    Kompatsiaris, Ioannis
    Diplaris, Sotiris
    Papadopoulos, Symeon
    ICT INNOVATIONS 2011, 2011, 150 : 27 - 37
  • [40] Efficacy of extracting indices from large-scale acoustic recordings to monitor biodiversity
    Buxton, Rachel T.
    McKenna, Megan F.
    Clapp, Mary
    Meyer, Erik
    Stabenau, Erik
    Angeloni, Lisa M.
    Crooks, Kevin
    Wittemyer, George
    CONSERVATION BIOLOGY, 2018, 32 (05) : 1174 - 1184