RIFT: A Rule Induction Framework for Twitter Sentiment Analysis

被引:30
|
作者
Asghar, Muhammad Zubair [1 ]
Khan, Aurangzeb [2 ]
Khan, Furqan [1 ]
Kundi, Fazal Masud [1 ]
机构
[1] Gomal Univ, Inst Comp & Informat Technol, Dera Ismail Khan, Pakistan
[2] Univ Sci & Technol, Dept Comp Sci, Bannu, Pakistan
关键词
Twitter; Sentiment analysis; Rule induction; Slang; Emoticons; Rough set theory; LEM2;
D O I
10.1007/s13369-017-2770-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The rapid evolution of microblogging and the emergence of sites such as Twitter have propelled online communities to flourish by enabling people to create, share and disseminate free-flowing messages and information globally. The exponential growth of product-based user reviews has become an ever-increasing resource playing a key role in emerging Twitter-based sentiment analysis (SA) techniques and applications to collect and analyse customer trends and reviews. Existing studies on supervised black-box sentiment analysis systems do not provide adequate information, regarding rules as to why a certain review was classified to a class or classification. The accuracy in some ways is less than our personal judgement. To address these shortcomings, alternative approaches, such as supervised white-box classification algorithms, need to be developed to improve the classification of Twitter-based microblogs. The purpose of this study was to develop a supervised white-box microblogging SA system to analyse user reviews on certain products using rough set theory (RST)-based rule induction algorithms. RST classifies microblogging reviews of products into positive, negative, or neutral class using different rules extracted from training decision tables using RST-centric rule induction algorithms. The primary focus of this study is also to perform sentiment classification of microblogs (i.e. also known as tweets) of product reviews using conventional, and RST-based rule induction algorithms. The proposed RST-centric rule induction algorithm, namely Learning from Examples Module version: 2, and LEM2 Corpus-based rules (LEM2 CBR),which is an extension of the traditional LEM2 algorithm, are used. Corpus-based rules are generated from tweets, which are unclassified using other conventional LEM2 algorithm rules. Experimental results show the proposed method, when compared with baseline methods, is excellent, with regard to accuracy, coverage and the number of rules employed. The approach using this method achieves an average accuracy of 92.57% and an average coverage of 100%, with an average number of rules of 19.14.
引用
收藏
页码:857 / 877
页数:21
相关论文
共 50 条
  • [21] Pre-processing Framework for Twitter Sentiment Classification
    Dritsas, Elias
    Vonitsanos, Gerasimos
    Livieris, Ioannis E.
    Kanavo, Andreas
    Ilias, Aristidis
    Makris, Christos
    Tsakalidis, Athanasios
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS (AIAI 2019), 2019, 560 : 138 - 149
  • [22] Sentiment Analytics Framework Integrating Twitter and Odoo ERP
    Dussoye, Hirikesh
    Cadersaib, Zarine
    [J]. 2017 INTERNATIONAL CONFERENCE ON INFOCOM TECHNOLOGIES AND UNMANNED SYSTEMS (TRENDS AND FUTURE DIRECTIONS) (ICTUS), 2017, : 145 - 151
  • [23] SENTIMENT ANALYSIS OF THE SYRIAN CONFLICT ON TWITTER
    Lucic, Danijela
    Katalinic, Josip
    Dokman, Tomislav
    [J]. MEDIJSKE STUDIJE-MEDIA STUDIES, 2020, 11 (22): : 46 - 61
  • [24] Analysis of Political Sentiment Orientations on Twitter
    Ansari, Mohd Zeeshan
    Aziz, M. B.
    Siddiqui, M. O.
    Mehra, H.
    Singh, K. P.
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1821 - 1828
  • [25] Clustering and Sentiment Analysis on Twitter Data
    Ahuja, Shreya
    Dubey, Gaurav
    [J]. 2017 2ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATION AND NETWORKS (TEL-NET), 2017, : 420 - 424
  • [26] Sentiment Analysis of Turkish Twitter Data
    Shehu, Harisu Abdullahi
    Tokat, Sezai
    Sharif, Md. Haidar
    Uyaver, Sahin
    [J]. THIRD INTERNATIONAL CONFERENCE OF MATHEMATICAL SCIENCES (ICMS 2019), 2019, 2183
  • [27] Sentiment analysis and Twitter: a game proposal
    Marco Furini
    Manuela Montangero
    [J]. Personal and Ubiquitous Computing, 2018, 22 : 771 - 785
  • [28] Sentiment analysis of multimodal twitter data
    Kumar, Akshi
    Garg, Geetanjali
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (17) : 24103 - 24119
  • [29] Exploring Sentiment Analysis on Twitter Data
    Venugopalan, Manju
    Gupta, Deepa
    [J]. 2015 EIGHTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2015, : 241 - 247
  • [30] Contextual semantics for sentiment analysis of Twitter
    Saif, Hassan
    He, Yulan
    Fernandez, Miriam
    Alani, Harith
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2016, 52 (01) : 5 - 19