Semi-automatic sentiment analysis based on topic modeling

被引:5
|
作者
Sokhin, Timur [1 ]
Butakov, Nikolay [1 ]
机构
[1] ITMO Univ, 49 Kronverksky Pr, St Petersburg 197101, Russia
关键词
Sentiment analysis; Topic modeling; Semi-supervised learning; ARTM;
D O I
10.1016/j.procs.2018.08.286
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Sentiment is an important feature of natural language. It is used to understand semantic of texts and opinion of people. There are many practical applications, which require to extract sentiment from texts: advertising analytics, interactive chat bots, opinion mining Today, different supervised techniques are used to extract sentiment from texts which require large manually labeled datasets that are expensive and time consuming to build. Moreover, such datasets should cover vocabularies and patterns of use of different contexts. Additionally, the efficiency of supervised methods trained on a well-written texts can dramatically decrease on users' texts from social media due to typos, slang, short length of sentences. To solve these problems and to reduce human involvement, we propose semi-supervised sentiment analysis method based on topic modeling with Additive Regularization. To evaluate the efficiency of this method we applied it to several open-source datasets for which sentiment labels are available. The study shows promising results in terms of f1-score with minimal human involvement. (C) 2018 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by/nc-nd/3.0/) Peer-review under responsibility of the scientific committee of the 7th International Young Scientist Conference on Computational Science.
引用
收藏
页码:284 / 292
页数:9
相关论文
共 50 条
  • [1] Semi-automatic terminology ontology learning based on topic modeling
    Rani, Monika
    Dhar, Amit Kumar
    Vyas, O. P.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 63 : 108 - 125
  • [2] Semi-automatic construction of topic ontologies
    Fortuna, Blaz
    Mladenic, Dunja
    Grobelnik, Marko
    [J]. SEMANTICS, WEB AND MINING, 2006, 4289 : 121 - 131
  • [3] Arabic Sentiment Analysis based on Topic Modeling
    Bekkali, Mohammed
    Lachkar, Abdelmonaime
    [J]. PROCEEDINGS OF THE SECOND CONFERENCE OF THE MOROCCAN CLASSIFICATION SOCIETY: NEW CHALLENGES IN DATA SCIENCES (SMC '2019), 2019, : 117 - 122
  • [4] Semi-automatic training set construction for supervised sentiment analysis in political contexts
    Martin-Gutierrez, S.
    Losada, J. C.
    Benito, R. M.
    [J]. 2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2018, : 715 - 720
  • [5] Semi-automatic modeling by constraint acquisition
    Coletta, R
    Bessière, C
    O'Sullivan, B
    Freuder, EC
    O'Connell, S
    Quinqueton, J
    [J]. PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING - CP 2003, PROCEEDINGS, 2003, 2833 : 812 - 816
  • [6] Semi-automatic construction of a named entity dictionary for entity-based sentiment analysis in social media
    Song, Yeongkil
    Jeong, Seokwon
    Kim, Harksoo
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (09) : 11319 - 11329
  • [7] Constraint acquisition as semi-automatic modeling
    Coletta, R
    Bessiere, C
    O'Sullivan, B
    Freuder, EC
    O'Connell, S
    Quinqueton, J
    [J]. RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XX, 2004, : 111 - 124
  • [8] Semi-automatic construction of a named entity dictionary for entity-based sentiment analysis in social media
    Yeongkil Song
    Seokwon Jeong
    Harksoo Kim
    [J]. Multimedia Tools and Applications, 2017, 76 : 11319 - 11329
  • [9] Information Modeling: The Need of Semi-Automatic Model Analysis and Transformation
    Weller, Jens
    Juhrisch, Martin
    Grossmann, Knut
    [J]. AMCIS 2010 PROCEEDINGS, 2010,
  • [10] SEMI-AUTOMATIC ANALYSIS OF NYSTAGMUS
    SCHERER, H
    HAUSMANN, G
    [J]. ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 1976, 213 (02): : 484 - 485