On Enhancing the Label Propagation Algorithm for Sentiment Analysis Using Active Learning with an Artificial Oracle

被引:0
|
作者
Yazidi, Anis [1 ]
Hammer, Hugo Lewi [1 ]
Bai, Aleksander [1 ]
Engelstad, Paal [1 ]
机构
[1] Oslo & Akershus Univ Coll Appl Sci, Dept Comp Sci, Oslo, Norway
关键词
Sentiment analysis; Label propagation; Active learning;
D O I
10.1007/978-3-319-19369-4_71
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A core component of Sentiment Analysis is the generation of sentiment lists. Label propagation is equivocally one of the most used approaches for generating sentiment lists based on annotated seed words in a manual manner. Words which are situated many hops away from the seed words tend to get low sentiment values. Such inherent property of the Label Propagation algorithm poses a controversial challenge in sentiment analysis. In this paper, we propose an iterative approach based on the theory of Active Learning [1] that attempts to remedy to this problem without any need for additional manual labeling. Our algorithm is bootstrapped with a limited amount of seeds. Then, at each iteration, a fixed number of "informative words" are selected as new seeds for labeling according to different criteria that we will elucidate in the paper. Subsequently, the Label Propagation is retrained in the next iteration with the additional labeled seeds. A major contribution of this article is that, unlike the theory of Active Learning that prompts the user for additional labeling, we generate the additional seeds with an Artificial Oracle. This is radically different from the main stream of Active Learning Theory that resorts to a human (user) as oracle for labeling those additional seeds. Consequently, we relieve the user from the cumbersome task of manual annotation while still achieving a high performance. The lexicons were evaluated by classifying product and movie reviews. Most of the generated sentiment lexicons using Active learning perform better than the Label Propagation algorithm.
引用
收藏
页码:799 / 810
页数:12
相关论文
共 50 条
  • [1] Khmer Sentiment Lexicon Based on PU Learning and Label Propagation Algorithm
    Li, Chao
    Yan, Xin
    Xu, Guangyi
    Deng, Zhongying
    Mo, Yuanyuan
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (03)
  • [2] Enhancing Sentiment Analysis Using Hybrid Deep Learning
    Ukaihongsar, Watthana
    Jitsakul, Watchareewan
    [J]. PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON COMPUTING AND INFORMATION TECHNOLOGY (IC2IT 2022), 2022, 453 : 183 - 193
  • [3] Tag Me a Label with Multi-arm: Active Learning for Telugu Sentiment Analysis
    Mukku, Sandeep Sricharan
    Oota, Subba Reddy
    Mamidi, Radhika
    [J]. BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2017, 2017, 10440 : 355 - 367
  • [4] An efficient approach for sentiment analysis using machine learning algorithm
    A. Naresh
    P. Venkata Krishna
    [J]. Evolutionary Intelligence, 2021, 14 : 725 - 731
  • [5] An efficient approach for sentiment analysis using machine learning algorithm
    Naresh, A.
    Krishna, R. Venkata
    [J]. EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 725 - 731
  • [6] Visual Sentiment Analysis With Active Learning
    Chen, Jie
    Mao, Qirong
    Xue, Luoyang
    [J]. IEEE ACCESS, 2020, 8 : 185899 - 185908
  • [7] Active learning for Arabic sentiment analysis
    Kaseb, Abdelrahman
    Farouk, Mona
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2023, 77 : 177 - 187
  • [8] Active Learning for Turkish Sentiment Analysis
    Cetin, Mahmut
    Amasyali, M. Fatih
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (IEEE INISTA), 2013,
  • [9] Image Annotation Using Label Propagation Algorithm
    Marukatat, Sanparith
    [J]. ECTI-CON 2008: PROCEEDINGS OF THE 2008 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY, VOLS 1 AND 2, 2008, : 57 - 60
  • [10] Comparative Sentiment Analysis using Difference Types of Machine Learning Algorithm
    Hossain, Rakib
    Ahamed, Fowjael
    Zannat, Raihana
    Rabbani, Md Golam
    [J]. PROCEEDINGS OF THE 2019 8TH INTERNATIONAL CONFERENCE ON SYSTEM MODELING & ADVANCEMENT IN RESEARCH TRENDS (SMART-2019), 2019, : 329 - 333