Bayesian estimation-based sentiment word embedding model for sentiment analysis

被引:8
|
作者
Tang, Jingyao [1 ]
Xue, Yun [1 ]
Wang, Ziwen [1 ]
Hu, Shaoyang [2 ,3 ]
Gong, Tao [4 ,5 ]
Chen, Yinong [6 ]
Zhao, Haoliang [1 ]
Xiao, Luwei [1 ]
机构
[1] South China Normal Univ, Sch Phys & Telecommun Engn, Guangdong Prov Key Lab Quantum Engn & Quantum Mat, Guangzhou, Peoples R China
[2] South China Agr Univ, Coll Math & Informat, Guangzhou, Peoples R China
[3] South China Agr Univ, Coll Software Engn, Guangzhou, Peoples R China
[4] Zhejiang Univ Finance & Econ, Sch Foreign Languages, Hangzhou, Zhejiang, Peoples R China
[5] Educ Testing Serv, Princeton, NJ 08541 USA
[6] Arizona State Univ, Sch Comp Informat & Decis Syst Engn, Tempe, AZ USA
关键词
All Open Access; Gold;
D O I
10.1049/cit2.12037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment word embedding has been extensively studied and used in sentiment analysis tasks. However, most existing models have failed to differentiate high-frequency and low-frequency words. Accordingly, the sentiment information of low-frequency words is insufficiently captured, thus resulting in inaccurate sentiment word embedding and degradation of overall performance of sentiment analysis. A Bayesian estimation-based sentiment word embedding (BESWE) model, which aims to precisely extract the sentiment information of low-frequency words, has been proposed. In the model, a Bayesian estimator is constructed based on the co-occurrence probabilities and sentiment probabilities of words, and a novel loss function is defined for sentiment word embedding learning. The experimental results based on the sentiment lexicons and Movie Review dataset show that BESWE outperforms many state-of-the-art methods, for example, C&W, CBOW, GloVe, SE-HyRank and DLJT1, in sentiment analysis tasks, which demonstrate that Bayesian estimation can effectively capture the sentiment information of low-frequency words and integrate the sentiment information into the word embedding through the loss function. In addition, replacing the embedding of low-frequency words in the state-of-the-art methods with BESWE can significantly improve the performance of those methods in sentiment analysis tasks.
引用
收藏
页码:144 / 155
页数:12
相关论文
共 50 条
  • [1] Sentiment Analysis with Word Embedding
    Deho, Oscar B.
    Agangiba, William A.
    Aryeh, Felix L.
    Ansah, Jeffery A.
    [J]. 2018 IEEE 7TH INTERNATIONAL CONFERENCE ON ADAPTIVE SCIENCE & TECHNOLOGY (IEEE ICAST), 2018,
  • [2] Emotional Similarity Word Embedding Model for Sentiment Analysis
    Matsumoto, Kazuyuki
    Matsunaga, Takumi
    Yoshida, Minoru
    Kita, Kenji
    [J]. COMPUTACION Y SISTEMAS, 2022, 26 (02): : 875 - 886
  • [3] Sentiment-Specific Word Embedding for Indonesian Sentiment Analysis
    Farhan, Ahmad Naufal
    Khodra, Masayu Leylia
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS, CONCEPTS, THEORY, AND APPLICATIONS (ICAICTA) PROCEEDINGS, 2017,
  • [4] A New Sentiment-Enhanced Word Embedding Method for Sentiment Analysis
    Li, Qizhi
    Li, Xianyong
    Du, Yajun
    Fan, Yongquan
    Chen, Xiaoliang
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (20):
  • [5] Exploring Word Embedding for Arabic Sentiment Analysis
    Gayed, Sana
    Mallat, Souheyl
    Zrigui, Mounir
    [J]. RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, 2022, 1716 : 92 - 101
  • [6] Exploring Word Embedding For Bangla Sentiment Analysis
    Sumit, Sakhawat Hosain
    Hossan, Md. Zakir
    Al Muntasir, Tareq
    Sourov, Tanvir
    [J]. 2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [7] Document Sentiment Classification based on the Word Embedding
    Yin, Yanping
    Jin, Zhong
    [J]. PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON MECHATRONICS, MATERIALS, CHEMISTRY AND COMPUTER ENGINEERING 2015 (ICMMCCE 2015), 2015, 39 : 456 - 461
  • [8] Sentiment Analysis of Chinese Words Using Word Embedding and Sentiment Morpheme Matching
    Niu, Jianwei
    Sun, Mingsheng
    Mo, Shasha
    [J]. COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2017, 2018, 252 : 3 - 12
  • [9] Sentiment Analysis on Microblog Data based on Word Embedding and Fusion Techniques
    Hayran, Ahmet
    Sert, Mustafa
    [J]. 2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [10] Probabilistic Neural Network and Word Embedding for Sentiment Analysis
    Alam, Saqib
    Yao, Nianmin
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (07) : 48 - 53