Combine HowNet lexicon to train phrase recursive autoencoder for sentence-level sentiment analysis

被引:77
|
作者
Fu, Xianghua [1 ]
Liu, Wangwang [1 ]
Xu, Yingying [1 ]
Cui, Laizhong [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; Recursive autoencoder; HowNet lexicon; Phrase structure tree;
D O I
10.1016/j.neucom.2017.01.079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting sentiment of sentences in online reviews is still a challenging task. Traditional machine learning methods often use bag-of-words representations which cannot properly capture complex linguistic phenomena in sentiment analysis. Recently, recursive autoencoder (RAE) methods have been proposed for sentence-level sentiment analysis. They use word embedding to represent each word, and learn compositional vector representation of phrases and sentences with recursive autoencoders. Although RAE methods outperform other state-of-the-art sentiment prediction approaches on commonly used datasets, they tend to generate very deep parse trees, and need a large amount of labeled data for each node during the process of learning compositional vector representations. Furthermore, RAE methods mainly combine adjacent words in sequence with a greedy strategy, which make capturing semantic relations between distant words difficult. To solve these issues, we propose a semi-supervised method which combines HowNet lexicon to train phrase recursive autoencoders (we call it CHL-PRAE). CHL-PRAE constructs the phrase recursive autoencoder (PRAE) model at first. Then the model calculates the sentiment orientation of each node with the HowNet lexicon, which acts as sentiment labels, when we train the softmax classifier of PRAE. Furthermore, our CHL-PRAE model conducts bidirectional training to capture global information. Compared with RAE and some supervised methods such as support vector machine (SVM) and naive Bayesian on English and Chinese datasets, the experiment results show that CHL-PRAE can provide the best performance for sentence-level sentiment analysis. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:18 / 27
页数:10
相关论文
共 50 条
  • [31] Word Embeddings-based Sentence-Level Sentiment Analysis considering Word Importance
    Hayashi, Toshitaka
    Fujita, Hamido
    ACTA POLYTECHNICA HUNGARICA, 2019, 16 (07) : 7 - 24
  • [32] A Chinese Short Text Similarity Method Integrating Sentence-Level and Phrase-Level Semantics
    Shen, Zhenji
    Xiao, Zhiyong
    ELECTRONICS, 2024, 13 (24):
  • [34] Chinese Sentence-level Event Factuality Identification with Recursive Neural Network
    Yi, Qingqing
    Qian, Zhong
    Li, Peifeng
    Zhu, Qiaoming
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [35] A Dynamic Conditional Random Field Based Framework for Sentence-level Sentiment Analysis of Chinese Microblog
    Hao, Zhifeng
    Cai, Ruichu
    Yang, Yiyang
    Wen, Wen
    Liang, Lixin
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) AND IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC), VOL 1, 2017, : 135 - 142
  • [36] The synergy of double attention: Combine sentence-level and word-level attention for image captioning
    Wei, Haiyang
    Li, Zhixin
    Zhang, Canlong
    Ma, Huifang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2020, 201
  • [37] A Deep Neural Architecture for Sentence-Level Sentiment Classification in Twitter Social Networking
    Huy Nguyen
    Minh-Le Nguyen
    COMPUTATIONAL LINGUISTICS, PACLING 2017, 2018, 781 : 15 - 27
  • [38] Sentence-Level Sentiment Classification A Comparative Study between Deep Learning Models
    Mifrah S.
    Benlahmar E.H.
    Journal of ICT Standardization, 2022, 10 (02): : 339 - 352
  • [39] NileULex: A Phrase and Word Level Sentiment Lexicon for Egyptian and Modern Standard Arabic
    El-Beltagy, Samhaa R.
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 2900 - 2905
  • [40] Do Sentence-Level Sentiment Interactions Matter? Sentiment Mixed Heterogeneous Network for Fake News Detection
    Zhang, Hao
    Li, Zonglin
    Liu, Sanya
    Huang, Tao
    Ni, Zhouwei
    Zhang, Jian
    Lv, Zhihan
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 5090 - 5100