Combine HowNet lexicon to train phrase recursive autoencoder for sentence-level sentiment analysis

Cited by: 77
Authors
Fu, Xianghua [1]
Liu, Wangwang [1]
Xu, Yingying [1]
Cui, Laizhong [1]
Affiliations
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Sentiment analysis; Recursive autoencoder; HowNet lexicon; Phrase structure tree
DOI
10.1016/j.neucom.2017.01.079
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Detecting the sentiment of sentences in online reviews remains a challenging task. Traditional machine learning methods often use bag-of-words representations, which cannot properly capture complex linguistic phenomena in sentiment analysis. Recently, recursive autoencoder (RAE) methods have been proposed for sentence-level sentiment analysis. They use word embeddings to represent each word and learn compositional vector representations of phrases and sentences with recursive autoencoders. Although RAE methods outperform other state-of-the-art sentiment prediction approaches on commonly used datasets, they tend to generate very deep parse trees and need a large amount of labeled data for each node when learning compositional vector representations. Furthermore, RAE methods mainly combine adjacent words in sequence with a greedy strategy, which makes it difficult to capture semantic relations between distant words. To solve these issues, we propose a semi-supervised method that combines the HowNet lexicon with the training of phrase recursive autoencoders (CHL-PRAE). CHL-PRAE first constructs the phrase recursive autoencoder (PRAE) model. The model then calculates the sentiment orientation of each node with the HowNet lexicon, and these orientations act as sentiment labels when the softmax classifier of PRAE is trained. Furthermore, CHL-PRAE conducts bidirectional training to capture global information. Experimental results on English and Chinese datasets show that CHL-PRAE provides the best performance for sentence-level sentiment analysis among the compared methods, which include RAE and supervised methods such as support vector machine (SVM) and naive Bayes. © 2017 Elsevier B.V. All rights reserved.
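The two ingredients named in the abstract, an autoencoder composition step and lexicon-derived node labels, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy lexicon `TOY_LEXICON` stands in for HowNet, the sum-of-polarities rule in `node_orientation` and the logistic mapping in `orientation_to_label` are hypothetical, and the weights and vectors are random stand-ins rather than trained parameters or real word embeddings.

```python
import math
import random

# Hypothetical toy polarity lexicon standing in for HowNet (not real HowNet data).
TOY_LEXICON = {"good": 1.0, "great": 1.0, "bad": -1.0, "boring": -1.0}

def node_orientation(words, lexicon=TOY_LEXICON):
    """Sum the polarities of the words a tree node covers (assumed scoring rule)."""
    return sum(lexicon.get(w, 0.0) for w in words)

def orientation_to_label(score):
    """Map a score to a (negative, positive) target distribution (assumed mapping)."""
    p_pos = 1.0 / (1.0 + math.exp(-score))
    return (1.0 - p_pos, p_pos)

def matvec(W, x, b):
    """Compute W x + b for a list-of-rows matrix W."""
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]

def compose(c1, c2, W_enc, b_enc):
    """Autoencoder encoding step: parent = tanh(W_enc [c1; c2] + b_enc)."""
    return [math.tanh(v) for v in matvec(W_enc, c1 + c2, b_enc)]

def reconstruction_error(c1, c2, parent, W_dec, b_dec):
    """Squared error between the children and their reconstruction from the parent."""
    recon = matvec(W_dec, parent, b_dec)
    return sum((r - c) ** 2 for r, c in zip(recon, c1 + c2))

def random_matrix(rows, cols, rng):
    """Small random weights; a real model would learn these."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
dim = 4
W_enc, b_enc = random_matrix(dim, 2 * dim, rng), [0.0] * dim
W_dec, b_dec = random_matrix(2 * dim, dim, rng), [0.0] * (2 * dim)

# Two child vectors (random stand-ins for word embeddings).
c1 = [rng.uniform(-1, 1) for _ in range(dim)]
c2 = [rng.uniform(-1, 1) for _ in range(dim)]

parent = compose(c1, c2, W_enc, b_enc)
err = reconstruction_error(c1, c2, parent, W_dec, b_dec)

# Lexicon-derived target for the node covering the phrase "great movie".
label = orientation_to_label(node_orientation(["great", "movie"]))
```

In a semi-supervised setup of this kind, a target such as `label` would supervise a softmax classifier on `parent`, while `err` would serve as the unsupervised reconstruction loss. How the paper actually weights or combines the two is not stated in the abstract.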
Pages: 18-27
Page count: 10