Combine HowNet lexicon to train phrase recursive autoencoder for sentence-level sentiment analysis

被引:77
|
作者
Fu, Xianghua [1 ]
Liu, Wangwang [1 ]
Xu, Yingying [1 ]
Cui, Laizhong [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; Recursive autoencoder; HowNet lexicon; Phrase structure tree;
D O I
10.1016/j.neucom.2017.01.079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting sentiment of sentences in online reviews is still a challenging task. Traditional machine learning methods often use bag-of-words representations which cannot properly capture complex linguistic phenomena in sentiment analysis. Recently, recursive autoencoder (RAE) methods have been proposed for sentence-level sentiment analysis. They use word embedding to represent each word, and learn compositional vector representation of phrases and sentences with recursive autoencoders. Although RAE methods outperform other state-of-the-art sentiment prediction approaches on commonly used datasets, they tend to generate very deep parse trees, and need a large amount of labeled data for each node during the process of learning compositional vector representations. Furthermore, RAE methods mainly combine adjacent words in sequence with a greedy strategy, which make capturing semantic relations between distant words difficult. To solve these issues, we propose a semi-supervised method which combines HowNet lexicon to train phrase recursive autoencoders (we call it CHL-PRAE). CHL-PRAE constructs the phrase recursive autoencoder (PRAE) model at first. Then the model calculates the sentiment orientation of each node with the HowNet lexicon, which acts as sentiment labels, when we train the softmax classifier of PRAE. Furthermore, our CHL-PRAE model conducts bidirectional training to capture global information. Compared with RAE and some supervised methods such as support vector machine (SVM) and naive Bayesian on English and Chinese datasets, the experiment results show that CHL-PRAE can provide the best performance for sentence-level sentiment analysis. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:18 / 27
页数:10
相关论文
共 50 条
  • [41] Comparing Sentence-Level Features for Authorship Analysis in Portuguese
    Sousa-Silva, Rui
    Sarmento, Luis
    Grant, Tim
    Oliveira, Eugenio
    Maia, Belinda
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2010, 6001 : 51 - +
  • [42] Three-way enhanced convolutional neural networks for sentence-level sentiment classification
    Zhang, Yuebing
    Zhang, Zhifei
    Miao, Duoqian
    Wang, Jiaqi
    INFORMATION SCIENCES, 2019, 477 : 55 - 64
  • [43] Recurrence Quantification Analysis of Sentence-Level Speech Kinematics
    Jackson, Eric S.
    Tiede, Mark
    Riley, Michael A.
    Whalen, D. H.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2016, 59 (06): : 1315 - 1326
  • [44] Sentiment classification in English from sentence-level annotations of emotions regarding models of affect
    Trilla, Alexandre
    Alias, Francesc
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 508 - 511
  • [45] Sentence-Level Sentiment Analysis of Financial News Using Distributed Text Representations and Multi-Instance Learning
    Lutz, Bernhard
    Prollochs, Nicolas
    Neumann, Dirk
    PROCEEDINGS OF THE 52ND ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2019, : 1116 - 1125
  • [46] FeDN2: Fuzzy-Enhanced Deep Neural Networks for Improvement of Sentence-Level Sentiment Analysis
    Phan, Huyen Trang
    Pham, Dinh Tai
    Nguyen, Ngoc Thanh
    CYBERNETICS AND SYSTEMS, 2023,
  • [47] Multi-aspect Blog Sentiment Analysis Based on LDA Topic Model and Hownet Lexicon
    Fu, Xianghua
    Liu, Guo
    Guo, Yanyan
    Guo, Wubiao
    WEB INFORMATION SYSTEMS AND MINING, PT II, 2011, 6988 : 131 - 138
  • [48] Review-Based Sentiment Prediction of Rating Using Natural Language Processing Sentence-Level Sentiment Analysis with Bag-of-Words Approach
    Raju, K. Venkata
    Sridhar, M.
    FIRST INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR COMPUTATIONAL INTELLIGENCE, 2020, 1045 : 807 - 821
  • [49] A Method for Extracting Lexicon for Sentiment Analysis Based on Morphological Sentence Patterns
    Han, Youngsub
    Kim, Yanggon
    Jang, Ikhyeon
    SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS, 2016, 654 : 85 - 101
  • [50] Sentence-Level Automatic Lecture Highlighting Based on Acoustic Analysis
    Che, Xiaoyin
    Luo, Sheng
    Yang, Haojin
    Meinel, Christoph
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2016, : 328 - 334