Combine HowNet lexicon to train phrase recursive autoencoder for sentence-level sentiment analysis

被引:77
|
作者
Fu, Xianghua [1 ]
Liu, Wangwang [1 ]
Xu, Yingying [1 ]
Cui, Laizhong [1 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; Recursive autoencoder; HowNet lexicon; Phrase structure tree;
D O I
10.1016/j.neucom.2017.01.079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting sentiment of sentences in online reviews is still a challenging task. Traditional machine learning methods often use bag-of-words representations which cannot properly capture complex linguistic phenomena in sentiment analysis. Recently, recursive autoencoder (RAE) methods have been proposed for sentence-level sentiment analysis. They use word embedding to represent each word, and learn compositional vector representation of phrases and sentences with recursive autoencoders. Although RAE methods outperform other state-of-the-art sentiment prediction approaches on commonly used datasets, they tend to generate very deep parse trees, and need a large amount of labeled data for each node during the process of learning compositional vector representations. Furthermore, RAE methods mainly combine adjacent words in sequence with a greedy strategy, which make capturing semantic relations between distant words difficult. To solve these issues, we propose a semi-supervised method which combines HowNet lexicon to train phrase recursive autoencoders (we call it CHL-PRAE). CHL-PRAE constructs the phrase recursive autoencoder (PRAE) model at first. Then the model calculates the sentiment orientation of each node with the HowNet lexicon, which acts as sentiment labels, when we train the softmax classifier of PRAE. Furthermore, our CHL-PRAE model conducts bidirectional training to capture global information. Compared with RAE and some supervised methods such as support vector machine (SVM) and naive Bayesian on English and Chinese datasets, the experiment results show that CHL-PRAE can provide the best performance for sentence-level sentiment analysis. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:18 / 27
页数:10
相关论文
共 50 条
  • [21] Neural Sentence-level Sentiment Classification with Heterogeneous Supervision
    Yuan, Zhigang
    Wu, Fangzhao
    Liu, Junxin
    Wu, Chuhan
    Huang, Yongfeng
    Xie, Xing
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1410 - 1415
  • [22] Learning with Noisy Labels for Sentence-level Sentiment Classification
    Wang, Hao
    Liu, Bing
    Li, Chaozhuo
    Yang, Yan
    Li, Tianrui
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 6286 - 6292
  • [23] Correlations and Fractality in Sentence-Level Sentiment Analysis Based on VADER for Literary Texts
    Hernandez-Perez, Ricardo
    Lara-Martinez, Pablo
    Obregon-Quintana, Bibiana
    Liebovitch, Larry S.
    Guzman-Vargas, Lev
    INFORMATION, 2024, 15 (11)
  • [24] Exploiting Linguistic Features for Effective Sentence-Level Sentiment Analysis in Urdu Language
    Amna Altaf
    Muhammad Waqas Anwar
    Muhammad Hasan Jamal
    Usama Ijaz Bajwa
    Multimedia Tools and Applications, 2023, 82 : 41813 - 41839
  • [25] Sentence-Level Sentiment Analysis Using Feature Vectors from Word Embeddings
    Hayashi, Toshitaka
    Fujita, Hamido
    NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_18), 2018, 303 : 749 - 758
  • [26] Combining Domain-Specific Sentiment Lexicon with Hownet for Chinese Sentiment Analysis
    Liu, Lizhen
    Lei, Mengyun
    Wang, Hanshi
    JOURNAL OF COMPUTERS, 2013, 8 (04) : 878 - 883
  • [27] Exploiting Linguistic Features for Effective Sentence-Level Sentiment Analysis in Urdu Language
    Altaf, Amna
    Anwar, Muhammad Waqas
    Jamal, Muhammad Hasan
    Bajwa, Usama Ijaz
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (27) : 41813 - 41839
  • [28] Context-aware Learning for Sentence-level Sentiment Analysis with Posterior Regularization
    Yang, Bishan
    Cardie, Claire
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 325 - 335
  • [29] Sentence-Level Sentiment Polarity Classification Using a Linguistic Approach
    Tan, Luke Kien-Weng
    Na, Jin-Cheon
    Theng, Yin-Leng
    Chang, Kuiyu
    DIGITAL LIBRARIES: FOR CULTURAL HERITAGE, KNOWLEDGE DISSEMINATION, AND FUTURE CREATION: ICADL 2011, 2011, 7008 : 77 - +
  • [30] MCP-LSTM Network for Sentence-level Sentiment Classification
    Long, Yanlin
    Li, Yanmei
    Luo, Jian
    Miao, Chen
    Fu, Jing
    2019 INTERNATIONAL CONFERENCE ON VIRTUAL REALITY AND VISUALIZATION (ICVRV), 2019, : 124 - 128