Word Embeddings-based Sentence-Level Sentiment Analysis considering Word Importance

被引:17
|
作者
Hayashi, Toshitaka [1 ]
Fujita, Hamido [1 ]
机构
[1] Iwate Prefectural Univ, 152-52 Sugo, Takizawa 0200693, Japan
关键词
Sentiment Analysis; Polarity Classification; Word Embeddings; Word Importance; MODEL;
D O I
10.12700/APH.16.7.2019.7.1
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Word2vec has been proven to facilitate various Natural Language Processing (NLP) tasks. We suppose that it could separate the vector space of word2vec into positive and negative. Hence, word2vec can be applied to Sentiment Analysis tasks. In our previous research, we proposed the word embeddings (WEMB) based Sentence-level Sentiment Analysis method. Word's vectors from WEMB are utilized to calculate the sentence vector. Training of the classification model is done using sentence vector and the polarity. After training, the model predicts the polarity of the unlabeled sentence. However, the sentence vector was insufficient because the method treats all words with the same weight for calculating a sentence vector. In this paper, we propose a method to solve this problem. We consider word weight according to their importance for calculating sentence vector. The proposed method is compared with the method without word importance, and the accuracy is improved. However, there is still a grim difference with state of the art. We discuss the next improvement and present future work.
引用
收藏
页码:7 / 24
页数:18
相关论文
共 50 条
  • [1] Sentence-Level Sentiment Analysis Using Feature Vectors from Word Embeddings
    Hayashi, Toshitaka
    Fujita, Hamido
    [J]. NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_18), 2018, 303 : 749 - 758
  • [2] Learning Sentiment-inherent Word Embedding for Word-level and Sentence-Level Sentiment Analysis
    Zhang, Zhihua
    Lan, Man
    [J]. PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 94 - 97
  • [3] Document Summarization Using Sentence-Level Semantic Based on Word Embeddings
    Al-Sabahi, Kamal
    Zhang Zuping
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2019, 29 (02) : 177 - 196
  • [4] Sentence-level Sentiment Analysis Using GCN on Contextualized Word Representations
    Huyen Trang Phan
    Ngoc Thanh Nguyen
    Mazur, Zygmunt
    Hwang, Dosam
    [J]. COMPUTATIONAL SCIENCE, ICCS 2022, PT II, 2022, : 690 - 702
  • [5] BEYOND WORD-LEVEL TO SENTENCE-LEVEL SENTIMENT ANALYSIS FOR FINANCIAL REPORTS
    Du, Chi-Han
    Tsai, Ming-Feng
    Wang, Chuan-Ju
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1562 - 1566
  • [6] Image Captioning Based On Sentence-Level And Word-Level Attention
    Wei, Haiyang
    Li, Zhixin
    Zhang, Canlong
    Zhou, Tao
    Quan, Yu
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [7] EFFECTS OF WORD-LEVEL AND SENTENCE-LEVEL CONTEXTS UPON WORD RECOGNITION
    COLOMBO, L
    WILLIAMS, J
    [J]. MEMORY & COGNITION, 1990, 18 (02) : 153 - 163
  • [8] Performance Evaluation of Word and Sentence Embeddings for Finance Headlines Sentiment Analysis
    Mishev, Kostadin
    Gjorgjevikj, Ana
    Stojanov, Riste
    Mishkovski, Igor
    Vodenska, Irena
    Chitkushev, Ljubomir
    Trajanov, Dimitar
    [J]. ICT INNOVATIONS 2019: BIG DATA PROCESSING AND MINING, 2019, 1110 : 161 - 172
  • [9] Sentence-Level Sentiment Analysis in Persian
    Basiri, Mohammad Ehsan
    Kabiri, Arman
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS (IPRIA), 2017, : 84 - 89
  • [10] Word Embeddings-Based Uncertainty Detection in Financial Disclosures
    Theil, Christoph Kilian
    Stajner, Sanja
    Stuckenschmidt, Heiner
    [J]. ECONOMICS AND NATURAL LANGUAGE PROCESSING (ECONLP 2018), 2018, : 32 - 37