A Random Walk-Based Model for Identifying Semantic Orientation

Authors
Hassan, Ahmed [1]
Abu-Jbara, Amjad [2]
Lu, Wanchen [2]
Radev, Dragomir [2, 3]
Affiliations
[1] Microsoft Res, Redmond, WA 98052 USA
[2] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[3] Univ Michigan, Sch Informat, Ann Arbor, MI 48109 USA
Keywords
Classification (of information); Random processes; Text processing; Semantics
DOI
10.1162/COLI_a_00192
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Automatically identifying the sentiment polarity of words is a very important task that has been used as the essential building block of many natural language processing systems such as text classification, text filtering, product review analysis, survey response analysis, and on-line discussion mining. We propose a method for identifying the sentiment polarity of words that applies a Markov random walk model to a large word relatedness graph, and produces a polarity estimate for any given word. The model can accurately and quickly assign a polarity sign and magnitude to any word. It can be used both in a semi-supervised setting where a training set of labeled words is used, and in a weakly supervised setting where only a handful of seed words is used to define the two polarity classes. The method is experimentally tested using a gold standard set of positive and negative words from the General Inquirer lexicon. We also show how our method can be used for three-way classification which identifies neutral words in addition to positive and negative words. Our experiments show that the proposed method outperforms the state-of-the-art methods in the semi-supervised setting and is comparable to the best reported values in the weakly supervised setting. In addition, the proposed method is faster and does not need a large corpus. We also present extensions of our methods for identifying the polarity of foreign words and out-of-vocabulary words.
Pages: 539-562
Number of pages: 24