Imbalanced text sentiment classification using universal and domain-specific knowledge

Cited by: 63
Authors
Li, Yijing [1, 2]
Guo, Haixiang [1, 2, 3]
Zhang, Qingpeng [4]
Gu, Mingyun [1, 2]
Yang, Jianying [5]
Affiliations
[1] China Univ Geosci, Coll Econ & Management, Wuhan 430074, Hubei, Peoples R China
[2] China Univ Geosci, Res Ctr Digital Business Management, Wuhan 430074, Hubei, Peoples R China
[3] China Univ Geosci, Mineral Resource Strategy & Policy Res Ctr, Wuhan 430074, Hubei, Peoples R China
[4] City Univ Hong Kong, Dept Syst Engn & Engn Management, Kowloon, Hong Kong, Peoples R China
[5] Wuhan Ctr China Geol Survey, Wuhan 430074, Hubei, Peoples R China
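Funding
National Natural Science Foundation of China
Keywords
Sentiment analysis; Label propagation; Imbalanced data; Ensemble learning; LEXICON; MODEL
DOI
10.1016/j.knosys.2018.06.019
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract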
This paper proposes a sentiment classification model that addresses two predominant issues in sentiment classification: domain sensitivity and data imbalance. Because a word may carry different sentiment polarities in different contexts, sentiment classification is widely regarded as a domain-sensitive task. Accordingly, the paper uses label propagation to induce universal and domain-specific sentiment lexicons, and builds a domain-adaptive sentiment classification model that incorporates universal and domain-specific knowledge into a unified learning framework. In addition, sentiment corpora usually have a skewed polarity distribution: individuals tend to share similar assessment criteria for a given object, so their sentiment polarities toward that object are likely to be similar. The paper addresses this imbalanced-data problem with a novel over-sampling technique. Unlike existing over-sampling approaches, which generate minority-class samples in a numerical feature space, the proposed method creates synthetic texts directly in word space. Several experiments are conducted to verify the effectiveness of the proposed lexicon generation method, learning framework, and over-sampling method. The results show that the induced sentiment lexicons are interpretable and that the proposed model is effective for imbalanced, domain-specific text sentiment classification.
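The abstract does not spell out how label propagation induces a lexicon, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a small word graph whose edge weights stand in for word similarity, a handful of seed words with known polarity, and a damping parameter `alpha`; the function name `propagate_polarity`, the toy graph, and all parameter values are hypothetical.

```python
def propagate_polarity(graph, seeds, iterations=50, alpha=0.8):
    """Spread seed polarities over a weighted word graph.

    graph: dict mapping word -> {neighbor: weight}; every neighbor must
           also appear as a key (assumed symmetric similarity graph).
    seeds: dict mapping word -> polarity in [-1.0, 1.0]; seeds stay fixed.
    """
    scores = {word: seeds.get(word, 0.0) for word in graph}
    for _ in range(iterations):
        updated = {}
        for word, neighbors in graph.items():
            if word in seeds:
                # Clamp seed words to their known polarity.
                updated[word] = seeds[word]
                continue
            total = sum(neighbors.values())
            if total == 0:
                updated[word] = 0.0
                continue
            # Weighted average of the neighbors' current scores.
            average = sum(w * scores[n] for n, w in neighbors.items()) / total
            # Blend the neighborhood average with the word's previous score.
            updated[word] = alpha * average + (1 - alpha) * scores[word]
        scores = updated
    return scores


# Toy graph (hypothetical weights), with one positive and one negative seed.
graph = {
    "good": {"great": 1.0, "reliable": 0.5},
    "great": {"good": 1.0},
    "reliable": {"good": 0.5, "sturdy": 1.0},
    "sturdy": {"reliable": 1.0},
    "bad": {"awful": 1.0, "flimsy": 0.5},
    "awful": {"bad": 1.0},
    "flimsy": {"bad": 0.5},
}
seeds = {"good": 1.0, "bad": -1.0}
scores = propagate_polarity(graph, seeds)

for word in sorted(scores, key=scores.get, reverse=True):
    print(f"{word:10s} {scores[word]:+.3f}")
```

In this sketch, a domain-specific lexicon would come from building the graph on a domain corpus, while a universal one would use a general corpus; how the paper constructs its graphs and combines the two lexicons is not stated in the abstract.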
Pages: 1-15
Number of pages: 15