Imbalanced text sentiment classification using universal and domain-specific knowledge

被引:63
|
作者
Li, Yijing [1 ,2 ]
Guo, Haixiang [1 ,2 ,3 ]
Zhang, Qingpeng [4 ]
Gu, Mingyun [1 ,2 ]
Yang, Jianying [5 ]
机构
[1] China Univ Geosci, Coll Econ & Management, Wuhan 430074, Hubei, Peoples R China
[2] China Univ Geosci, Res Ctr Digital Business Management, Wuhan 430074, Hubei, Peoples R China
[3] China Univ Geosci, Mineral Resource Strategy & Policy Res Ctr, Wuhan 430074, Hubei, Peoples R China
[4] City Univ Hong Kong, Dept Syst Engn & Engn Management, Kowloon, Hong Kong, Peoples R China
[5] Wuhan Ctr China Geol Survey, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; Label propagation; Imbalanced data; Ensemble learning; LEXICON; MODEL;
D O I
10.1016/j.knosys.2018.06.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a sentiment classification model is proposed to address two predominant issues in sentiment classification, namely domain-sensitive and data imbalance. Since words may embed distinct sentiment polarities in different contexts, sentiment classification is widely contended as a domain-sensitive task. Accordingly, this paper draws on label propagation to induce universal and domain-specific sentiment lexicons and builds a domain-adaptive sentiment classification model that incorporates universal and domain-specific knowledge into a unified learning framework. On the flip side, sentiment-related corpuses are usually formed with skewed polarity distribution because individuals tend to share similar assessment criteria on a given object and hence their sentiment polarities toward the same object are likely to be similar. We endeavor to address such imbalanced data problem by advancing a novel over-sampling technique. Unlike existing over-sampling approaches that generate minority-class samples from numerical feature space, the proposed sampling method directly creates synthetic texts from word spaces. Several experiments are conducted to verify the effectiveness of the proposed lexicon generation method, learning framework, and over-sampling method. Results show that the induced sentiment lexicons are interpretable and the proposed model is found to be effective for imbalanced and domain-specific text sentiment classification.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [1] Domain-specific sentiment classification via fusing sentiment knowledge from multiple sources
    Wu, Fangzhao
    Huang, Yongfeng
    Yuan, Zhigang
    [J]. INFORMATION FUSION, 2017, 35 : 26 - 37
  • [2] Knowledge management tools: Universal and domain-specific
    Kudryavtsev, Dmitry
    Gavrilova, Tatiana
    Menshikova, Anna
    [J]. IFKAD 2017: 12TH INTERNATIONAL FORUM ON KNOWLEDGE ASSET DYNAMICS: KNOWLEDGE MANAGEMENT IN THE 21ST CENTURY: RESILIENCE, CREATIVITY AND CO-CREATION, 2017, : 1774 - 1784
  • [3] Domain-specific knowledge acquisition from text
    Moldovan, D
    Girju, R
    Rus, V
    [J]. 6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : 268 - 275
  • [4] Automatic construction of domain-specific sentiment lexicon for unsupervised domain adaptation and sentiment classification
    Beigi, Omid Mohamad
    Moattar, Mohammad H.
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 213
  • [5] A comprehensive study of domain-specific emoji meanings in sentiment classification
    Mahmoudi, Nader
    Olech, Lukasz P.
    Docherty, Paul
    [J]. COMPUTATIONAL MANAGEMENT SCIENCE, 2022, 19 (02) : 159 - 197
  • [6] A comprehensive study of domain-specific emoji meanings in sentiment classification
    Nader Mahmoudi
    Łukasz P. Olech
    Paul Docherty
    [J]. Computational Management Science, 2022, 19 : 159 - 197
  • [7] Unsupervised Commonsense Knowledge Enrichment for Domain-Specific Sentiment Analysis
    Nir Ofek
    Soujanya Poria
    Lior Rokach
    Erik Cambria
    Amir Hussain
    Asaf Shabtai
    [J]. Cognitive Computation, 2016, 8 : 467 - 477
  • [8] Unsupervised Commonsense Knowledge Enrichment for Domain-Specific Sentiment Analysis
    Ofek, Nir
    Poria, Soujanya
    Rokach, Lior
    Cambria, Erik
    Hussain, Amir
    Shabtai, Asaf
    [J]. COGNITIVE COMPUTATION, 2016, 8 (03) : 467 - 477
  • [9] IDENTIFYING DOMAIN-SPECIFIC SENSES AND ITS APPLICATION TO TEXT CLASSIFICATION
    Fukumoto, Fumiyo
    Suzuki, Yoshimi
    [J]. KEOD 2010: Proceedings of the International Conference on Knowledge Engineering and Ontology Development, 2010, : 263 - 268
  • [10] Text classification based filters for a domain-specific search engine
    Schmidt, Sebastian
    Schnitzer, Steffen
    Rensing, Christoph
    [J]. COMPUTERS IN INDUSTRY, 2016, 78 : 70 - 79