Learning to share by masking the non-shared for multi-domain sentiment classification

Cited by: 8
Authors
Yuan, Jianhua [1]
Zhao, Yanyan [1]
Qin, Bing [1, 2]
Affiliations
[1] Harbin Inst Technol, Fac Comp, Harbin 150001, Peoples R China
[2] Pengcheng Lab, Shenzhen 518066, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Natural language processing; Sentiment analysis; Cross domain; Masking;
DOI
10.1007/s13042-022-01556-0
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Multi-domain sentiment classification deals with the scenario where labeled data exists for multiple domains but is insufficient for training effective sentiment classifiers that work across domains. Thus, fully exploiting sentiment knowledge shared across domains is crucial for real-world applications. While many existing works try to extract domain-invariant features in high-dimensional space, such models fail to explicitly distinguish between shared and private features at the text level, which limits their interpretability to some extent. Based on the assumption that removing domain-related tokens from texts would help improve their domain invariance, we instead first transform original sentences to be domain-agnostic. To this end, we propose the BERTMasker model, which explicitly masks domain-related words from texts, learns domain-invariant sentiment features from these domain-agnostic texts, and uses the masked words to form domain-aware sentence representations. Experiments on the benchmark multi-domain sentiment classification datasets demonstrate the effectiveness of our proposed model, which improves accuracy in the multi-domain and cross-domain settings by 1.91% and 3.31%, respectively. Further analysis of masking shows that removing domain-related, sentiment-irrelevant tokens decreases the domain separability of texts, degrading the performance of a BERT-based domain classifier by over 12%.
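The masking idea in the abstract can be illustrated with a toy sketch. This is not the BERTMasker model: the abstract does not say how domain-related words are selected, so the fixed word set `domain_words`, the function name `mask_domain_tokens`, and the whitespace tokenization below are all assumptions made only for illustration. The sketch splits a sentence into a domain-agnostic token sequence (domain words replaced by `[MASK]`) and the list of masked words that a model could use for a domain-aware representation.

```python
from typing import List, Set, Tuple

MASK_TOKEN = "[MASK]"  # the mask token used by BERT vocabularies


def mask_domain_tokens(
    tokens: List[str], domain_words: Set[str]
) -> Tuple[List[str], List[str]]:
    """Replace tokens found in `domain_words` (a hypothetical, hand-picked
    set) with [MASK]; return the masked sequence and the removed words."""
    masked_text: List[str] = []
    masked_words: List[str] = []
    for tok in tokens:
        if tok.lower() in domain_words:
            masked_text.append(MASK_TOKEN)
            masked_words.append(tok)
        else:
            masked_text.append(tok)
    return masked_text, masked_words


# Hypothetical electronics-domain review and domain word set.
tokens = "the battery life of this laptop is great".split()
domain_words = {"battery", "laptop"}

agnostic, private = mask_domain_tokens(tokens, domain_words)
print(" ".join(agnostic))
print(private)
```

In this toy case the sentiment-bearing word "great" survives masking, while the words that reveal the product domain are moved to the second list.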
Pages: 2711-2724
Page count: 14