Cross-domain sentiment classification based on syntactic structure transfer and domain fusion

被引:0
|
作者
Zhao C. [1 ,2 ]
Wu M. [1 ]
Shen L. [3 ]
Shangguan X. [3 ]
Wang Y. [3 ]
Li J. [1 ]
Wang S. [4 ]
Li D. [4 ]
机构
[1] School of Information, Shanxi University of Finance and Economics, Taiyuan
[2] Economic Big Data Shanxi Province Key Laboratory, Shanxi University of Finance and Economics, Taiyuan
[3] Shanxi Information Technology Application Innovation Engineering Research Center, Taiyuan
[4] School of Computer and Information Technology, Shanxi University, Taiyuan
关键词
cross-domain sentiment classification; deep transfer learning; minimum distance constraint; syntactic structure transfer;
D O I
10.16511/j.cnki.qhdxxb.2023.21.012
中图分类号
学科分类号
摘要
[Objective] Deep learning models for text sentiment analysis, such as recurrent neural networks, often require many parameters and a large amount of high-quality labeled training data to effectively train and optimize recurrent neural networks. However, obtaining domain-specific high-quality sentiment-labeled data is a challenging task in practical applications. This study proposes a cross-domain text sentiment classification method based on syntactic structure transfer and domain fusion (SSTDF) to address the domain-invariant learning and distribution distance difference metric problems. This method can effectively alleviate the dependence on domain-specific annotated data due to the difference in the data distribution among different domains.[Methods] A method combining SSTDF was proposed in this study to solve the problem of cross-domain sentiment classification. Dependent syntactic features are introduced into the recurrent neural network for syntactic structure transfer for designing a migratable dependent syntactic recurrent neural network model. Furthermore, a parameter transfer strategy is employed to transfer syntactic structure information across domains efficiently for supporting sentiment transfer. The conditional maximum mean discrepancy distance metric is used in domain fusion to quantify the distribution differences between the source and target domains and further refine the cross-domain same-category distance metric information. By constraining the distributions of source and target domains, domain variable features are effectively extracted to maximize the sharing of sentiment information between source and target domains. In this paper, we used a joint optimization and training approach to address cross-domain sentiment classification. Specifically, the sentiment classification loss of source and target domains is minimized, and their fusion losses are fully considered in the joint optimization process. Hence, the generalization performance of the model and classification accuracy of the cross-domain sentiment classification task are considerably improved.[Results] The dataset used in this study is the sentiment classification dataset of Amazon English online reviews, which has been widely used in cross-domain sentiment classification studies; furthermore, it contains four domains—B (Books), D (DVD), E (Electronic), and K (Kitchen)—each with 1 000 positive and negative reviews. The experimental results show that the accuracy of the SSTDF method is higher than the baseline method, achieving 0. 844, 0. 830, and 0. 837 for average accuracy, recall, and Fl values, respectively. Fine-tuning allows the fast convergence of the network, thereby improving its transfer efficiency.[Conclusions] Finally, we used deep transfer learning methods to solve the task of cross-domain text sentiment classification from the perspective of cross-domain syntactic structure consistency learning. A recurrent neural network model that integrates syntactic structure information is used; additionally, a domain minimum distance constraint is added to the syntactic structure transfer process to ensure that the distance between the source and target domains is as similar as possible during the learning process. The effectiveness of the proposed method is finally verified using experimental results. The next step is to increase the number of experimental and neutral samples to validate the proposed method on a larger dataset. Furthermore, a more fine-grained aspect-level cross-domain sentiment analysis will be attempted in the future. © 2023 Press of Tsinghua University. All rights reserved.
引用
下载
收藏
页码:1380 / 1389
页数:9
相关论文
共 31 条
  • [1] LI T, CHEN X, ZHANG S H, Et al., Cross-domain sentiment classification with contrastive learning and mutual information maximization, Proceedings of the 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 8203-8207, (2021)
  • [2] ZHAO C J, WANG S G, LI D Y, Et al., Cross-domain sentiment classification via parameter transferring and attention sharing mechanism, Information Sciences, 578, pp. 281-296, (2021)
  • [3] WU Q, LIU Y, SHEN H W, Et al., A unified framework for cross-domain sentiment classification, Journal of Computer Research and Development, 50, 8, pp. 1683-1689, (2013)
  • [4] ZHAO C J, WANG S G, LI D Y., Research progress on cross-domain text sentiment classification, Journal of Software, 31, 6, pp. 1723-1746, (2020)
  • [5] LI L, YE W R, LONG M S, Et al., Simultaneous learning of pivots and representations for cross-domain sentiment classification, Proceedings of the 34th AAAI Conference on Artificial Intelligence, pp. 8220-8227, (2020)
  • [6] ZHAO C J, WANG S G, LI D Y, Et al., Cross-domain text sentiment classification based on Grouping-AdaBoost ensemble, Journal of Computer Research and Development, 52, 3, pp. 629-638, (2015)
  • [7] WEI X H, ZHANG S W, YANG L, Et al., Cross-domain sentiment analysis based on weighted SimRank, Pattern Recognition and Artificial Intelligence, 26, 11, pp. 1004-1009, (2013)
  • [8] ZHAO C J, WANG S G, LI D Y., Multi-source domain adaptation with joint learning for cross-domain sentiment classification [J], Knowledge-Based Systems, 191, (2020)
  • [9] YUE C Y, CAO H Q, XU G P, Et al., Collaborative attention neural network for multi-domain sentiment classification, Applied Intelligence, 51, 6, pp. 3174-3188, (2021)
  • [10] WANG S G, LI D Y, LI Y., Sentiment mining of commodity reputation data based on joint model, Journal of Tsinghua University (Science and Technology), 57, 9, pp. 926-931, (2017)