Fusion of XLNet and BiLSTM-TextCNN for Weibo Sentiment Analysis in Spark Big Data Environment

Cited by: 5
Authors
Li A. [1, 2]
Li T. [2]
Affiliations
[1] College of Information and Electrical Engineering, Heilongjiang Bayi Agricultural University
Keywords
BiLSTM; Spark; Text Sentiment Analysis; TextCNN; XLNet
DOI
10.4018/IJACI.331744
Abstract
This article proposes a Weibo sentiment analysis method to improve traditional algorithms' analysis efficiency and accuracy. The proposed algorithm uses deep learning in the Spark big data environment. First, the input data are converted into dynamic word vector representations using the Chinese version of the XLNet model. Then, dual-channel feature extraction is performed on the data using TextCNN and BiLSTM. The proposed algorithm uses an attention mechanism to allocate computing resources efficiently and realizes feature fusion and data classification. Comparative experiments are conducted on two public datasets under identical experimental conditions. In the NLPCC2014 and NLPCC2015 datasets, the proposed model improves the precision and F1 metrics by at least 4.26% and 2.64%, respectively. In the weibo_senti_100k dataset, the proposed model improves the precision and F1 metrics by at least 4.66% and 2.69%, respectively. The results indicate that the proposed method has better sentiment analysis and prediction abilities than existing methods. © 2023 IGI Global. All rights reserved.
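The fusion step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say which form of attention is used, so the dot-product scoring against a query vector, the function names (`attention_fuse`, `classify`), and all numeric values below are assumptions chosen for illustration. The XLNet embedding stage, the real TextCNN and BiLSTM encoders, and the Spark distribution layer are left out; the two channel outputs are stood in for by fixed toy vectors.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_fuse(channel_features, query):
    """Fuse per-channel feature vectors with attention weights.

    Assumption: each channel is scored by its dot product with a query
    vector (learned in a real model, fixed here), and the fused vector
    is the attention-weighted sum of the channels.
    """
    scores = [sum(f * q for f, q in zip(feat, query)) for feat in channel_features]
    weights = softmax(scores)
    dim = len(query)
    fused = [
        sum(w * feat[i] for w, feat in zip(weights, channel_features))
        for i in range(dim)
    ]
    return fused, weights


def classify(fused, class_weights):
    """Linear layer followed by softmax over sentiment classes."""
    logits = [sum(w * x for w, x in zip(row, fused)) for row in class_weights]
    return softmax(logits)


# Toy stand-ins for the two channel outputs (hypothetical values).
textcnn_features = [0.2, 0.8, -0.1]   # local n-gram features
bilstm_features = [0.5, 0.1, 0.4]     # sequence-context features
query = [1.0, 0.0, 1.0]               # hypothetical attention query

fused, weights = attention_fuse([textcnn_features, bilstm_features], query)

# Hypothetical classifier weights for two classes (negative, positive).
class_weights = [[0.3, -0.2, 0.5], [-0.1, 0.4, 0.2]]
probs = classify(fused, class_weights)

print("attention weights:", weights)
print("fused features:", fused)
print("class probabilities:", probs)
```

In a trained model the query and classifier weights would be learned jointly with the encoders. The sketch only shows how attention weights can decide how much each channel contributes before classification.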