TopicBERT: A Topic-Enhanced Neural Language Model Fine-Tuned for Sentiment Classification

被引:18
|
作者
Zhou, Yuxiang [1 ]
Liao, Lejian [1 ]
Gao, Yang [1 ]
Wang, Rui [2 ]
Huang, Heyan [1 ]
机构
[1] Beijing Inst Technol, Fac Comp Sci, Beijing 100081, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Fac Comp Sci, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Bit error rate; Semantics; Predictive models; Training; Context modeling; Social networking (online); Bidirectional encoder representations from transformers (BERT); pretrained neural language model; sentiment classification; topic-enhanced neural network;
D O I
10.1109/TNNLS.2021.3094987
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment classification is a form of data analytics where people's feelings and attitudes toward a topic are mined from data. This tantalizing power to ``predict the zeitgeist'' means that sentiment classification has long attracted interest, but with mixed results. However, the recent development of the BERT framework and its pretrained neural language models is seeing new-found success for sentiment classification. BERT models are trained to capture word-level information via mask language modeling and sentence-level contexts via next sentence prediction tasks. Out of the box, they are adequate models for some natural language processing tasks. However, most models are further fine-tuned with domain-specific information to increase accuracy and usefulness. Motivated by the idea that a further fine-tuning step would improve the performance for downstream sentiment classification tasks, we developed TopicBERT--a BERT model fine-tuned to recognize topics at the corpus level in addition to the word and sentence levels. TopicBERT comprises two variants: TopicBERT-ATP (aspect topic prediction), which captures topic information via an auxiliary training task, and TopicBERT-TA, where topic representation is directly injected into a topic augmentation layer for sentiment classification. With TopicBERT-ATP, the topics are predetermined by an LDA mechanism and collapsed Gibbs sampling. With TopicBERT-TA, the topics can change dynamically during the training. Experimental results show that both approaches deliver the state-of-the-art performance in two different domains with SemEval 2014 Task 4. However, in a test of methods, direct augmentation outperforms further training. Comprehensive analyses in the form of ablation, parameter, and complexity studies accompany the results.
引用
下载
收藏
页码:380 / 393
页数:14
相关论文
共 50 条
  • [1] A topic-enhanced word embedding for Twitter sentiment classification
    Ren, Yafeng
    Wang, Ruimin
    Ji, Donghong
    INFORMATION SCIENCES, 2016, 369 : 188 - 198
  • [2] Fine-Tuned Transformer Model for Sentiment Analysis
    Liu, Sishun
    Shuai, Pengju
    Zhang, Xiaowu
    Chen, Shuang
    Li, Li
    Liu, Ming
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT II, 2020, 12275 : 336 - 343
  • [3] AirBERT: A fine-tuned language representation model for airlines tweet sentiment analysis
    Yenkikar, Anuradha
    Babu, C. Narendra
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2023, 17 (02): : 435 - 455
  • [4] Website Category Classification Using Fine-tuned BERT Language Model
    Demirkiran, Ferhat
    Cayir, Aykut
    Unal, Ugur
    Dag, Hasan
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 333 - 336
  • [5] Arabic sarcasm detection: An enhanced fine-tuned language model approach
    Galal, Mohamed A.
    Yousef, Ahmed Hassan
    Zayed, Hala H.
    Medhat, Walaa
    AIN SHAMS ENGINEERING JOURNAL, 2024, 15 (06)
  • [6] A topic-enhanced recurrent autoencoder model for sentiment analysis of short texts
    Wu S.
    Gao M.
    Xiao Q.
    Zou G.
    Wu, Shaochun (scwu@shu.edu.cn), 1600, Inderscience Publishers (07): : 393 - 406
  • [7] Enhancing Zero-Shot Crypto Sentiment With Fine-Tuned Language Model and Prompt Engineering
    Wahidur, Rahman S. M.
    Tashdeed, Ishmam
    Kaur, Manjit
    Lee, Heung-No
    IEEE ACCESS, 2024, 12 : 10146 - 10159
  • [8] Melanoma identification and classification model based on fine-tuned convolutional neural network
    Almufareh, Maram F.
    Tariq, Noshina
    Humayun, Mamoona
    Khan, Farrukh Aslam
    DIGITAL HEALTH, 2024, 10
  • [9] A fine-tuned convolutional neural network model for accurate Alzheimer’s disease classification
    Muhammad Zahid Hussain
    Tariq Shahzad
    Shahid Mehmood
    Kainat Akram
    Muhammad Adnan Khan
    Muhammad Usman Tariq
    Arfan Ahmed
    Scientific Reports, 15 (1)
  • [10] Neuro or Symbolic? Fine-Tuned Transformer With Unsupervised LDA Topic Clustering for Text Sentiment Analysis
    Ding, Fei
    Kang, Xin
    Ren, Fuji
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (02) : 493 - 507