TopicBERT: A Topic-Enhanced Neural Language Model Fine-Tuned for Sentiment Classification

被引：18

作者：

Zhou, Yuxiang ^{[1
]}

Liao, Lejian ^{[1
]}

Gao, Yang ^{[1
]}

Wang, Rui ^{[2
]}

Huang, Heyan ^{[1
]}

机构：

[1] Beijing Inst Technol, Fac Comp Sci, Beijing 100081, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Fac Comp Sci, Nanjing 210023, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023年 / 34卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Task analysis; Bit error rate; Semantics; Predictive models; Training; Context modeling; Social networking (online); Bidirectional encoder representations from transformers (BERT); pretrained neural language model; sentiment classification; topic-enhanced neural network;

D O I：

10.1109/TNNLS.2021.3094987

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Sentiment classification is a form of data analytics where people's feelings and attitudes toward a topic are mined from data. This tantalizing power to ``predict the zeitgeist'' means that sentiment classification has long attracted interest, but with mixed results. However, the recent development of the BERT framework and its pretrained neural language models is seeing new-found success for sentiment classification. BERT models are trained to capture word-level information via mask language modeling and sentence-level contexts via next sentence prediction tasks. Out of the box, they are adequate models for some natural language processing tasks. However, most models are further fine-tuned with domain-specific information to increase accuracy and usefulness. Motivated by the idea that a further fine-tuning step would improve the performance for downstream sentiment classification tasks, we developed TopicBERT--a BERT model fine-tuned to recognize topics at the corpus level in addition to the word and sentence levels. TopicBERT comprises two variants: TopicBERT-ATP (aspect topic prediction), which captures topic information via an auxiliary training task, and TopicBERT-TA, where topic representation is directly injected into a topic augmentation layer for sentiment classification. With TopicBERT-ATP, the topics are predetermined by an LDA mechanism and collapsed Gibbs sampling. With TopicBERT-TA, the topics can change dynamically during the training. Experimental results show that both approaches deliver the state-of-the-art performance in two different domains with SemEval 2014 Task 4. However, in a test of methods, direct augmentation outperforms further training. Comprehensive analyses in the form of ablation, parameter, and complexity studies accompany the results.

引用

下载

页码：380 / 393

页数：14

共 50 条

[41] Effectiveness of Fine-tuned BERT Model in Classification of Helpful and Unhelpful Online Customer Reviews
Bilal, Muhammad
Almazroi, Abdulwahab Ali
ELECTRONIC COMMERCE RESEARCH, 2023, 23 (04) : 2737 - 2757
[42] Racial Skew in Fine-Tuned Legal AI Language Models
Malic, Vincent Quirante
Kumari, Anamika
Liu, Xiaozhong
2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 245 - 252
[43] Improving RAG Quality for Large Language Models with Topic-Enhanced Reranking
Ampazis, Nicholas
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT II, AIAI 2024, 2024, 712 : 74 - 87
[44] Effectiveness of Fine-tuned BERT Model in Classification of Helpful and Unhelpful Online Customer Reviews
Muhammad Bilal
Abdulwahab Ali Almazroi
Electronic Commerce Research, 2023, 23 : 2737 - 2757
[45] Comprehensive Information Retrieval Using Fine-Tuned Bert Model and Topic-Assisted Query Expansion
Patro, Wilson
Niaz, Aaquib
Prasath, Rajendra
AMBIENT INTELLIGENCE IN HEALTH CARE, ICAIHC 2022, 2023, 317 : 117 - 132
[46] Improving the performance of aspect based sentiment analysis using fine-tuned Bert Base Uncased model
Geetha M.P.
Karthika Renuka D.
International Journal of Intelligent Networks, 2021, 2 : 64 - 69
[47] On the Generalization Abilities of Fine-Tuned Commonsense Language Representation Models
Shen, Ke
Kejriwal, Mayank
ARTIFICIAL INTELLIGENCE XXXVIII, 2021, 13101 : 3 - 16
[48] Fusing fine-tuned deep features for skin lesion classification
Mahbod, Amirreza
Schaefer, Gerald
Effinger, Isabella
Ecker, Rupert
Pitiot, Alain
Wang, Chunliang
COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2019, 71 : 19 - 29
[49] Retinal Image Quality Classification Using Fine-Tuned CNN
Sun, Jing
Wan, Cheng
Cheng, Jun
Yu, Fengli
Liu, Jiang
FETAL, INFANT AND OPHTHALMIC MEDICAL IMAGE ANALYSIS, 2017, 10554 : 126 - 133
[50] Deciphering language disturbances in schizophrenia: A study using fine-tuned language models
Li, Renyu
Cao, Minne
Fu, Dawei
Wei, Wei
Wang, Dequan
Yuan, Zhaoxia
Hu, Ruofei
Deng, Wei
SCHIZOPHRENIA RESEARCH, 2024, 271 : 120 - 128

← 1 2 3 4 5 →