MII: A Novel Text Classification Model Combining Deep Active Learning with BERT

Cited by: 11
Authors
Zhang, Anman [1]
Li, Bohan [1, 2, 3]
Wang, Wenhuan [1]
Wan, Shuo [1]
Chen, Weitong [4]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Minist Ind & Informat Technol, Key Lab Safety Crit Software, Nanjing 211106, Jiangsu, Peoples R China
[3] Collaborat Innovat Ctr Novel Software Technol & I, Nanjing 210046, Peoples R China
[4] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia
Source
CMC-COMPUTERS MATERIALS & CONTINUA | 2020, Vol. 63, No. 3
Funding
National Natural Science Foundation of China
Keywords
Active learning; instance selection; deep neural network; text classification
DOI
10.32604/cmc.2020.09962
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Active learning has been widely used to reduce the labeling cost of supervised learning. By selecting specific instances to train the model, it improves model performance within a limited number of steps. However, few studies have examined the effectiveness of active learning for deep models. In this paper, we propose a deep active learning model with bidirectional encoder representations from transformers (BERT) for text classification. BERT uses the self-attention mechanism to integrate contextual information, which helps accelerate training convergence. For the active learning process, we design an instance selection strategy based on posterior-probability Margin, Intra-correlation and Inter-correlation (MII). Selected instances are characterized by a small margin, low intra-cohesion and high inter-cohesion. We conduct extensive experiments and analyses of our method. The effect of the learner is compared, while the effects of the sampling strategy and text classification are assessed on three real datasets. The results show that our method outperforms the baselines in terms of accuracy.
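The selection criterion described in the abstract can be illustrated with a minimal sketch. The abstract does not give the formulas, so everything beyond the margin itself is an assumption made here for illustration: intra-cohesion is taken as the mean cosine similarity to other instances with the same predicted class, inter-cohesion as the mean cosine similarity to instances with a different predicted class, and the three terms are combined by a simple unweighted sum. The names `select_instances`, `mean_similarity`, `posteriors` and `embeddings` are hypothetical, and this is not the authors' implementation.

```python
import math

def margin(probs):
    """Gap between the two largest posterior probabilities (small = uncertain)."""
    top1, top2 = sorted(probs, reverse=True)[:2]
    return top1 - top2

def cosine(u, v):
    """Cosine similarity of two vectors; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def mean_similarity(i, others, embeddings):
    """Mean cosine similarity between instance i and the instances in `others`."""
    if not others:
        return 0.0
    return sum(cosine(embeddings[i], embeddings[j]) for j in others) / len(others)

def select_instances(posteriors, embeddings, k):
    """Return indices of the k unlabeled instances with the lowest score.

    ASSUMPTION: score = margin + intra - inter, so a low score means a small
    margin, low intra-cohesion and high inter-cohesion. The paper's actual
    definitions and weighting may differ.
    """
    # Predicted class = index of the largest posterior probability.
    preds = [max(range(len(p)), key=p.__getitem__) for p in posteriors]
    scores = []
    for i, p in enumerate(posteriors):
        same = [j for j, c in enumerate(preds) if c == preds[i] and j != i]
        other = [j for j, c in enumerate(preds) if c != preds[i]]
        intra = mean_similarity(i, same, embeddings)
        inter = mean_similarity(i, other, embeddings)
        scores.append(margin(p) + intra - inter)
    return sorted(range(len(posteriors)), key=scores.__getitem__)[:k]

# Toy pool: three unlabeled instances, two classes, 2-d embeddings.
posteriors = [[0.9, 0.1], [0.55, 0.45], [0.2, 0.8]]
embeddings = [[1.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
print(select_instances(posteriors, embeddings, k=1))
```

In an active learning loop, the returned indices would be sent for labeling and added to the training set before the classifier is fine-tuned again; in the setting the abstract describes, the posteriors would come from the BERT classifier.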
Pages: 1499-1514
Number of pages: 16
Related papers
50 in total
  • [21] A novel deep learning by combining discriminative model with generative model
    Kim, Sangwook
    Lee, Minho
    Shen, Jixiang
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [22] Emotion Classification of Text Based on BERT and Broad Learning System
    Peng, Sancheng
    Zeng, Rong
    Liu, Hongzhan
    Chen, Guanghao
    Wu, Ruihuan
    Yang, Aimin
    Yu, Shui
    [J]. WEB AND BIG DATA, APWEB-WAIM 2021, PT I, 2021, 12858 : 382 - 396
  • [23] RETRACTED: Novel Multirole-Oriented Deep Learning Text Classification Model (Retracted Article)
    Luo, Ting
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [24] Weibo Text Sentiment Analysis Based on BERT and Deep Learning
    Li, Hongchan
    Ma, Yu
    Ma, Zishuai
    Zhu, Haodong
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (22):
  • [25] DPTCN: A novel deep CNN model for short text classification
    Yu, Shujuan
    Liu, Danlei
    Zhang, Yun
    Zhao, Shengmei
    Wang, Weigang
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 7093 - 7100
  • [26] A vehicle classification model based on deep active learning
    Wang, Xuanhong
    Yang, Shiyu
    Xiao, Yun
    Zheng, Xia
    Gao, Shuai
    Zhou, Jincheng
    [J]. PATTERN RECOGNITION LETTERS, 2023, 171 : 84 - 91
  • [27] Deep Active Learning for Address Parsing Tasks with BERT
    Guler, Berkay
    Aygun, Betul
    Gerek, Aydin
    Gurel, Alaeddin Selcuk
    [J]. 2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2023,
  • [28] A Novel System for Image Text Recognition and Classification using Deep Learning
    Manzoor, Syed Ishfaq
    Singla, Jimmy
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES (ICCS 2021), 2021, : 61 - 64
  • [29] Text Classification Research Based on Bert Model and Bayesian Network
    Liu, Songsong
    Tao, Haijun
    Feng, Shiling
    [J]. 2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 5842 - 5846
  • [30] Cross-Domain Text Classification Based on BERT Model
    Zhang, Kuan
    Hei, Xinhong
    Fei, Rong
    Guo, Yufan
    Jiao, Rui
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS: DASFAA 2021 INTERNATIONAL WORKSHOPS, 2021, 12680 : 197 - 208