Barrage Text Classification with Improved Active Learning and CNN

Authors
Qiu, Ningjia [1]
Cong, Lin [1]
Zhou, Sicheng [1]
Wang, Peng [1]
Affiliations
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, 7186 Weixing Rd, Changchun 130022, Jilin, Peoples R China
Keywords
CNN; SVD; active learning; gradient descent; text classification
DOI
10.20965/jaciii.2019.p0980
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Traditional convolutional neural networks (CNNs) use a pooling layer to reduce the dimensionality of texts, but this loses semantic information. To solve this problem, this paper proposes a convolutional neural network model based on the singular value decomposition algorithm (SVD-CNN). First, an improved density-based center point clustering active learning sampling algorithm (DBC-AL) is used to obtain a high-quality training set at a low labelling cost. Second, the method uses the singular value decomposition algorithm for feature extraction and dimensionality reduction instead of a pooling layer, fuses the dimensionality reduction matrix, and completes the barrage text classification task. Finally, the partial sampling gradient descent algorithm (PSGD) is applied to optimize the model parameters, which accelerates the convergence of the model while keeping training stable. To verify the effectiveness of the improved algorithm, several barrage datasets were used to compare the proposed model with common text classification models. The experimental results show that the improved algorithm preserves the semantic features of the text more successfully, ensures the stability of the training process, and improves the convergence speed of the model. Further, the model's classification performance on different barrage texts is superior to that of traditional algorithms.
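The idea of replacing a pooling layer with singular value decomposition can be illustrated with a minimal NumPy sketch. It is not the authors' SVD-CNN implementation: the function names `svd_reduce` and `max_pool_over_time`, the (positions x filters) layout of the feature map, and the choice of keeping the top `k` singular directions are all assumptions made for illustration. The sketch only contrasts max pooling, which keeps one value per filter, with a truncated SVD, which keeps a rank-`k` summary of the whole feature map.

```python
import numpy as np


def max_pool_over_time(feature_map: np.ndarray) -> np.ndarray:
    """Conventional pooling baseline: keep only the largest activation per filter."""
    return feature_map.max(axis=0)


def svd_reduce(feature_map: np.ndarray, k: int) -> np.ndarray:
    """Hypothetical pooling substitute: rank-k reduction via truncated SVD.

    feature_map has shape (positions, filters). The result has shape
    (positions, k) and equals the projection of feature_map onto its
    top-k right singular vectors.
    """
    # Thin SVD: feature_map = U @ diag(s) @ Vt, singular values in descending order.
    u, s, vt = np.linalg.svd(feature_map, full_matrices=False)
    k = min(k, s.shape[0])
    # Scale the first k left singular vectors by their singular values.
    return u[:, :k] * s[:k]


# Toy feature map standing in for convolution outputs (assumed sizes).
rng = np.random.default_rng(0)
toy_map = rng.standard_normal((20, 8))

pooled = max_pool_over_time(toy_map)   # shape (8,)
reduced = svd_reduce(toy_map, k=3)     # shape (20, 3)
print(pooled.shape, reduced.shape)
```

Under these assumptions, the SVD output retains the directions of largest variance across all positions rather than a single maximum per filter, which is one way to read the abstract's claim that semantic information is better preserved.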
Pages: 980-989
Page count: 10