Barrage Text Classification with Improved Active Learning and CNN

被引:0
|
作者
Qiu, Ningjia [1 ]
Cong, Lin [1 ]
Zhou, Sicheng [1 ]
Wang, Peng [1 ]
机构
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, 7186 Weixing Rd, Changchun 130022, Jilin, Peoples R China
关键词
CNN; SVD; active learning; gradient descent; text classification;
D O I
10.20965/jaciii.2019.p0980
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional convolutional neural networks (CNNs) use a pooling layer to reduce the dimensionality of texts, but lose semantic information. To solve this problem, this paper proposes a convolutional neural network model based on singular value decomposition algorithm (SVD-CNN). First, an improved density-based center point clustering active learning sampling algorithm (DBC-AL) is used to obtain a high-quality training set at a low labelling cost. Second, the method uses the singular value decomposition algorithm for feature extraction and dimensionality reduction instead of a pooling layer, fuses the dimensionality reduction matrix, and completes the barrage text classification task. Finally, the partial sampling gradient descent algorithm (PSGD) is applied to optimize the model parameters, which accelerates the convergence speed of the model while ensuring stability of the model training. To verify the effectiveness of the improved algorithm, several barrage datasets were used to compare the proposed model and common text classification models. The experimental results show that the improved algorithm preserves the semantic features of the text more successfully, ensures the stability of the training process, and improves the convergence speed of the model. Further, the model's classification performance on different barrage texts is superior to traditional algorithms.
引用
收藏
页码:980 / 989
页数:10
相关论文
共 50 条
  • [1] Text classification with active learning
    Novak, B
    Mladenic, D
    Grobelnik, M
    FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 398 - +
  • [2] Active learning for text classification with reusability
    Hu, Rong
    Mac Namee, Brian
    Delany, Sarah Jane
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 45 : 438 - 449
  • [3] Active Learning for Turkish Text Classification
    Sapci, Ali Osman Berk
    Tastan, Oznur
    Yeniterzi, Reyyan
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [4] Deep Active Learning for Text Classification
    An, Bang
    Wu, Wenjun
    Han, Huimin
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON VISION, IMAGE AND SIGNAL PROCESSING (ICVISP 2018), 2018,
  • [5] News Text Classification Based on Improved Bi-LSTM-CNN
    Li, Chenbin
    Zhan, Guohua
    Li, Zhihua
    2018 NINTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME 2018), 2018, : 890 - 893
  • [6] Research on Web Text Classification Algorithm Based on Improved CNN and SVM
    Wang, Zhiquan
    Qu, Zhiyi
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017, : 1958 - 1961
  • [7] CoLAL: Co-learning Active Learning for Text Classification
    Le, Linh
    Zhao, Genghong
    Zhang, Xia
    Zuccon, Guido
    Demartini, Gianluca
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 13337 - 13345
  • [8] Active Learning Based on Transfer Learning Techniques for Text Classification
    Onita, Daniela
    IEEE ACCESS, 2023, 11 : 28751 - 28761
  • [9] Transfer Learning to Timed Text Based Video Classification Using CNN
    Kastrati, Zenun
    Imran, Ali Shariq
    Kurti, Arianit
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, MINING AND SEMANTICS (WIMS 2019), 2019,
  • [10] Small-Text: Active Learning for Text Classification in Python']Python
    Schroeder, Christopher
    Mueller, Lydia
    Niekler, Andreas
    Potthast, Martin
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 84 - 95