Barrage Text Classification with Improved Active Learning and CNN

被引:0
|
作者
Qiu, Ningjia [1 ]
Cong, Lin [1 ]
Zhou, Sicheng [1 ]
Wang, Peng [1 ]
机构
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, 7186 Weixing Rd, Changchun 130022, Jilin, Peoples R China
关键词
CNN; SVD; active learning; gradient descent; text classification;
D O I
10.20965/jaciii.2019.p0980
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional convolutional neural networks (CNNs) use a pooling layer to reduce the dimensionality of texts, but lose semantic information. To solve this problem, this paper proposes a convolutional neural network model based on singular value decomposition algorithm (SVD-CNN). First, an improved density-based center point clustering active learning sampling algorithm (DBC-AL) is used to obtain a high-quality training set at a low labelling cost. Second, the method uses the singular value decomposition algorithm for feature extraction and dimensionality reduction instead of a pooling layer, fuses the dimensionality reduction matrix, and completes the barrage text classification task. Finally, the partial sampling gradient descent algorithm (PSGD) is applied to optimize the model parameters, which accelerates the convergence speed of the model while ensuring stability of the model training. To verify the effectiveness of the improved algorithm, several barrage datasets were used to compare the proposed model and common text classification models. The experimental results show that the improved algorithm preserves the semantic features of the text more successfully, ensures the stability of the training process, and improves the convergence speed of the model. Further, the model's classification performance on different barrage texts is superior to traditional algorithms.
引用
收藏
页码:980 / 989
页数:10
相关论文
共 50 条
  • [21] Spectral Clustering based Active Learning with Applications to Text Classification
    Guo, Wenbo
    Zhong, Chun
    Yang, Yupu
    2016 8TH INTERNATIONAL CONFERENCE ON COMPUTER AND AUTOMATION ENGINEERING (ICCAE 2016), 2016, 56
  • [22] Improving Probabilistic Models In Text Classification Via Active Learning
    Bosley, Mitchell
    Kuzushima, Saki
    Enamorado, Ted
    Shiraito, Yuki
    AMERICAN POLITICAL SCIENCE REVIEW, 2024,
  • [23] Support vector machine active learning with applications to text classification
    Tong, S
    Koller, D
    JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (01) : 45 - 66
  • [24] Impact of Batch Size on Stopping Active Learning for Text Classification
    Beatty, Garrett
    Kochis, Ethan
    Bloodgood, Michael
    2018 IEEE 12TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2018, : 306 - 307
  • [25] Effective Multi-Label Active Learning for Text Classification
    Yang, Bishan
    Sun, Jian-Tao
    Wang, Tengjiao
    Chen, Zheng
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 917 - 925
  • [26] Combining active learning and relevance vector machines for text classification
    Silva, C.
    Ribeiro, B.
    ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, : 130 - +
  • [27] A Novel Active Learning Method Using SVM for Text Classification
    Goudjil M.
    Koudil M.
    Bedda M.
    Ghoggali N.
    International Journal of Automation and Computing, 2018, 15 (03) : 290 - 298
  • [28] Impact of Stop Sets on Stopping Active Learning for Text Classification
    Kurlandski, Luke
    Bloodgood, Michael
    16TH IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2022), 2022, : 25 - 32
  • [29] Active Learning Strategies for Multi-Label Text Classification
    Esuli, Andrea
    Sebastiani, Fabrizio
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 102 - +
  • [30] Transfer Learning Method for Very Deep CNN for Text Classification and Methods for its Evaluation
    Moriya, Shun
    Shibata, Chihiro
    2018 IEEE 42ND ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC 2018), VOL 2, 2018, : 153 - 158