Barrage Text Classification with Improved Active Learning and CNN

Authors
Qiu, Ningjia [1]
Cong, Lin [1]
Zhou, Sicheng [1]
Wang, Peng [1]
Affiliations
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, 7186 Weixing Rd, Changchun 130022, Jilin, Peoples R China
Keywords
CNN; SVD; active learning; gradient descent; text classification
DOI
10.20965/jaciii.2019.p0980
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Traditional convolutional neural networks (CNNs) use a pooling layer to reduce the dimensionality of text representations, but pooling discards semantic information. To address this problem, this paper proposes a convolutional neural network model based on the singular value decomposition algorithm (SVD-CNN). First, an improved density-based center point clustering active learning sampling algorithm (DBC-AL) is used to obtain a high-quality training set at a low labelling cost. Second, the method replaces the pooling layer with singular value decomposition for feature extraction and dimensionality reduction, fuses the dimensionality-reduced matrix, and completes the barrage text classification task. Finally, the partial sampling gradient descent algorithm (PSGD) is applied to optimize the model parameters, which speeds up convergence while keeping training stable. To verify the effectiveness of the improved algorithm, the proposed model was compared with common text classification models on several barrage datasets. The experimental results show that the improved algorithm preserves the semantic features of the text better, keeps the training process stable, and improves the convergence speed of the model. Further, the model's classification performance on different barrage texts is superior to that of traditional algorithms.
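The contrast between pooling and SVD-based reduction described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' SVD-CNN: the abstract does not specify how the decomposition is applied, so the feature-map shape (positions by filters), the choice of k, and the helper names max_pool, svd_reduce, and retained_energy are all assumptions made for illustration.

```python
import numpy as np


def max_pool(feature_map):
    """Global max pooling: keep only the largest activation of each filter."""
    return feature_map.max(axis=0)


def svd_reduce(feature_map, k):
    """Project a (positions, filters) feature map onto its top-k singular directions.

    Hypothetical stand-in for a pooling layer; returns shape (positions, k).
    """
    u, s, _ = np.linalg.svd(feature_map, full_matrices=False)
    return u[:, :k] * s[:k]


def retained_energy(feature_map, k):
    """Fraction of the squared Frobenius norm kept by the top-k singular values."""
    s = np.linalg.svd(feature_map, compute_uv=False)
    return float((s[:k] ** 2).sum() / (s ** 2).sum())


# Toy convolution output: 20 text positions, 8 filters (assumed sizes).
rng = np.random.default_rng(0)
feature_map = rng.normal(size=(20, 8))

pooled = max_pool(feature_map)          # one value per filter
reduced = svd_reduce(feature_map, k=3)  # k values per position
energy = retained_energy(feature_map, k=3)
```

Max pooling keeps one number per filter and drops everything else, whereas the truncated SVD keeps the k directions that carry the most variance, and retained_energy gives a rough measure of how much of the feature map survives the reduction.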
Pages: 980-989
Page count: 10