Text Classification Model Based on fastText

Cited by: 0
Authors
Yao, Tengjun [1]
Zhai, Zhengang [1]
Gao, Bingtao [1]
Affiliation
[1] China Elect Technol Grp Corp, Inst 36, Jiaxing, Peoples R China
Keywords
Machine learning; text classification; feature engineering; emotional polarity judgment
DOI
10.1109/icaiis49377.2020.9194939
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Most text classification models based on traditional machine learning algorithms suffer from the curse of dimensionality and poor classification performance. To address these problems, this paper proposes a text classification model based on fastText. The model uses feature engineering to extract the important information contained in the text, and obtains a low-dimensional, continuous, high-quality text representation through the fastText algorithm. The experiments are implemented in Python and classify the "user comment data emotional polarity judgment" text dataset from the Baidu Dianshi platform. On this emotional polarity judgment task, the model achieves higher precision, recall, and F values than models based on traditional machine learning algorithms.
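The abstract does not include code, so the following is only a minimal sketch of the general fastText-style architecture it refers to: hashed word n-gram features, an averaged low-dimensional embedding, and a linear softmax classifier, followed by precision, recall, and F1. It is not the authors' implementation and does not use the official fastText library. The class `TinyFastText`, the helper names, the hyperparameters, and the English toy comments are all hypothetical; the paper's dataset consists of user comments from the Baidu Dianshi platform and its feature engineering steps are not described here.

```python
import math
import random
import zlib

BUCKETS = 4096  # size of the hashed n-gram table (assumed value)
DIM = 8         # embedding dimension (assumed value)

def featurize(text):
    """Hash the unigrams and bigrams of a whitespace-tokenized text to bucket ids."""
    tokens = text.lower().split()
    grams = tokens + [a + " " + b for a, b in zip(tokens, tokens[1:])]
    return [zlib.crc32(g.encode("utf-8")) % BUCKETS for g in grams]

class TinyFastText:
    """Averaged n-gram embeddings followed by a softmax layer (illustrative only)."""

    def __init__(self, n_classes, seed=0):
        rng = random.Random(seed)
        self.emb = [[rng.uniform(-1 / DIM, 1 / DIM) for _ in range(DIM)]
                    for _ in range(BUCKETS)]
        self.out = [[0.0] * DIM for _ in range(n_classes)]

    def _forward(self, ids):
        # Text representation: the mean of the embeddings of its hashed n-grams.
        hidden = [sum(self.emb[i][k] for i in ids) / len(ids) for k in range(DIM)]
        logits = [sum(w * h for w, h in zip(row, hidden)) for row in self.out]
        top = max(logits)
        exps = [math.exp(z - top) for z in logits]
        total = sum(exps)
        return hidden, [e / total for e in exps]

    def train(self, data, epochs=200, lr=0.5, seed=0):
        rng = random.Random(seed)
        data = [(featurize(text), label) for text, label in data]
        for _ in range(epochs):
            rng.shuffle(data)
            for ids, label in data:
                hidden, probs = self._forward(ids)
                # Gradient of the cross-entropy loss with respect to the logits.
                delta = [p - (1.0 if c == label else 0.0) for c, p in enumerate(probs)]
                grad_hidden = [sum(delta[c] * self.out[c][k] for c in range(len(delta)))
                               for k in range(DIM)]
                for c, d in enumerate(delta):
                    for k in range(DIM):
                        self.out[c][k] -= lr * d * hidden[k]
                for i in ids:
                    for k in range(DIM):
                        self.emb[i][k] -= lr * grad_hidden[k] / len(ids)

    def predict(self, text):
        """Return the most probable class index for a non-empty text."""
        _, probs = self._forward(featurize(text))
        return probs.index(max(probs))

def precision_recall_f1(gold, pred, positive=1):
    """Precision, recall, and F1 for the class treated as positive."""
    tp = sum(1 for g, p in zip(gold, pred) if g == positive and p == positive)
    fp = sum(1 for g, p in zip(gold, pred) if g != positive and p == positive)
    fn = sum(1 for g, p in zip(gold, pred) if g == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

# Hypothetical toy comments: label 1 = positive polarity, 0 = negative polarity.
TRAIN = [
    ("the food was great and the service was friendly", 1),
    ("great room and friendly staff", 1),
    ("i love this place the staff was wonderful", 1),
    ("wonderful food i love it", 1),
    ("the food was terrible and the service was slow", 0),
    ("terrible room and rude staff", 0),
    ("i hate this place the staff was rude", 0),
    ("awful food i hate it", 0),
]

model = TinyFastText(n_classes=2)
model.train(TRAIN)
```

Calling `model.predict(...)` on a new comment returns a class index, and `precision_recall_f1` can then be applied to the gold and predicted labels of a held-out set. Because each text is reduced to a single `DIM`-dimensional average, the representation stays low-dimensional and continuous regardless of vocabulary size, which is the property the abstract contrasts with sparse high-dimensional features.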
Pages: 154-157
Number of pages: 4