Research on feature classification method of network text data based on association rules

被引:1
|
作者
Huang H. [1 ]
机构
[1] Guangxi Colleges and Universities Key Laboratory of Image Processing and Intelligent Information System, Wuzhou University, Wuzhou
关键词
Association rules; network text data; support vector machine;
D O I
10.1080/1206212X.2018.1475333
中图分类号
学科分类号
摘要
Due to the large number of features, sparse data, low precision of feature extraction, and long time-consuming. Using the current method to classify the text data features, it is difficult to achieve better results. A method of feature classification of network text data based on association rules is proposed. A single word is used as a classification feature to extract the association feature item of text data. The feature item after dimension reduction is to construct classifier. Through the support vector machine algorithm, the network text data feature classification is realized. The highest precision ratio of Hangxia et al.’s study [Hangxia Z, Jiajun Y, Huan R. Text categorization based on deep belief network. Comput Eng Sci. 2016;38(5):871–876] is 69%, the highest precision ratio of Wenjuan et al.’s study [Wenjuan S, Shun L, Fei Y. Iterative text classification framework based on background learning. Comput Eng Applic. 2015;51(9):129–134] is 85%, and the highest precision ratio of the proposed method is 93%. The precision of this method is higher, which shows that the method can accurately reflect the feature class information of text data and reduce the error rate of text classification. Experimental results show that the proposed method can improve the accuracy of classification results and has high robustness. © 2018, © 2018 Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:157 / 163
页数:6
相关论文
共 50 条
  • [31] Research on Enterprise Hidden Danger Association Rules Based on Text Analysis
    Ge, Shengxin
    Zhuang, Yufeng
    Hu, Yanzhu
    Ai, Xinbo
    2018 4TH INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SCIENCE AND MATERIAL APPLICATION, 2019, 252
  • [32] An Association Rules-Based Method for Outliers Cleaning of Measurement Data in the Distribution Network
    Kuang, Hua
    Qin, Risheng
    He, Mi
    He, Xin
    Duan, Ruimin
    Guo, Cheng
    Meng, Xian
    FRONTIERS IN ENERGY RESEARCH, 2021, 9
  • [33] Genetic Network Programming based data mining method for extracting fuzzy association rules
    Taboada, Karla
    Gonzales, Eloy
    Shimada, Kaoru
    Mabu, Shingo
    Hirasawa, Kotaro
    2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 1756 - 1763
  • [34] The Research Of Feature Selection Of Text Classification Based On Integrated Learning Algorithm
    Xia Huosong
    Liu Jian
    2011 TENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE (DCABES), 2011, : 20 - 22
  • [35] RESEARCH OF DATA MINING ALGORITHM BASED ON ASSOCIATION RULES
    Song, Changxin
    Ma, Ke
    PROCEEDINGS OF THE 2011 3RD INTERNATIONAL CONFERENCE ON FUTURE COMPUTER AND COMMUNICATION (ICFCC 2011), 2011, : 243 - +
  • [36] An improvement of text association classification using rules weights
    Chen, XY
    Chen, Y
    Li, RL
    Hu, YF
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 355 - 363
  • [37] A New Feature Selection Method for Text Classification Based on Independent Feature Space Search
    Liu, Yong
    Ju, Shenggen
    Wang, Junfeng
    Su, Chong
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [38] A feature selection method based on synonym merging in text classification system
    Haipeng Yao
    Chong Liu
    Peiying Zhang
    Luyao Wang
    EURASIP Journal on Wireless Communications and Networking, 2017
  • [39] Few-shot Text Classification Method Based on Feature Optimization
    Peng, Jing
    Huo, Shuquan
    JOURNAL OF WEB ENGINEERING, 2023, 22 (03): : 497 - 514
  • [40] A feature selection method based on synonym merging in text classification system
    Yao, Haipeng
    Liu, Chong
    Zhang, Peiying
    Wang, Luyao
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2017,