IMPROVED TEXT FEATURE SELECTION ALGORITHMS IN CLASSIFICATION SEARCH OF ENVIRONMENTAL PROTECTION INFORMATION

被引:0
|
作者
Yang, Rongjie [1 ]
Man, Shuai [1 ]
机构
[1] Langfang Hlth Vocat Coll, Langfang, Hebei, Peoples R China
来源
JOURNAL OF ENVIRONMENTAL PROTECTION AND ECOLOGY | 2019年 / 20卷 / 03期
关键词
searching quality; characteristic variable; classification accuracy; artificial intelligence; OPTIMIZATION;
D O I
暂无
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The improvement of classification performance can improve the accuracy of data search engine. Through classification, we can find the information we need from the vast amount of network information to improve the search accuracy. Traditional text categorisation takes text search and selection as its basic algorithm, and its efficiency of search and classification is not high. In this paper, the authors studies are directed to an improved text feature selection algorithms based on decentralisation optimisation and its application in classification search of environmental protection information. This paper focuses on feature selection and discusses text categorisation. Aiming at the phenomenon that the concentration degree in TFFS algorithm tends to cause low frequent feature items to produce high weights and the role of feature items whose dispersion neglects the negative mutual information value in the categorisation, this paper improves the concentration and dispersion degree in TFFS. Finally, an improved feature selection is proposed based on the length information of feature items. The proposed algorithm reduces the weight of low frequent feature items and strengthens the role of negative feature in classification.
引用
收藏
页码:1462 / 1469
页数:8
相关论文
共 50 条
  • [1] Information-theoretic feature selection algorithms for text classification
    Novovicová, J
    Malík, A
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 3272 - 3277
  • [2] Feature selection using improved mutual information for text classification
    Novovicová, J
    Malík, A
    Pudil, P
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 1010 - 1017
  • [3] Feature selection algorithm for text classification based on improved mutual information
    丛帅
    张积宾
    徐志明
    王宇颖
    Journal of Harbin Institute of Technology(New series), 2011, (03) : 144 - 148
  • [4] Feature Selection For Text Classification Using Genetic Algorithms
    Bidi, Noria
    Elberrichi, Zakaria
    PROCEEDINGS OF 2016 8TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION & CONTROL (ICMIC 2016), 2016, : 806 - 810
  • [5] An improved global feature selection scheme for text classification
    Uysal, Alper Kursat
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 43 : 82 - 92
  • [6] Feature Selection for Text Classification Using Mutual Information
    Sel, Ilhami
    Karci, Ali
    Hanbay, Davut
    2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,
  • [7] Improved Mutual Information Method For Text Feature Selection
    Ding Xiaoming
    Tang Yan
    PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2013), 2013, : 163 - 166
  • [8] Improved Document Feature Selection with Categorical Parameter for Text Classification
    Wang, Fen
    Li, Xiaoxuan
    Huang, Xiaotao
    Kang, Ling
    MOBILE, SECURE, AND PROGRAMMABLE NETWORKING (MSPN 2016), 2016, 10026 : 86 - 98
  • [9] Feature selection and machine learning algorithms for uyghur text sentiment classification
    Turhuntay, Raxida
    Slamu, Wushour
    Dawut, Abdusalam
    Hamdulla, Askar
    Turhun, Erxat
    Boletin Tecnico/Technical Bulletin, 2017, 55 (13): : 56 - 66
  • [10] A Comprehensive Study of Eleven Feature Selection Algorithms and their Impact on Text Classification
    Vora, Suchi
    Yang, Hui
    2017 COMPUTING CONFERENCE, 2017, : 440 - 449