Feature selection algorithms to improve documents' classification performance

被引:0
|
作者
Sousa, PAC [1 ]
Pimentao, JP
Santos, BRD
Moura-Pires, F
机构
[1] Univ Nova Lisboa, Fac Ciencias & Tecnol, Caparica, Portugal
[2] Univ Nova Lisboa, Inst Desenvolvimento Novas Tecnol, Caparica, Portugal
[3] Univ Evora, Dept Informat, Evora, Portugal
来源
ADVANCES IN WEB INTELLIGENCE | 2003年 / 2663卷
关键词
text learning; multi-agents systems; feature selection; information retrieval;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a study where feature selection algorithms were evaluated in order to improve documents' classification performance. The study was made during the project DEEPSIA, IST project Nr. 1999-20 283, funded by the European Union. The need to improve documents recognition was imposed by the need to increase the overall performance of the Framework for Internet data collection based on intelligent agents, used within the project. The Framework is briefly described and the learning techniques used are presented. The focus of this paper is on the feature selection algorithms, where the most relevant work was the use of Conditional Mutual Information, estimated using genetic algorithms, since the computational complexity of C-K(N) invalidated an iterative approach. Methods, techniques and comparative results are presented in detail.
引用
收藏
页码:288 / 296
页数:9
相关论文
共 50 条
  • [1] Using Cost-Sensitive Learning and Feature Selection Algorithms to Improve the Performance of Imbalanced Classification
    Feng, Fang
    Li, Kuan-Ching
    Shen, Jun
    Zhou, Qingguo
    Yang, Xuhui
    [J]. IEEE ACCESS, 2020, 8 : 69979 - 69996
  • [2] Performance Evaluation of Feature Selection Algorithms on Human Activity Classification
    Tulum, Gokalp
    Artug, N. Tugrul
    Bolat, Bulent
    [J]. 2013 IEEE INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (IEEE INISTA), 2013,
  • [3] Genetic Algorithms and Feature Selection for Improving the Classification Performance in Healthcare
    Alassaf, Alaa
    Alarbeed, Eman
    Alrasheed, Ghady
    Almirdasie, Abdulsalam
    Almutairi, Shahd
    Al-Hagery, Mohammed Abullah
    Saeed, Faisal
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (03) : 737 - 744
  • [4] Feature selection and text classification for Chinese web documents
    Xu, JC
    Liu, DY
    Hu, M
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1304 - 1309
  • [5] Feature Bundles and their Effect on the Performance of Tree-based Evolutionary Classification and Feature Selection Algorithms
    Neshatian, Kourosh
    Varn, Lucianne
    [J]. 2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 1612 - 1619
  • [6] Comparison of classification algorithms using feature selection
    Juarez-Lopez, Alexander
    Hernandez-Torruco, Jose
    Hernandez-Ocana, Betania
    Chavez-Bosquez, Oscar
    [J]. 2021 MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE (ENC 2021), 2021,
  • [7] Genetic algorithms for clustering, feature selection and classification
    Tseng, LY
    Yang, SB
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 1612 - 1616
  • [8] Performance evaluation of feature selection and tree-based algorithms for traffic classification
    Aouedi, Ons
    Piamrat, Kandaraj
    Parrein, Benoit
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [9] Leveraging Multimodal and Feature Selection Approaches to Improve Sleep Apnea Classification Performance
    Memis, Gokhan
    Sert, Mustafa
    Yazici, Adnan
    [J]. 2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [10] FEATURE SELECTION ALGORITHMS TO IMPROVE THYROID DISEASE DIAGNOSIS
    Pavya, K.
    Srinivasan, B.
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON INNOVATIONS IN GREEN ENERGY AND HEALTHCARE TECHNOLOGIES (IGEHT), 2017,