Feature Selection for Collective Classification

被引:0
|
作者
Senliol, Baris [1 ]
Aral, Atakan [1 ]
Cataltepe, Zehra [1 ]
机构
[1] Istanbul Tech Univ, Dept Comp Engn, TR-80626 Istanbul, Turkey
关键词
Feature Selection; Collective Classification; Iterative Classification Algorithim (ICA); Minimum Redundancy Maximum Relevance (mRMR); Logistic Regression; Cora; Citeseer;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
When in addition to node contents and labels, relations (links) between nodes and some unlabeled nodes are available, collective classification algorithms can be used. Collective classification algorithms, like ICA (Iterative Classification Algorithm), determine labels for the unlabeled nodes based on the contents and/or labels of the neighboring nodes. Feature selection algorithms have been shown to improve classification accuracy for traditional machine learning algorithms. In this paper, we use a recent and successful feature selection algorithm, mRMR (Minimum Redundancy Maximum Relevance, Ding and Peng, 2003), on content features. On two scientific paper citation data sets, Cora and Citeseer, when only content information is used, we ahow that the selected features may result in almost as good performance as all the features. When feature selection is performed both on content and link information, even better classification accuracies are obtained. Feature selection considerably reduces the training time for both content only and ICA algorithms.
引用
收藏
页码:285 / 290
页数:6
相关论文
共 50 条
  • [1] Feature Selection for Classification with QAOA
    Turati, Gloria
    Dacrema, Maurizio Ferrari
    Cremonesi, Paolo
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON QUANTUM COMPUTING AND ENGINEERING (QCE 2022), 2022, : 782 - 785
  • [2] Feature Selection for Twitter Classification
    Ostrowski, David Alfred
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2014, : 267 - 272
  • [3] ONLINE FEATURE SELECTION AND CLASSIFICATION
    Kalkan, Habil
    Cetisli, Bayram
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2124 - 2127
  • [4] Feature Selection for Monotonic Classification
    Hu, Qinghua
    Pan, Weiwei
    Zhang, Lei
    Zhang, David
    Song, Yanping
    Guo, Maozu
    Yu, Daren
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (01) : 69 - 81
  • [5] Feature Selection for Gender Classification
    Zhang, Zhihong
    Hancock, Edwin R.
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS: 5TH IBERIAN CONFERENCE, IBPRIA 2011, 2011, 6669 : 76 - 83
  • [6] Feature Selection in Text Classification
    Sahin, Durmus Ozkan
    Ates, Nurullah
    Kilic, Erdal
    [J]. 2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1777 - 1780
  • [7] Sequential Feature Selection for Classification
    Rueckstiess, Thomas
    Osendorfer, Christian
    van der Smagt, Patrick
    [J]. AI 2011: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2011, 7106 : 132 - +
  • [8] FEATURE SELECTION BASED ON COMPLEMENTARITY OF FEATURE CLASSIFICATION CAPABILITY
    Gao, Fei
    Yu, Tian
    Wei, Yang
    Jin, Han
    Wei, Jin-Mao
    [J]. PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 130 - 135
  • [9] A Review on Feature Selection and Feature Extraction for Text Classification
    Shah, Foram P.
    Patel, Vibha
    [J]. PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 2264 - 2268
  • [10] Classification Algorithm Based on Feature Selection and Samples Selection
    Xu, Yitian
    Zhen, Ling
    Yang, Liming
    Wang, Laisheng
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 : 631 - 638