Leveraging Label-Independent Features for Classification in Sparsely Labeled Networks: An Empirical Study

被引:21
|
作者
Gallagher, Brian [1 ]
Eliassi-Rad, Tina [1 ]
机构
[1] Lawrence Livermore Natl Lab, Livermore, CA 94551 USA
关键词
Statistical relational learning; semi-supervised learning; social network analysis; feature extraction; collective classification;
D O I
10.1109/AOM.2010.5713574
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We address the problem of within-network classification in sparsely labeled networks Recent work has demonstrated success with statistical relational learning (SRL) and semi-supervised learning (SSL) on such problems However; both approaches rely on the availability of labeled nodes to infer the values of missing labels When few labels are available; the performance of these approaches can degrade In addition, many such approaches are sensitive to the specific set of nodes labeled So; although average performance may be acceptable, the performance on a specific task may not We explore a complimentary approach to within-network classification, based on the use of label-independent (LI) features i.e., features calculated without using the values of class labels While previous work has made sonic use of LE features; the effects of these features on classification performance have not been extensively studied Here, we present an empirical study in order to better understand these effects Through experiments on several real-world data sets, we show that the use of LI features produces classifiers that are less sensitive to specific label assignments and can lead to performance improvements of over 40% for both SRL- and SSL-based classifiers We also examine the relative utility of individual LI features, and show that, in many cases, it is a combination of a few diverse network-based structural characteristics that is most informative
引用
收藏
页码:1 / 19
页数:19
相关论文
共 40 条
  • [21] Automatic Classification of Sexism in Social Networks: An Empirical Study on Twitter Data
    Rodriguez-Sanchez, Francisco
    Carrillo-de-Albornoz, Jorge
    Plaza, Laura
    [J]. IEEE ACCESS, 2020, 8 : 219563 - 219576
  • [22] An Empirical Study on Software Failure Classification with Multi-Label and Problem-Transformation Techniques
    Feng, Yang
    Jones, James A.
    Chen, Zhenyu
    Fang, Chunrong
    [J]. 2018 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION (ICST), 2018, : 320 - 330
  • [23] A study of cloud classification with neural networks using spectral and textural features
    Tian, B
    Shaikh, MA
    Azimi-Sadjadi, MR
    Vonder Haar, TH
    Reinke, DL
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (01): : 138 - 151
  • [24] The effect of node features on GCN-based brain network classification: an empirical study
    Wang, Guangyu
    Zhang, Limei
    Qiao, Lishan
    [J]. PEERJ, 2023, 11
  • [25] Understanding the Roles of Sub-graph Features for Graph Classification: An Empirical Study Perspective
    Guo, Ting
    Zhu, Xingquan
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 817 - 822
  • [26] Performance of Classifiers on Noisy-Labeled Training Data: An Empirical Study on Handwritten Digit Classification Task
    Ahmad, Irfan
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2019, PT II, 2019, 11507 : 414 - 425
  • [27] Circular/wrap-around self-organizing map networks: an empirical study in clustering and classification
    Kiang, MY
    Kulkarni, UR
    St Louis, R
    [J]. JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2001, 52 (01) : 93 - 101
  • [28] Impact of Membership and Non-membership Features on Classification Decision: An Empirical Study for Appraisal of Feature Selection Methods
    Abbasi, Bushra Zaheer
    Hussain, Shahid
    Bibi, Shaista
    Shah, Munam Ali
    [J]. 2018 24TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING (ICAC' 18), 2018, : 454 - 459
  • [29] An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels
    Chalkidis, Ilias
    Fergadiotis, Manos
    Kotitsas, Sotiris
    Malakasiotis, Prodromos
    Aletras, Nikolaos
    Androutsopoulos, Ion
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 7503 - 7515
  • [30] Bayesian inference for multi-label classification for root cause analysis and probe card maintenance decision support and an empirical study
    Chien, Chen-Fu
    Peng, Jia-Yu
    [J]. JOURNAL OF INTELLIGENT MANUFACTURING, 2024,