Hypergraph-based importance assessment for binary classification data

Authors
Misiorek, Pawel [1]
Janowski, Szymon [1]
Affiliations
[1] Poznan Univ Tech, Inst Comp Sci, Piotrowo 3, PL-60965 Poznan, Poland
Keywords
Hypergraphs; Machine learning; Imbalanced data; Random undersampling; Feature selection; Graph edit distance; Computation; Algorithm; Network
DOI
10.1007/s10115-022-01786-2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We present a novel hypergraph-based framework enabling an assessment of the importance of binary classification data elements. Specifically, we apply the hypergraph model to rate data samples' and categorical feature values' relevance to classification labels. The proposed Hypergraph-based Importance ratings are theoretically grounded on the hypergraph cut conductance minimization concept. As a result of using hypergraph representation, which is a lossless representation from the perspective of higher-order relationships in data, our approach allows for more precise exploitation of the information on feature and sample coincidences. The solution was tested using two scenarios: undersampling for imbalanced classification data and feature selection. The experimentation results have proven the good quality of the new approach when compared with other state-of-the-art and baseline methods for both scenarios measured using the average precision evaluation metric.
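The abstract does not spell out the construction, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that each (feature, value) pair forms one unit-weight hyperedge over the samples that share it, and that a hyperedge counts as cut when it contains samples from both classes. The helper names `build_hyperedges` and `cut_conductance` are hypothetical.

```python
from collections import defaultdict


def build_hyperedges(rows):
    """Map each (feature index, value) pair to the set of sample indices having it."""
    edges = defaultdict(set)
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            edges[(j, value)].add(i)
    return dict(edges)


def cut_conductance(edges, side, n_samples):
    """Conductance of the cut (side, rest) with unit hyperedge weights.

    Assumption: a hyperedge is cut if it has members on both sides, and the
    volume of a side is the sum of its vertices' degrees.
    """
    side = set(side)
    other = set(range(n_samples)) - side
    cut = sum(1 for members in edges.values()
              if members & side and members & other)
    vol_side = sum(len(members & side) for members in edges.values())
    vol_other = sum(len(members & other) for members in edges.values())
    denom = min(vol_side, vol_other)
    return cut / denom if denom else float("inf")


# Toy categorical data: feature 0 separates the classes, feature 1 does not.
rows = [("red", "small"), ("red", "large"), ("blue", "small"), ("blue", "large")]
labels = [1, 1, 0, 0]

edges = build_hyperedges(rows)
positives = [i for i, y in enumerate(labels) if y == 1]
phi = cut_conductance(edges, positives, len(rows))
print(phi)
```

In this toy data the two hyperedges of feature 0 stay on one side of the label cut, while both hyperedges of feature 1 are cut. A lower conductance for the label cut indicates that the feature values align more closely with the class labels, which is the intuition behind rating importance by conductance.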
Pages: 1657-1683
Page count: 27
Source: Knowledge and Information Systems, 2023, 65: 1657-1683