Hypergraph-based importance assessment for binary classification data

Cited by: 1
Authors
Misiorek, Pawel [1]
Janowski, Szymon [1]
Affiliations
[1] Poznan Univ Tech, Inst Comp Sci, Piotrowo 3, PL-60965 Poznan, Poland
Keywords
Hypergraphs; Machine learning; Imbalanced data; Random undersampling; Feature selection; GRAPH EDIT DISTANCE; COMPUTATION; ALGORITHM; NETWORK
DOI
10.1007/s10115-022-01786-2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a novel hypergraph-based framework enabling an assessment of the importance of binary classification data elements. Specifically, we apply the hypergraph model to rate data samples' and categorical feature values' relevance to classification labels. The proposed Hypergraph-based Importance ratings are theoretically grounded on the hypergraph cut conductance minimization concept. As a result of using hypergraph representation, which is a lossless representation from the perspective of higher-order relationships in data, our approach allows for more precise exploitation of the information on feature and sample coincidences. The solution was tested using two scenarios: undersampling for imbalanced classification data and feature selection. The experimentation results have proven the good quality of the new approach when compared with other state-of-the-art and baseline methods for both scenarios measured using the average precision evaluation metric.
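The cut conductance idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes one common construction, in which each sample is a vertex and each (feature, value) pair is an unweighted hyperedge joining the samples that share that value, and it uses the standard definition of conductance as the number of cut hyperedges divided by the smaller of the two side volumes. The function names `build_hyperedges` and `conductance` and the toy data are hypothetical.

```python
from collections import defaultdict


def build_hyperedges(rows):
    """Map each (feature index, value) pair to the set of sample indices sharing it.

    Assumption: samples are vertices and categorical feature values are hyperedges.
    The paper may use a different construction.
    """
    edges = defaultdict(set)
    for sample_idx, row in enumerate(rows):
        for feature_idx, value in enumerate(row):
            edges[(feature_idx, value)].add(sample_idx)
    return dict(edges)


def conductance(edges, subset, n_vertices):
    """Conductance of the cut (subset, complement) in an unweighted hypergraph."""
    subset = set(subset)
    complement = set(range(n_vertices)) - subset
    # A hyperedge is cut when it has members on both sides.
    cut = sum(
        1 for members in edges.values()
        if members & subset and members & complement
    )
    # Volume of a side = sum of vertex degrees = incidences falling on that side.
    vol_subset = sum(len(members & subset) for members in edges.values())
    vol_complement = sum(len(members & complement) for members in edges.values())
    denom = min(vol_subset, vol_complement)
    return cut / denom if denom else float("inf")


# Toy data: two categorical features, four samples, binary labels.
rows = [
    ("red", "small"),
    ("red", "large"),
    ("blue", "small"),
    ("blue", "large"),
]
labels = [1, 1, 0, 0]

edges = build_hyperedges(rows)
positives = [i for i, y in enumerate(labels) if y == 1]
phi = conductance(edges, positives, len(rows))
print(phi)
```

In this toy example the colour feature separates the two classes perfectly (its hyperedges are never cut), while the size feature straddles both classes (its hyperedges are always cut). A low conductance for the label-induced cut therefore indicates that feature-value co-occurrences align well with the class labels.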
Pages: 1657-1683
Page count: 27
Related papers
50 in total
  • [31] Knowledge hypergraph-based approach for data integration and querying: Application to Earth Observation
    Masmoudi, Maroua
    Ben Lamine, Sana Ben Abdallah
    Zghal, Hajer Baazaoui
    Archimede, Bernard
    Karray, Mohamed Hedi
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 115 : 720 - 740
  • [32] A HYPERGRAPH-BASED INTERCONNECTION NETWORK FOR LARGE MULTICOMPUTERS
    MACKENZIE, LM
    OULDKHAOUA, M
    SUTHERLAND, RJ
    KELLY, T
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 634 : 837 - 838
  • [33] A Framework of Hypergraph-Based Data Placement Among Geo-Distributed Datacenters
    Yu, Boyang
    Pan, Jianping
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2020, 13 (03) : 395 - 409
  • [34] Hypergraph-based netlist hierarchical clustering algorithm
    National ASIC Design Engineering Center, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao, 2009, 1 (44-52):
  • [35] A hypergraph-based learning algorithm for classifying gene expression and arrayCGH data with prior knowledge
    Tian, Ze
    Hwang, TaeHyun
    Kuang, Rui
    BIOINFORMATICS, 2009, 25 (21) : 2831 - 2838
  • [36] HYPERGRAPH-BASED MODELING OF MANUFACTURING SERVICES IN CLOUD MANUFACTURING
    Yu, Meng
    Xu, Wenjun
    Hu, Jiwei
    Zhou, Zude
    Duc Truong Pham
    PROCEEDINGS OF THE ASME 12TH INTERNATIONAL MANUFACTURING SCIENCE AND ENGINEERING CONFERENCE - 2017, VOL 3, 2017,
  • [37] Hypergraph-based marine 3D data model's design and application
    Department of Surveying and Geo-Informatics, Tongji University, Shanghai 200092, China
    Tongji Daxue Xuebao, 2008, 6 (832-836):
  • [38] Dashboard by-Example: A Hypergraph-based approach to On-demand Data warehousing systems
    Duong Thi Anh Hoang
    Thanh Binh Nguyen
    Tjoa, A. Min
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 1853 - 1858
  • [39] Support vector machine with hypergraph-based pairwise constraints
    Hou, Qiuling
    Lv, Meng
    Zhen, Ling
    Jing, Ling
    SPRINGERPLUS, 2016, 5
  • [40] Scalable Hypergraph-based Image Retrieval and Tagging System
    Chen, Lu
    Gao, Yunjun
    Zhang, Yuanliang
    Wang, Sibo
    Zheng, Baihua
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 257 - 268