A user-guided Bayesian framework for ensemble feature selection in life science applications (UBayFS)

Cited by: 5
Authors
Jenul, Anna [1]
Schrunner, Stefan [1]
Pilz, Jurgen [2]
Tomic, Oliver [1]
Affiliations
[1] Norwegian Univ Life Sci, Dept Data Sci, As, Norway
[2] Univ Klagenfurt, Dept Stat, Klagenfurt, Austria
Keywords
Ensemble feature selection; Bayesian model; Dirichlet-multinomial; User constraints; CANCER; CLASSIFICATION; DIAGNOSIS;
DOI
10.1007/s10994-022-06221-9
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Feature selection reduces the complexity of high-dimensional datasets and helps to gain insights into systematic variation in the data. These aspects are essential in domains that rely on model interpretability, such as life sciences. We propose a (U)ser-Guided (Bay)esian Framework for (F)eature (S)election, UBayFS, an ensemble feature selection technique embedded in a Bayesian statistical framework. Our generic approach considers two sources of information: data and domain knowledge. From data, we build an ensemble of feature selectors, described by a multinomial likelihood model. Using domain knowledge, the user guides UBayFS by weighting features and penalizing feature blocks or combinations, implemented via a Dirichlet-type prior distribution. Hence, the framework combines three main aspects: ensemble feature selection, expert knowledge, and side constraints. Our experiments demonstrate that UBayFS (a) allows for a balanced trade-off between user knowledge and data observations and (b) achieves accurate and robust results.
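
The interplay between the multinomial likelihood and the Dirichlet-type prior described in the abstract can be illustrated with the standard Dirichlet-multinomial conjugate update. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function names (`posterior_alpha`, `posterior_mean`, `top_k`), the toy ensemble, and the prior weights are hypothetical, and the penalization of feature blocks or combinations (side constraints) is omitted entirely.

```python
# Illustrative sketch only: a plain Dirichlet-multinomial update.
# All names and numbers here are assumptions made for illustration.

def posterior_alpha(selections, prior_alpha):
    """Add per-feature selection counts to the Dirichlet prior parameters.

    selections: list of 0/1 lists, one per elementary feature selector.
    prior_alpha: positive user-defined prior weights, one per feature.
    """
    n = len(prior_alpha)
    if any(a <= 0 for a in prior_alpha):
        raise ValueError("Dirichlet parameters must be positive")
    counts = [0] * n
    for run in selections:
        if len(run) != n:
            raise ValueError("each selector must cover all features")
        for j, picked in enumerate(run):
            counts[j] += int(picked)
    # Conjugacy: posterior parameters = prior parameters + observed counts.
    return [a + c for a, c in zip(prior_alpha, counts)]


def posterior_mean(alpha):
    """Expected feature importance under a Dirichlet(alpha) distribution."""
    total = sum(alpha)
    return [a / total for a in alpha]


def top_k(scores, k):
    """Indices of the k highest-scoring features (ties go to the lower index)."""
    order = sorted(range(len(scores)), key=lambda j: (-scores[j], j))
    return sorted(order[:k])


# Toy ensemble: three selectors, four features, each selector picks two.
selections = [
    [1, 1, 0, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 0],
]

uniform_prior = [1, 1, 1, 1]   # no expert preference
expert_prior = [1, 1, 1, 5]    # hypothetical expert belief in feature 3

data_driven = top_k(posterior_mean(posterior_alpha(selections, uniform_prior)), 2)
user_guided = top_k(posterior_mean(posterior_alpha(selections, expert_prior)), 2)
```

In this toy setting the uniform prior lets the ensemble counts decide, while a sufficiently large prior weight brings in a feature that no selector picked, which is one simple way to picture the trade-off between user knowledge and data observations.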
Pages: 3897-3923
Number of pages: 27