A user-guided Bayesian framework for ensemble feature selection in life science applications (UBayFS)

被引:5
|
作者
Jenul, Anna [1 ]
Schrunner, Stefan [1 ]
Pilz, Jurgen [2 ]
Tomic, Oliver [1 ]
机构
[1] Norwegian Univ Life Sci, Dept Data Sci, As, Norway
[2] Univ Klagenfurt, Dept Stat, Klagenfurt, Austria
关键词
Ensemble feature selection; Bayesian model; Dirichlet-multinomial; User constraints; CANCER; CLASSIFICATION; DIAGNOSIS;
D O I
10.1007/s10994-022-06221-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection reduces the complexity of high-dimensional datasets and helps to gain insights into systematic variation in the data. These aspects are essential in domains that rely on model interpretability, such as life sciences. We propose a (U)ser-Guided (Bay)esian Framework for (F)eature (S)election, UBayFS, an ensemble feature selection technique embedded in a Bayesian statistical framework. Our generic approach considers two sources of information: data and domain knowledge. From data, we build an ensemble of feature selectors, described by a multinomial likelihood model. Using domain knowledge, the user guides UBayFS by weighting features and penalizing feature blocks or combinations, implemented via a Dirichlet-type prior distribution. Hence, the framework combines three main aspects: ensemble feature selection, expert knowledge, and side constraints. Our experiments demonstrate that UBayFS (a) allows for a balanced trade-off between user knowledge and data observations and (b) achieves accurate and robust results.
引用
收藏
页码:3897 / 3923
页数:27
相关论文
共 50 条
  • [41] Bayesian tracking fusion framework with online classifier ensemble for immersive visual applications
    Zhang, Peng
    Zhuo, Tao
    Zhang, Yanning
    Huang, Hanqiao
    Chen, Kangli
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (09) : 5075 - 5092
  • [42] Bayesian tracking fusion framework with online classifier ensemble for immersive visual applications
    Peng Zhang
    Tao Zhuo
    Yanning Zhang
    Hanqiao Huang
    Kangli Chen
    Multimedia Tools and Applications, 2016, 75 : 5075 - 5092
  • [43] Feature Selection in Life Science Classification: Metaheuristic Swarm Search
    Fong, Simon
    Deb, Suash
    Yang, Xin-She
    Li, Jinyan
    IT PROFESSIONAL, 2014, 16 (04) : 24 - 29
  • [44] The Detection of breast cancer based on Dynamic Feature Selection with EM-Bayesian Ensemble Classifier
    Fu, Qiang
    Feng, Jun
    Wang, Huiya
    PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 88 - +
  • [45] Enhancing software defect prediction: a framework with improved feature selection and ensemble machine learning
    Ali, Misbah
    Mazhar, Tehseen
    Al-Rasheed, Amal
    Shahzad, Tariq
    Ghadi, Yazeed Yasin
    Khan, Muhammad Amir
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [46] Hybrid Feature Selection and Heterogeneous Clustering Ensemble Framework for Detection of Circulating Tumor Cells
    Mythili, S.
    Kumar, A. V. Senthil
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2016, 6 (05) : 1160 - 1166
  • [47] An Efficient Intrusion Detection Framework Based on Embedding Feature Selection and Ensemble Learning Technique
    Mokbal, Fawaz
    Dan, Wang
    Osman, Musa
    Ping, Yang
    Alsamhi, Saeed
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2022, 19 (02) : 237 - 248
  • [48] Stacking-Based Ensemble Framework and Feature Selection Technique for the Detection of Breast Cancer
    Chaurasia V.
    Pal S.
    SN Computer Science, 2021, 2 (2)
  • [49] An ensemble-based feature selection framework for early detection of Parkinson's disease based on feature correlation analysis
    Masood, Sarfaraz
    Maqsood, Khwaja Wisal
    Pal, Om
    Kumar, Chanchal
    MATHEMATICAL METHODS IN THE APPLIED SCIENCES, 2021,
  • [50] An effective ensemble classification framework using random forests and a correlation based feature selection technique
    Chutia, Dibyajyoti
    Bhattacharyya, Dhruba Kumar
    Sarma, Jaganath
    Raju, Penumetcha Narasa Lakshmi
    TRANSACTIONS IN GIS, 2017, 21 (06) : 1165 - 1178