Feature discovery in classification problems

被引:0
|
作者
del Valle, M [1 ]
Sánchez, B
Lago-Fernández, LF
Corbacho, FJ
机构
[1] Univ Autonoma Madrid, Escuela Politecn Super, Madrid 28049, Spain
[2] Telefon Invest & Desarrollo, Madrid 28043, Spain
[3] Cognodata Consulting, Madrid 28010, Spain
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In most problems of Knowledge Discovery the human analyst previously constructs a new set of features, derived from the initial problem input attributes, based on a priori knowledge of the problem structure. These different features are constructed from different transformations which must be selected by the analyst. This paper provides a first step towards a methodology that allows the search for near-optimal representations in classification problems by allowing the automatic selection and composition of feature transformations from an initial set of basis functions. In many cases, the original representation for the problem data is not the most appropriate, and the search for a new representation space that is closer to the structure of the problem to be solved is critical for the successful solution of the problem. On the other hand, once this optimal representation is found, most of the problems may be solved by a linear classification method. As a proof of concept we present two classification problems where the class distributions have a very intricate overlap on the space of original attributes. For these problems, the proposed methodology is able to construct representations based on function compositions from the trigonometric and polynomial bases that provide a solution where some of the classical learning methods, e.g. multilayer perceptrons and decision trees, fail. The methodology consists of a discrete search within the space of compositions of the basis functions and a linear mapping performed by a Fisher discriminant. We play special emphasis on the first part. Finding the optimal composition of basis functions is a difficult problem because of its nongradient nature and the large number of possible combinations. We rely on the global search capabilities of a genetic algorithm to scan the space of function compositions.
引用
收藏
页码:486 / 496
页数:11
相关论文
共 50 条
  • [1] Feature extraction for classification in knowledge discovery systems
    Pechenizkiy, M
    Puuronen, S
    Tsymbal, A
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2003, 2773 : 526 - 532
  • [2] DISCOVERY AND CLASSIFICATION AS PROBLEMS IN THE PHILOSOPHY OF ASTRONOMY
    Dick, Steven J.
    JOURNAL OF ASTRONOMICAL HISTORY AND HERITAGE, 2024, 27 (04): : 931 - 950
  • [3] Feature selection for pattern classification problems
    Zhang, L
    Sun, G
    Guo, J
    FOURTH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2004, : 233 - 237
  • [4] Input feature selection for classification problems
    Kwak, N
    Choi, CH
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (01): : 143 - 159
  • [5] Analysis of Feature Selection Techniques for Classification Problems
    Adamov, Abzetdin Z.
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2021), 2021,
  • [6] Genetic Programming for Feature Ranking in Classification Problems
    Neshatian, Kourosh
    Zhang, Mengjie
    Andreae, Peter
    SIMULATED EVOLUTION AND LEARNING, PROCEEDINGS, 2008, 5361 : 544 - 554
  • [7] Feature selection for multiple binary classification problems
    Department of Biomedical Engineering, Technion - Israel Inst. T., Haifa, Israel
    Pattern Recogn. Lett., 8 (823-832):
  • [8] Feature selection for multiple binary classification problems
    Shapira, Y
    Gath, I
    PATTERN RECOGNITION LETTERS, 1999, 20 (08) : 823 - 832
  • [9] Evolutionary computation for feature selection in classification problems
    de la Iglesia, Beatriz
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2013, 3 (06) : 381 - 407
  • [10] Feature discovery in NIR spectroscopy based Rocha pear classification
    Daniel, Mariana
    Guerra, Rui
    Brazio, Antonio
    Rodrigues, Daniela
    Cavaco, Ana Margarida
    Antunes, Maria Dulce
    de Oliveira, Jose Valente
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 177