Supervised Taxonomies-Algorithms and Applications

被引:4
|
作者
Amalaman, Paul K. [1 ]
Eick, Christoph F. [1 ]
Wang, Chong [1 ]
机构
[1] Univ Houston, Dept Comp Sci, Houston, TX 77204 USA
关键词
Clustering; classification complexity; supervised clustering; supervised taxonomy; subclass discovery; class modality; hierarchical clustering;
D O I
10.1109/TKDE.2017.2698451
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper focuses on a new type of taxonomy called supervised taxonomy (ST). Supervised taxonomies are generated considering background information concerning class labels in addition to distance metrics, and are capable of capturing class-uniform regions in a dataset. A hierarchical, agglomerative clustering algorithm, called STAXAC that generates STs is proposed and its properties are analyzed. Experimental results are presented that show that STAXAC produces purer taxonomies than the neighbor-joining (NJ) algorithm-a very popular taxonomy generation algorithm. We introduced novel measures and algorithms that assess classification complexity, class modality, and show that STs can be used as the main input of an effective data-editing tool to enhance the accuracy of k-nearest neighbor classifiers. We demonstrated in our experimental evaluation that assessing the classification complexity of a ST provides us with a good estimate of the difficulty of the classification problem at hand. Moreover, a class modality discovery tool (CMD) has been provided that-based on a domain expert's notion of what constitutes a "note-worthy" subclass-determines if specific classes in the dataset are zero-modal, unimodal, and multi-modal.
引用
下载
收藏
页码:2040 / 2052
页数:13
相关论文
共 50 条
  • [31] Accuracy estimation for supervised learning algorithms
    Glover, CW
    Oblow, EM
    Rao, NSV
    APPLICATIONS AND SCIENCE OF ARTIFICIAL NEURAL NETWORKS III, 1997, 3077 : 794 - 802
  • [32] Dealing with the evaluation of supervised classification algorithms
    Santafe, Guzman
    Inza, Inaki
    Lozano, Jose A.
    ARTIFICIAL INTELLIGENCE REVIEW, 2015, 44 (04) : 467 - 508
  • [33] SUMATRA: Supervised Modeling of ATR Algorithms
    Narayanaswami, Ranga
    Gandhe, Avinash
    Mehra, Raman K.
    PROCEEDINGS OF THE IEEE 2010 NATIONAL AEROSPACE AND ELECTRONICS CONFERENCE (NAECON), 2010, : 99 - 106
  • [34] SUPERVISED LEARNING ALGORITHMS FOR FAMINE PREDICTION
    Okori, Washington
    Obua, Joseph
    APPLIED ARTIFICIAL INTELLIGENCE, 2011, 25 (09) : 822 - 835
  • [35] A Review of Supervised Machine Learning Algorithms
    Singh, Amanpreet
    Thakur, Narina
    Sharma, Aakanksha
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 1310 - 1315
  • [36] A Unified Methodology to Evaluate Supervised and Non-Supervised Classification Algorithms
    Godoy Calderon, Salvador
    Martinez Trinidad, Jose Francisco
    Lazo-Cortes, Manuel S.
    de Leon Santiago, Juan Luis Diaz
    COMPUTACION Y SISTEMAS, 2006, 9 (04): : 370 - 379
  • [37] Theory and applications for the supervised learning method based on gradient algorithms Part I-Fundamental algorithm
    Si, Jie
    Zhou, Guian
    Li, Han
    Han, Yingduo
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 1997, 37 (07): : 71 - 73
  • [38] Update on International Medical Taxonomies of Biomarkers and Their Applications in Management of Thyroid Cancers
    Trovato, Maria
    DIAGNOSTICS, 2022, 12 (03)
  • [39] Eco-Friendly Route Planning Algorithms: Taxonomies, Literature Review and Future Directions
    Fahmin, Ahmed
    Cheema, Muhammad Aamir
    Eunus Ali, Mohammed
    Nadjaran Toosi, Adel
    Lu, Hua
    Li, Huan
    Taniar, David
    Rakha, Hesham A.
    Shen, Bojie
    ACM Computing Surveys, 2024, 57 (01)
  • [40] Proposal for a unified methodology for evaluating supervised and non-supervised classification algorithms
    Godoy-Calderon, Salvador
    Martinez-Trinidad, J. Fco.
    Cortes, Manuel Lazo
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2006, 4225 : 674 - 685