Pruning relations for substructure discovery of multi-relational databases

被引:0
|
作者
Guo, Hongyu [1 ]
Viktor, Herna L. [1 ]
Paquet, Eric [2 ]
机构
[1] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON K1N 6N5, Canada
[2] Natl Res Council Canada, Ottawa, ON, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multirelational data mining methods discover patterns across multiple interlinked tables (relations) in a relational database. In many large organizations, such a multi-relational database spans numerous departments and/or subdivisions, which are involved in different aspects of the enterprise such as customer profiling, fraud detection, inventory management, financial management, and so on. When considering multirelational classification, it follows that these subdivisions will express different interests in the data, leading to the need to explore various subsets of relevant relations with high utility with respect to the target class. The paper presents a novel approach for pruning the uninteresting relations of a relational database where relations come from such different parties and spans many classification tasks. We aim to create a pruned structure and thus minimize predictive performance loss on the final classification model. Our method identifies a set of strongly uncorrelated subgraphs to use for training and discards all others. The experiments performed demonstrate that our strategy is able to significantly reduce the size of the relational schema without sacrificing predictive accuracy.
引用
收藏
页码:462 / +
页数:3
相关论文
共 50 条
  • [1] Classification of Multi-relational Databases
    Wang, Xinchun
    Zhang, Sujuan
    [J]. APPLIED INFORMATICS AND COMMUNICATION, PT 2, 2011, 225 : 390 - +
  • [2] Multi-relational data mining in medical databases
    Habrard, A
    Bernard, M
    Jacquenet, F
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, PROCEEDINGS, 2003, 2780 : 365 - 374
  • [3] A multi-relational rule discovery system
    Uludag, M
    Tolun, MR
    Etzold, T
    [J]. COMPUTER AND INFORMATION SCIENCES - ISCIS 2003, 2003, 2869 : 252 - 259
  • [4] Multi-Relational Concept Discovery with Aggregation
    Kavurucu, Yusuf
    Senkul, Pinar
    Toroslu, I. Hakki
    [J]. 2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 247 - 252
  • [5] An algorithm for multi-relational discovery of subgroups
    Wrobel, S
    [J]. PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1997, 1263 : 78 - 87
  • [6] Interesting instance discovery in multi-relational data
    Lin, SD
    [J]. PROCEEDING OF THE NINETEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE SIXTEENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2004, : 991 - 992
  • [7] Learning from Skewed Class Multi-relational Databases
    Guo, Hongyu
    Viktor, Herna L.
    [J]. FUNDAMENTA INFORMATICAE, 2008, 89 (01) : 69 - 94
  • [8] Multi-Relational Script Learning for Discourse Relations
    Lee, I-Ta
    Goldwasser, Dan
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4214 - 4226
  • [9] Influencer discovery algorithm in a multi-relational network
    Ma, Ning
    Liu, Yijun
    Chi, Yuxue
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 510 : 415 - 425
  • [10] Exploring multi-relational temporal databases with a propositional sequence miner
    Ferreira, Carlos Abreu
    Gama, Joao
    Costa, Vitor Santos
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE, 2015, 4 (1-2) : 11 - 20