Semi-automatic mining of correlated data from a complex database: correlation network visualization

被引:0
|
作者
Lexa, Matej [1 ]
Lapar, Radovan [1 ]
机构
[1] Masaryk Univ, Fac Informat, Bot 68a, Brno 60200, Czech Republic
关键词
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
In previous work we have addressed the issue of frequent ad-hoc queries in deeply-structured databases. We wrote a library of functions AutodenormLib.py for issuing proper JOIN commands to denormalize an arbitrary subset of stored data for downstream processing. This may include statistical analysis, visualization or machine learning. Here, we visualize the content of the Thalamoss biomedical database as a correlation network. The network is created by calculating pairwise correlations through all pairs of variables, whether they be numerical, ordinal or nominal. We subsequently construct the network over the entire set of variables, clustering variables with similar effects to discover group relationships between the various biomedical characteristics. We use a semi-automatic procedure that makes the selection of all pairs possible and discuss issues of dealing with different types of variables. This is done either by limiting the analysis to numerical and ordinal ones, or by binning their values into intervals of values. Knowledge extracted from the data in this mode can be used to select variables for statistical models, or as markers of medically interesting conditions.
引用
收藏
页数:2
相关论文
共 50 条
  • [1] A Semi-automatic Approach for Tech Mining and Interactive Taxonomy Visualization
    Karakatsanis, Ioannis
    Tsoupos, Alexandros
    Woon, Wei Lee
    [J]. DATA ANALYTICS FOR RENEWABLE ENERGY INTEGRATION (DARE 2016), 2017, 10097 : 102 - 115
  • [2] Data Mining Techniques for Semi-Automatic Signature Generation
    Tylman, Wojciech
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DEPENDABILITY OF COMPUTER SYSTEMS, 2009, : 210 - 217
  • [3] Semi-automatic construction of ontology based on data mining technique
    Wang, Jingyun
    Flanagan, Brendan
    Ogata, Hiroaki
    [J]. 2017 6TH IIAI INTERNATIONAL CONGRESS ON ADVANCED APPLIED INFORMATICS (IIAI-AAI), 2017, : 511 - 515
  • [4] Semi-automatic, semantic discovery of properties from database schemes
    Palopoli, L
    Sacca, D
    Ursino, D
    [J]. IDEAS 98 - INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 1998, : 244 - 253
  • [5] Semi-Automatic Generation of Competency Maps Based on Educational Data Mining
    Alfonso, David
    Manjarres, Angeles
    Pickin, Simon
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (02) : 744 - 760
  • [6] A data mining methodology and its application to semi-automatic knowledge acquisition
    Klemettinen, M
    Mannila, H
    Toivonen, H
    [J]. EIGHTH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 1997, : 670 - 677
  • [7] Semi-Automatic Generation of Competency Maps Based on Educational Data Mining
    David Alfonso
    Angeles Manjarrés
    Simon Pickin
    [J]. International Journal of Computational Intelligence Systems, 2019, 12 : 744 - 760
  • [8] Semi-automatic techniques for deriving interscheme properties from database schemes
    Palopoli, L
    Saccà, D
    Ursino, D
    [J]. DATA & KNOWLEDGE ENGINEERING, 1999, 30 (03) : 239 - 273
  • [9] Semi-Automatic Pipe Network Reconstruction Using Point Cloud Data
    Rodrigues, Patrick B.
    Crovella, Paul L.
    [J]. CONSTRUCTION RESEARCH CONGRESS 2022: COMPUTER APPLICATIONS, AUTOMATION, AND DATA ANALYTICS, 2022, : 1086 - 1095
  • [10] AlignVis: Semi-automatic Alignment and Visualization of Parallel Translations
    Alharbi, Mohammad
    Cheesman, Tom
    Laramee, Robert S.
    [J]. 2020 24TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV 2020), 2020, : 98 - 108