Visualization and Integration of Databases using Self-Organizing Map

Cited by: 2
Authors
Bourennani, Farid [1 ]
Pu, Ken Q. [1 ]
Zhu, Ying [1 ]
Affiliations
[1] Univ Ontario, Inst Technol, Toronto, ON, Canada
Keywords
SOM; Common Item Based Classifier (CIBC); Data Integration; Information Retrieval (IR)
DOI
10.1109/DBKDA.2009.30
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
As computer networks grow, accessible data is becoming increasingly distributed. Understanding and integrating remote and unfamiliar data sources are important data management problems. In this paper, we propose using self-organizing map (SOM) clustering to help visualize similar columns and to integrate relational database tables and attributes based on their content. To accommodate the heterogeneous data types found in relational databases, we extend the TF-IDF measure to handle numerical attribute types in addition to text, so that coinciding meanings can be extracted. We present a SOM-clustering-based visualization algorithm that lets the user browse heterogeneously typed database attributes and discover semantically similar clusters. Additionally, we propose a new algorithm, Common Item Based Classifier (CIBC), to improve the homogeneity of the clusters obtained by SOM. The discovered semantic clusters can significantly aid the manual or automated construction of data integrity constraints in data cleaning, or of schema mappings in data integration.
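The abstract's pipeline (content-based column vectors fed to a SOM) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how TF-IDF is extended to numbers, so bucketing numeric values into fixed-width ranges (`bin_width`) is an assumption, as are the smoothed IDF formula, the tiny 2x2 map, every function name, and the sample columns. CIBC is not sketched because the abstract does not describe it.

```python
import math
import random
from collections import Counter


def tokenize(value, bin_width=10):
    """Turn a cell into a token; numbers become range buckets (an assumption)."""
    if isinstance(value, (int, float)):
        low = math.floor(value / bin_width) * bin_width
        return f"num[{low},{low + bin_width})"
    return str(value).strip().lower()


def tfidf_vectors(columns, bin_width=10):
    """Treat each column as a document; return (vocabulary, unit-length vectors)."""
    docs = {name: Counter(tokenize(v, bin_width) for v in values)
            for name, values in columns.items()}
    vocab = sorted({tok for counts in docs.values() for tok in counts})
    doc_freq = Counter(tok for counts in docs.values() for tok in counts)
    n_docs = len(docs)
    vectors = {}
    for name, counts in docs.items():
        total = sum(counts.values()) or 1
        # Smoothed IDF (assumed variant) keeps shared tokens above zero weight.
        vec = [(counts[t] / total)
               * (math.log((1 + n_docs) / (1 + doc_freq[t])) + 1)
               for t in vocab]
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors[name] = [x / norm for x in vec]
    return vocab, vectors


def cosine(a, b):
    """Cosine similarity of two vectors that are already unit length."""
    return sum(x * y for x, y in zip(a, b))


def best_unit(grid, vec):
    """Grid position whose weight vector is closest to vec (Euclidean)."""
    return min(grid, key=lambda pos: sum((w - x) ** 2
                                         for w, x in zip(grid[pos], vec)))


def train_som(vectors, rows=2, cols=2, epochs=50, seed=0):
    """Train a small rectangular SOM on the column vectors."""
    rng = random.Random(seed)
    data = list(vectors.values())
    dim = len(data[0])
    grid = {(r, c): [rng.random() for _ in range(dim)]
            for r in range(rows) for c in range(cols)}
    for epoch in range(epochs):
        decay = 1 - epoch / epochs
        rate = 0.5 * decay                            # shrinking learning rate
        radius = max(rows, cols) / 2 * decay + 0.01   # shrinking neighborhood
        for vec in data:
            bmu = best_unit(grid, vec)
            for pos, weights in grid.items():
                dist2 = (pos[0] - bmu[0]) ** 2 + (pos[1] - bmu[1]) ** 2
                pull = rate * math.exp(-dist2 / (2 * radius ** 2))
                for i in range(dim):
                    weights[i] += pull * (vec[i] - weights[i])
    return grid


# Hypothetical columns from two unfamiliar tables.
sample = {
    "customers.city": ["Toronto", "Oshawa", "Toronto", "Ottawa"],
    "branches.location": ["Ottawa", "Toronto", "Oshawa"],
    "orders.price": [12.5, 18.0, 25.0, 31.0],
    "invoices.amount": [11.0, 27.5, 33.0],
}
_, sample_vectors = tfidf_vectors(sample)
som = train_som(sample_vectors)
for column, vector in sample_vectors.items():
    print(column, "-> map unit", best_unit(som, vector))
```

Columns that land on the same or neighboring map units are candidates for the same semantic cluster. The bucket width controls how coarsely numeric columns are compared, so in practice it would need tuning per data source.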
Pages: 155-160
Number of pages: 6