Visualization and Integration of Databases using Self-Organizing Map

被引:2
|
作者
Bourennani, Farid [1 ]
Pu, Ken Q. [1 ]
Zhu, Ying [1 ]
机构
[1] Univ Ontario, Inst Technol, Toronto, ON, Canada
关键词
SOM; Common Item Based Classifier (CIBC); Data Integration; Information Retrieval (IR);
D O I
10.1109/DBKDA.2009.30
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the growing computer networks, accessible data is becoming increasingly distributed. Understanding and integrating remote and unfamiliar data sources are important data management issues. In this paper, we propose to utilize self-organizing maps (SOM) clustering to aid with the visualization of similar columns, and integration of relational database tables and attributes based on the content. In order to accommodate heterogeneous data types found in relational databases, we extended the TFIDF measure to handle, in addition to text, numerical attribute types for coincident meaning extraction. We present a SOM clustering based visualization algorithm allowing the user to browse the heterogeneously typed database attributes and discover semantically similar clusters. Additionally, we propose a new algorithm Common Item Based Classifier (CIBC) to smoothen the homogeneity of the clusters obtained by SOM. The discovered semantic clusters can significantly aid in manual or automated constructions of data integrity constraints in data cleaning or schema mappings in data integration.
引用
收藏
页码:155 / 160
页数:6
相关论文
共 50 条
  • [21] Expanding Self-Organizing Map for data visualization and cluster analysis
    Jin, HD
    Shum, WH
    Leung, KS
    Wong, ML
    INFORMATION SCIENCES, 2004, 163 (1-3) : 157 - 173
  • [22] Fusion of self-organizing map and granular self-organizing map for microblog summarization
    Naveen Saini
    Sriparna Saha
    Sahil Mansoori
    Pushpak Bhattacharyya
    Soft Computing, 2020, 24 : 18699 - 18711
  • [23] Visualization and clustering of categorical data with probabilistic self-organizing map
    Lebbah, Mustapha
    Benabdeslem, Khalid
    NEURAL COMPUTING & APPLICATIONS, 2010, 19 (03): : 393 - 404
  • [24] A self-organizing map with expanding force for data clustering and visualization
    Shum, WH
    Jin, HD
    Leung, KS
    Wong, ML
    2002 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2002, : 434 - 441
  • [25] Fusion of self-organizing map and granular self-organizing map for microblog summarization
    Saini, Naveen
    Saha, Sriparna
    Mansoori, Sahil
    Bhattacharyya, Pushpak
    SOFT COMPUTING, 2020, 24 (24) : 18699 - 18711
  • [26] Self-Organizing Map and clustering algorithms for the analysis of occupational accident databases
    Palamara, Federica
    Piglione, Federico
    Piccinini, Norberto
    SAFETY SCIENCE, 2011, 49 (8-9) : 1215 - 1230
  • [27] Clustering method using self-organizing map
    Endo, M
    Ueno, M
    Tanabe, T
    Yamamoto, M
    NEURAL NETWORKS FOR SIGNAL PROCESSING X, VOLS 1 AND 2, PROCEEDINGS, 2000, : 261 - 270
  • [28] Word Learning Using a Self-Organizing Map
    Li, Lishu
    Chen, Qinghua
    Cui, Jiaxin
    Fang, Fukang
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL II, PROCEEDINGS, 2008, : 336 - +
  • [29] Clustering method using self-organizing map
    Endo, Masahiro
    Ueno, Masahiro
    Tanabe, Takaya
    Yamamoto, Manabu
    Neural Networks for Signal Processing - Proceedings of the IEEE Workshop, 2000, 1 : 261 - 270
  • [30] Clustering, Noise Reduction and Visualization Using Features Extracted from the Self-Organizing Map
    Brito da Silva, Leonardo Enzo
    Ferreira Costa, Jose Alfredo
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2013, 2013, 8206 : 242 - 251