Implementation of Projected Clustering based on SQL queries and UDFs in Relational Databases

被引:0
|
作者
Harikumar, Sandhya [1 ]
Haripriya, H. [1 ]
Kaimal, M. R. [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Dept Comp Sci & Engn, Kollam, Kerala, India
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Projected clustering is one of the clustering approaches that determine the clusters in the subspaces of high dimensional data. Although it is possible to efficiently cluster a very large data set outside a relational database, the time and effort to export and import it can be significant. In commercial RDBMSs, there is no SQL query available for any type of subspace clustering, which is more suitable for large databases with high dimensions and large number of records. Integrating clustering with a relational DBMS using SQL is an important and challenging problem in todays world of Big Data. Projected clustering has the ability to find the closely correlated dimensions and find clusters in the corresponding subspaces. We have designed an SQL version of projected clustering which helps to get the clusters of the records in the database using a single SQL statement which in itself calls other SQL functions defined by us. We have used PostgreSQL DBMS to validate our implementation and have done experimentation with synthetic as well as real data.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 50 条
  • [41] Semantic-distance based evaluation of ranking queries over relational databases
    Liang Zhu
    Qin Ma
    Chunnian Liu
    Guojun Mao
    Wenzhu Yang
    [J]. Journal of Intelligent Information Systems, 2010, 35 : 415 - 445
  • [42] Making SQL Queries Correct on Incomplete Databases: A Feasibility Study
    Guagliardo, Paolo
    Libkin, Leonid
    [J]. PODS'16: PROCEEDINGS OF THE 35TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2016, : 211 - 223
  • [43] Ontop: Answering SPARQL Queries over Relational Databases
    Calvanese, Diego
    Cogrel, Benjamin
    Komla-Ebri, Sarah
    Kontchakov, Roman
    Lanti, Davide
    Rezk, Martin
    Rodriguez-Muro, Mariano
    Xiao, Guohui
    [J]. SEMANTIC WEB, 2017, 8 (03) : 471 - 487
  • [44] A method of intelligent search to answer queries in relational databases
    Dutta, Ashit Kumar
    Biswas, Ranjit
    Ansari, Abdul Quaiyum
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2007, 10 (06): : 781 - 798
  • [45] Empirical analysis of the impact of queries on watermarked relational databases
    Olliaro, Martina
    Gort, Maikel Lazaro Perez
    Cortesi, Agostino
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 204
  • [46] Supporting top-kjoin queries in relational databases
    Ihab F. Ilyas
    Walid G. Aref
    Ahmed K. Elmagarmid
    [J]. The VLDB Journal, 2004, 13 : 207 - 221
  • [47] A PRACTICAL METHOD FOR IMPLEMENTING FUZZY QUERIES FOR RELATIONAL DATABASES
    Rybanov, Alexander Aleksandrovich
    [J]. MATHEMATICS AND INFORMATICS, 2022, 65 (04): : 379 - 392
  • [48] Trunk processing queries to deductive databases in their relational representation
    Natl Acad of Sciences of Ukraine, Kiev, Ukraine
    [J]. Eng Simul, 4 (479-489):
  • [49] Energy Modeling and Plan Evaluation for Queries in Relational Databases
    Guo B.
    Yu J.
    Yang D.
    Liao B.
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (04): : 810 - 824
  • [50] A method of vague search to answer queries in relational databases
    Rajpal, Smita
    Doja, M. N.
    Biswas, Ranjit
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2007, 10 (06): : 865 - 880