Mining the SDSS Sky Server SQL Queries Log

Cited by: 0
Authors
Hirota, Vitor Makiyama [1]
Santos, Rafael [1]
Raddick, Jordan [2]
Thakar, Ani [2]
Affiliations
[1] Natl Inst Space Res, Ave Astronautas 1758, Sao Paulo, Brazil
[2] Johns Hopkins Univ, 3400 N Charles St, Baltimore, MD USA
Source
NEXT-GENERATION ANALYST IV | 2016 / Vol. 9851
Keywords
Text Mining; SQL; Web Logs
DOI
10.1117/12.2224237
Chinese Library Classification
O43 [Optics];
Subject classification codes
070207; 0803;
Abstract
SkyServer, the Internet portal for the Sloan Digital Sky Survey (SDSS) astronomical catalog, provides a set of tools that give astronomers access to the data and support science education. One of SkyServer's data access interfaces allows users to enter ad hoc SQL statements to query the catalog. SkyServer also presents template queries that can be used as a basis for more complex queries. This interface has logged over 330 million queries submitted since 2001. Analysis of this data is expected to help investigate usage patterns, identify potential new classes of queries, and find similar queries. It should also shed light on how users interact with the SDSS data and how scientists have adopted the new paradigm of e-Science, which could in turn lead to enhancements to the user interfaces and the user experience in general. In this paper we review some approaches to SQL query mining, apply the traditional techniques used in the literature, and present lessons learned: the general text mining approach for feature extraction and clustering does not seem to be adequate for this type of data and, most importantly, this type of analysis can result in very different queries being clustered together.
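One way to see how a general text mining representation can place very different queries together is the minimal sketch below. It is an illustration under stated assumptions, not the method used in the paper: the tokenizer, the bag-of-words counts, and the three example queries are all hypothetical and were written for this sketch rather than taken from the SkyServer log.

```python
import math
import re
from collections import Counter


def tokenize(sql):
    """Lowercase the query and keep identifiers, keywords, and numbers.

    Operators and punctuation are dropped, as a simple text tokenizer would do.
    """
    return re.findall(r"[a-z_][a-z0-9_]*|\d+(?:\.\d+)?", sql.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical queries for illustration only (not from the real log).
# q1 and q2 select opposite regions of the sky; q3 is a simple count.
q1 = "SELECT objID, ra, dec FROM PhotoObj WHERE ra > 180 AND dec < 0"
q2 = "SELECT objID, ra, dec FROM PhotoObj WHERE ra < 180 AND dec > 0"
q3 = "SELECT COUNT(*) FROM SpecObj"

v1, v2, v3 = (Counter(tokenize(q)) for q in (q1, q2, q3))

sim_12 = cosine(v1, v2)  # identical token counts despite opposite predicates
sim_13 = cosine(v1, v3)  # only SELECT and FROM are shared

print(f"q1 vs q2: {sim_12:.2f}")
print(f"q1 vs q3: {sim_13:.2f}")
```

In this sketch q1 and q2 produce the same token counts because the comparison operators are discarded, so any clustering built on this representation cannot tell them apart even though they return different rows. A representation that keeps operators or parses the query structure would separate them, at the cost of a more involved feature extraction step.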
Pages: 13