Scalable classification over SQL databases

被引:8
|
作者
Chaudhuri, S [1 ]
Fayyad, U [1 ]
Bernhardt, J [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
关键词
D O I
10.1109/ICDE.1999.754963
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We identify data-intensive operations that are common to classifiers and develop a middleware that decomposes and schedules these operations efficiently using a backend SQL database. Our approach has the added advantage of not requiring any specialized physical data organization. We demonstrate the scalability characteristics of our enhanced client with experiments on Microsoft Set Sewer 7.0 by varying data size, number of attributes and characteristics of decision trees.
引用
收藏
页码:470 / 479
页数:10
相关论文
共 50 条
  • [41] WMS Performance of Selected SQL and NoSQL Databases
    Schmid, Stephan
    Galicz, Eszter
    Reinhardt, Wolfgang
    [J]. INTERNATIONAL CONFERENCE ON MILITARY TECHNOLOGIES (ICMT 2015), 2015, : 311 - 316
  • [42] Comparative Analysis of performance for SQL and NoSQL Databases
    Diaz Erazo, Amparo Daniela
    Morales Morales, Mario Raul
    Pineda Chavez, Veronica Karina
    Morales Cardoso, Santiago Leonardo
    [J]. 2022 17TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2022,
  • [43] THE IMPACT OF SQL INJECTION ATTACKS ON THE SECURITY OF DATABASES
    Thiyab, Rua Mohamed
    Ali, Musab A. M.
    Basil, Farooq
    Abdulqader
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON COMPUTING AND INFORMATICS: EMBRACING ECO-FRIENDLY COMPUTING, 2017, : 323 - 331
  • [44] GQL: A reasonable complex SQL for genomic databases
    Jamil, HM
    [J]. IEEE INTERNATIONAL SYMPOSIUM ON BIO-INFORMATICS AND BIOMEDICAL ENGINEERING, PROCEEDINGS, 2000, : 50 - 59
  • [45] Continuous Deployment and Schema Evolution in SQL Databases
    de Jong, Michael
    van Deursen, Arie
    [J]. 2015 IEEE/ACM 3RD INTERNATIONAL WORKSHOP ON RELEASE ENGINEERING, 2015, : 16 - 19
  • [46] Scalable Multi-Query Optimization for Exploratory Queries over Federated Scientific Databases
    Kementsietsidis, Anastasios
    Neven, Frank
    Van de Craen, Dieter
    Vansummeren, Stijn
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 16 - 27
  • [47] A Comparison of NoSQL and SQL Databases over the Hadoop and Spark Cloud Platforms using Machine Learning Algorithms
    Lee, Chao-Hsien
    Shih, Zhe-Wei
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2018,
  • [48] Bringing SQL databases to key-based NoSQL databases: a canonical approach
    Geomar A. Schreiner
    Denio Duarte
    Ronaldo dos Santos Mello
    [J]. Computing, 2020, 102 : 221 - 246
  • [49] Bringing SQL databases to key-based NoSQL databases: a canonical approach
    Schreiner, Geomar A.
    Duarte, Denio
    Mello, Ronaldo dos Santos
    [J]. COMPUTING, 2020, 102 (01) : 221 - 246
  • [50] Yesquel: scalable SQL storage for Web applications
    Aguilera, Marcos K.
    Leners, Joshua B.
    Kotla, Ramakrishna
    Walfish, Michael
    [J]. PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, 2015,