Scaling access to heterogeneous data sources with disco

被引:79
|
作者
Tomasic, A [1 ]
Raschid, L
Valduriez, P
机构
[1] Inst Natl Rech Informat & Automat, F-78153 Le Chesnay, France
[2] Univ Maryland, Maryland Business Sch, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
heterogeneous database; query reformulation; source capability; heterogeneous cost model; partial answer; partial evaluation;
D O I
10.1109/69.729736
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accessing many data sources aggravates problems for users of heterogeneous distributed databases. Database administrators must deal with fragile mediators, that is, mediators with schemas and views that must be significantly changed to incorporate a new data source. When implementing translators of queries from mediators to data sources, database implementers must deal with data sources that do not support all the functionality required by mediators. Application programmers must deal with graceless failures for unavailable data sources. Queries simply return failure and no further information when data sources are unavailable for query processing. The Distributed information Search COmponent (Disco) addresses these problems. Data modeling techniques manage the connections to data sources, and sources can be added transparently to the users and applications. The interface between mediators and data sources flexibly handles different query languages and different data source functionality. Query rewriting and optimization techniques rewrite queries so they are efficiently evaluated by sources. Query processing and evaluation semantics are developed to process queries over unavailable data sources. In this article. we describe 1) the distributed mediator architecture of Disco; 2) the data model and its modeling of data source connections; 3) the interface to underlying data sources and the query rewriting process; and 4) query processing semantics. We describe several advantages of our system.
引用
收藏
页码:808 / 823
页数:16
相关论文
共 50 条
  • [21] Model Performance Scaling with Multiple Data Sources
    Hashimoto, Tatsunori
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [22] SOURCES OF UNCERTAINTY IN THE RELATIVE SCALING OF SPECTROSCOPIC DATA
    BARIBAUD, T
    SALAMANCA, I
    ALLOIN, D
    WAGNER, S
    ASTRONOMY & ASTROPHYSICS SUPPLEMENT SERIES, 1994, 103 (01): : 121 - 128
  • [23] HDSAnalytics: A Data Analytics Framework for Heterogeneous Data Sources
    Jaybal, Yogalakshmi
    Ramanathan, Chandrashekar
    Rajagopalan, S.
    PROCEEDINGS OF THE ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MANAGEMENT OF DATA (CODS-COMAD'18), 2018, : 11 - 19
  • [24] An Intelligent Data Service Framework for Heterogeneous Data Sources
    Khan, Fakhri Alam
    Rehman, Mujeeb Ur
    Khalid, Afsheen
    Ali, Muhammad
    Imran, Muhammad
    Nawaz, Muhammad
    Rahman, Attaur
    JOURNAL OF GRID COMPUTING, 2019, 17 (03) : 577 - 589
  • [25] Implementing big data lake for heterogeneous data sources
    Mehmood, Hassan
    Gilman, Ekaterina
    Cortes, Marta
    Kostakos, Panos
    Byrne, Andrew
    Valta, Katerina
    Tekes, Stavros
    Riekki, Jukka
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2019), 2019, : 37 - 44
  • [26] Managing Evolution of Heterogeneous Data Sources of a Data Warehouse
    Solodovnikova, Darja
    Niedrite, Laila
    Svilpe, Lauma
    PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS (ICEIS 2021), VOL 1, 2021, : 105 - 117
  • [27] An Intelligent Data Service Framework for Heterogeneous Data Sources
    Fakhri Alam Khan
    Mujeeb ur Rehman
    Afsheen Khalid
    Muhammad Ali
    Muhammad Imran
    Muhammad Nawaz
    Attaur Rahman
    Journal of Grid Computing, 2019, 17 : 577 - 589
  • [28] A data access scheme of heterogeneous data resource in grid
    Liu, Gui
    Zhu, Hongyu
    Xie, Xianghui
    Lu, Xiaoliang
    Lu, Linsheng
    SIXTH INTERNATIONAL CONFERENCE ON GRID AND COOPERATIVE COMPUTING, PROCEEDINGS, 2007, : 160 - +
  • [29] Data sources and access for comparative analyses
    Verma, V
    INFORMATION DISSEMINATION AND ACCESS IN RUSSIA AND EASTERN EUROPE: PROBLEMS AND SOLUTIONS IN EAST AND WEST, 1998, 26 : 44 - 54
  • [30] The Sources of Public Access to Personal Data
    Alvarado Avalos, Francisco
    REVISTA CHILENA DE DERECHO Y TECNOLOGIA, 2014, 3 (02): : 205 - 226