Agora: Bringing Together Datasets, Algorithms, Models and More in a Unified Ecosystem [Vision]

被引:10
|
作者
Traub J. [1 ]
Kaoudi Z. [1 ]
Quiané-Ruiz J.-A. [1 ]
Markl V. [1 ]
机构
[1] Technische Universität Berlin, Berlin
来源
SIGMOD Record | 2020年 / 49卷 / 04期
关键词
26;
D O I
10.1145/3456859.3456861
中图分类号
学科分类号
摘要
Data science and artificial intelligence are driven by a plethora of diverse data-related assets, including datasets, data streams, algorithms, processing software, compute resources, and domain knowledge. As providing all these assets requires a huge investment, data science and artificial intelligence technologies are currently dominated by a small number of providers who can afford these investments. This leads to lock-in effects and hinders features that require a flexible exchange of assets among users. In this paper, we introduce Agora, our vision towards a unified ecosystem that brings together data, algorithms, models, and computational resources and provides them to a broad audience. Agora (i) treats assets as first-class citizens and leverages a fine-grained exchange of assets, (ii) allows for combining assets to novel applications, and (iii) flexibly executes such applications on available resources. As a result, it enables easy creation and composition of data science pipelines as well as their scalable execution. In contrast to existing data management systems, Agora operates in a heavily decentralized and dynamic environment: Data, algorithms, and even compute resources are dynamically created, modified, and removed by different stakeholders. Agora presents novel research directions for the data management community as a whole: It requires to combine our traditional expertise in scalable data processing and management with infrastructure provisioning as well as economic and application aspects of data, algorithms, and infrastructure. © 2021 Copyright is held by the owner/author(s).
引用
收藏
页码:6 / 11
页数:5
相关论文
共 2 条