Big Metadata: When Metadata is Big Data

被引:5
|
作者
Edara, Pavan [1 ]
Pasumansky, Mosha [1 ]
机构
[1] Google LLC, Mountain View, CA 94043 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2021年 / 14卷 / 12期
关键词
DREMEL;
D O I
10.14778/3476311.3476385
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid emergence of cloud data warehouses like Google Big-Query has redefined the landscape of data analytics. With the growth of data volumes, such systems need to scale to hundreds of EiB of data in the near future. This growth is accompanied by an increase in the number of objects stored and the amount of metadata such systems must manage. Traditionally, Big Data systems have tried to reduce the amount of metadata in order to scale the system, often compromising query performance. In Google BigQuery, we built a metadata management system that demonstrates that massive scale can be achieved without such tradeoffs. We recognized the benefits that fine grained metadata provides for query processing and we built a metadata system to manage it effectively. We use the same distributed query processing and data management techniques that we use for managing data to handle Big metadata. Today, BigQuery uses these techniques to support queries over billions of objects and their metadata.
引用
收藏
页码:3083 / 3095
页数:13
相关论文
共 50 条
  • [1] Data, Big Data, and Metadata in Anesthesiology
    Levin, Matthew A.
    Wanderer, Jonathan P.
    Ehrenfeld, Jesse M.
    [J]. ANESTHESIA AND ANALGESIA, 2015, 121 (06): : 1661 - 1667
  • [3] Metadata management in a big data infrastructure
    Holom, Roxana-Maria
    Rafetseder, Katharina
    Kritzinger, Stefanie
    Sehrschoen, Herald
    [J]. INTERNATIONAL CONFERENCE ON INDUSTRY 4.0 AND SMART MANUFACTURING (ISM 2019), 2020, 42 : 375 - 382
  • [4] Metadata handling for Big Data projects
    Golosova, M.
    Aulov, V
    Kaida, A.
    [J]. BIGDATA CONFERENCE (FORMERLY INTERNATIONAL CONFERENCE ON BIG DATA AND ITS APPLICATIONS), 2018, 1117
  • [5] Big Metadata,Smart Metadata,and Metadata Capital:Toward Greater Synergy Between Data Science and Metadata
    Jane Greenberg
    [J]. JournalofDataandInformationScience., 2017, 2 (03) - 36
  • [6] An Improved Metadata Model for Big Data Processing in Cloud Data Centers
    Mir, Nader F.
    Marreddy, Navyatha
    Nigam, Prita
    [J]. PROCEEDINGS 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2017, : 1417 - 1420
  • [7] OAMS: A Highly Reliable Metadata Service for Big Data Storage
    Zhou, Jiang
    Guo, Jing
    Wang, Weiping
    Du, Cuilan
    Gu, Xiaoyan
    Meng, Dan
    [J]. 2013 IEEE 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE 2013), 2013, : 1287 - 1294
  • [8] Small values in big data: The continuing need for appropriate metadata
    Stow, Craig A.
    Webster, Katherine E.
    Wagner, Tyler
    Lottig, Noah
    Soranno, Patricia A.
    Cha, YoonKyung
    [J]. ECOLOGICAL INFORMATICS, 2018, 45 : 26 - 30
  • [9] An Efficient and Metadata-Aware Big Data Storage Architecture
    Jin, Rize
    Paik, Joon-Young
    Biadgie, Yenewondim
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2020, 2020, 12115 : 146 - 152
  • [10] ELECTRONIC HEALTH RECORDS DATA AND METADATA: Challenges for Big Data in the United States
    Sweet, Lauren E.
    Moulaison, Heather Lea
    [J]. BIG DATA, 2013, 1 (04) : BD245 - BD251