Data streams and data synopses for massive data sets (Invited talk)

被引:0
|
作者
Matias, Y [1 ]
机构
[1] Tel Aviv Univ, IL-69978 Tel Aviv, Israel
[2] Stanford Univ, HyperRoll Inc, Stanford, CA 94305 USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the proliferation of data intensive applications, it has become necessary to develop new techniques to handle massive data sets. Traditional algorithmic techniques and data structures are not always suitable to handle the amount of data that is required and the fact that the data often streams by and cannot be accessed again. A field of research established over the past decade is that of handling massive data sets using data synopses, and developing algorithmic techniques for data stream models. We will discuss some of the research work that has been done in the field, and provide a decades' perspective to data synopses and data streams.
引用
收藏
页码:8 / 9
页数:2
相关论文
共 50 条
  • [1] Data streams and data synopses for massive data sets (Invited talk)
    Matias, Y
    [J]. KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 8 - 9
  • [2] Querying Distributed Data Streams (Invited Keynote Talk)
    Garofalakis, Minos
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS (ADBIS 2014), 2014, 8716 : 1 - 10
  • [3] Scaling clustering algorithms for massive data sets using data streams
    Nittel, S
    Leung, KT
    Braverman, A
    [J]. 20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 830 - 830
  • [4] Randomized synopses for query assurance on data streams
    Yi, Ke
    Li, Feifei
    Hadjieleftheriou, Marios
    Kollios, George
    Srivastava, Divesh
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 416 - +
  • [5] Wavelet-based amnesic synopses for data streams
    Chen, Huahui
    Shi, Baile
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2009, 46 (02): : 268 - 279
  • [6] Synopses for Massive Data: Samples, Histograms, Wavelets, Sketches
    Cormode, Graham
    Garofalakis, Minos
    Haas, Peter J.
    Jermaine, Chris
    [J]. FOUNDATIONS AND TRENDS IN DATABASES, 2011, 4 (1-3): : 1 - 294
  • [7] RDF as a data model (Invited talk)
    Gutierrez, Claudio
    [J]. Fourth IFIP International Conference on Theoretical Computer Science - TCS 2006, 2006, 209 : 7 - 7
  • [8] Massive data sets, data mining, and decision support
    Dalal, S
    Dumais, S
    Kettenring, J
    Kurien, V
    McIntosh, A
    Maitra, R
    [J]. MINING AND MODELING MASSIVE DATA SETS IN SCIENCE, ENGINEERING, AND BUSINESS WITH A SUBTHEME IN ENVIRONMENTAL STATISTICS, 1997, 29 (01): : 329 - 329
  • [9] Parallelizing clustering of geoscientific data sets using data streams
    Nittel, S
    Leung, KT
    [J]. 16TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2004, : 73 - 84
  • [10] Scalable Splitting of Massive Data Streams
    Zeitler, Erik
    Risch, Tore
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 184 - 198