Integrating data warehouses with web data:: A survey

被引:58
|
作者
Manuel Perez, Juan [1 ]
Berlanga, Rafael [1 ]
Jose Aramburu, Maria [2 ]
Pedersen, Torben Bach [3 ]
机构
[1] Univ Jaume 1, Dept Lenguajes & Sistemas Informat, E-12071 Castellon de La Plana, Spain
[2] Univ Jaume 1, Dept Ingn & Ciencia Computadores, E-12071 Castellon de La Plana, Spain
[3] Aalborg Univ, Dept Comp Sci, DK-9220 Aalborg O, Denmark
关键词
data warehouse repository; XML/XSL/RDF;
D O I
10.1109/TKDE.2007.190746
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper surveys the most relevant research on combining Data Warehouse (DW) and Web data. It studies the XML technologies that are currently being used to integrate, store, query, and retrieve Web data and their application to DWs. The paper reviews different DW distributed architectures and the use of XML languages as an integration tool in these systems. It also introduces the problem of dealing with semistructured data in a DW. It studies Web data repositories, the design of multidimensional databases for XML data sources, and the XML extensions of OnLine Analytical Processing techniques. The paper addresses the application of information retrieval technology in a DW to exploit text-rich document collections. The authors hope that the paper will help to discover the main limitations and opportunities that offer the combination of the DW and the Web fields, as well as to identify open research lines.
引用
收藏
页码:940 / 955
页数:16
相关论文
共 50 条
  • [41] Designing data warehouses
    Theodoratos, D
    Sellis, T
    [J]. DATA & KNOWLEDGE ENGINEERING, 1999, 31 (03) : 279 - 301
  • [42] Integrating brazilian health information systems in order to support the building of data warehouses
    Freire, Sergio Miranda
    Souza, Rômulo Cristovão De
    de Almeida, Rosimary Terezinha
    [J]. Revista Brasileira de Engenharia Biomedica, 2015, 31 (03): : 196 - 207
  • [43] Conceptual design and implementation of spatial data warehouses integrating regular grids of points
    Bimonte, Sandro
    Zaamoune, Mehdi
    Beaune, Philippe
    [J]. INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2017, 10 (09) : 901 - 922
  • [44] Integrating Brazilian health information systems in order to support the building of data warehouses
    Universidade do Estado do Rio de Janeiro – UERJ, Rio de Janeiro
    RJ, Brazil
    不详
    RJ
    CEP 20550-170, Brazil
    不详
    RJ, Brazil
    [J]. Res. Biomed. Eng, 3 (196-207):
  • [45] Data warehouses make data profitable and useful
    [J]. Imaging Mag, 5 (86):
  • [46] The medical data in the knowledge : warehouses and searches of data
    Garcelon, N.
    [J]. ANNALES DE DERMATOLOGIE ET DE VENEREOLOGIE, 2015, 142 (12): : S389 - S390
  • [47] Data Warehouses Federation as a Single Data Warehouse
    Kern, Rafal
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2016, PT I, 2016, 9875 : 356 - 366
  • [48] Survey on mining subjective data on the web
    Mikalai Tsytsarau
    Themis Palpanas
    [J]. Data Mining and Knowledge Discovery, 2012, 24 : 478 - 514
  • [49] Survey on mining subjective data on the web
    Tsytsarau, Mikalai
    Palpanas, Themis
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 24 (03) : 478 - 514
  • [50] A survey of Deep Web data integration
    Liu, Wei
    Meng, Xiao-Feng
    Meng, Wei-Yi
    [J]. Jisuanji Xuebao/Chinese Journal of Computers, 2007, 30 (09): : 1475 - 1489