Towards a Semantic Extract-Transform-Load (ETL) framework for Big Data Integration

被引:53
|
作者
Bansal, Srividya K. [1 ]
机构
[1] Arizona State Univ, Dept Engn & Comp Syst, Mesa, AZ 85212 USA
关键词
Big data; Data integration; Ontology; Semantic technolgies; DESIGN;
D O I
10.1109/BigData.Congress.2014.82
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big Data has become the new ubiquitous term used to describe massive collection of datasets that are difficult to process using traditional database and software techniques. Most of this data is inaccessible to users, as we need technology and tools to find, transform, analyze, and visualize data in order to make it consumable for decision-making. One aspect of Big Data research is dealing with the Variety of data that includes various formats such as structured, numeric, unstructured text data, email, video, audio, stock ticker, etc. Managing, merging, and governing a variety of data is the focus of this paper. This paper proposes a semantic Extract-Transform-Load (ETL) framework that uses semantic technologies to integrate and publish data from multiple sources as open linked data. This includes - creation of a semantic data model to provide a basis for integration and understanding of knowledge from multiple sources; creation of a distributed Web of data using Resource Description Framework (RDF) as the graph data model; extraction of useful knowledge and information from the combined data using SPARQL as the semantic query language.
引用
收藏
页码:521 / 528
页数:8
相关论文
共 50 条
  • [1] Integrating Big Data: A Semantic Extract-Transform-Load Framework
    Bansal, Srividya K.
    Kagemann, Sebastian
    [J]. COMPUTER, 2015, 48 (03) : 42 - 50
  • [2] SETL: A programmable semantic extract-transform-load framework for semantic data warehouses
    Deb Nath, Rudra Pratap
    Hose, Katja
    Pedersen, Torben Bach
    Romero, Oscar
    [J]. INFORMATION SYSTEMS, 2017, 68 : 17 - 43
  • [3] SAT-ETL-Integrator: an extract-transform-load software for satellite big data ingestion
    Boudriki Semlali, Badr-Eddine
    El Amrani, Chaker
    Ortiz, Guadalupe
    [J]. JOURNAL OF APPLIED REMOTE SENSING, 2020, 14 (01)
  • [4] Research on data center construction based on extract-transform-load (ETL)
    Cai, Li
    Su, Jianying
    [J]. AGRO FOOD INDUSTRY HI-TECH, 2017, 28 (03): : 947 - 949
  • [5] Bringing Business Objects into Extract-Transform-Load (ETL) Technology
    Morris, Huong
    Liao, Hui
    Padmanabhan, Sriram
    Srinivasan, Sriram
    Lau, Phay
    Shan, Jing
    Wisnesky, Ryan
    [J]. PROCEEDINGS OF THE ICEBE 2008: IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING, 2008, : 709 - 714
  • [6] Data discovery method for Extract-Transform-Load
    Madhikermi, Manik
    Framling, Kary
    [J]. 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON MECHANICAL AND INTELLIGENT MANUFACTURING TECHNOLOGIES (ICMIMT 2019), 2019, : 174 - 181
  • [7] Data Model Logger - Data Discovery for Extract-Transform-Load
    Madhikermi, Manik
    Buda, Andrea
    Dave, Bhargav
    Framling, Kary
    [J]. 2017 19TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS (HPCC) / 2017 15TH IEEE INTERNATIONAL CONFERENCE ON SMART CITY (SMARTCITY) / 2017 3RD IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (DSS), 2017, : 629 - 630
  • [8] Extract-Transform-Load for Video Streams
    Kossmann, Ferdi
    Wu, Ziniu
    Lai, Eugenie
    Tatbul, Nesime
    Cao, Lei
    Kraska, Tim
    Madden, Sam
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (09): : 2302 - 2315
  • [9] Testing Extract-Transform-Load Process in Data Warehouse Systems
    Homayouni, Hajar
    [J]. 2018 29TH IEEE INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS (ISSREW), 2018, : 158 - 161
  • [10] A Survey of Extract-Transform-Load Technology
    Vassiliadis, Panos
    [J]. INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2009, 5 (03) : 1 - 27