Ontology Augmented Data Lake System for Policy Support

被引:1
|
作者
Kulkarni, Apurva [1 ]
Bassin, Pooja [1 ]
Parasa, Niharika Sri [1 ]
Venugopal, Vinu E. [1 ]
Srinivasa, Srinath [1 ]
Ramanathan, Chandrashekar [1 ]
机构
[1] Int Inst Informat Technol, 26-C Elect City Phase 1, Bangalore, Karnataka, India
来源
BIG DATA ANALYTICS IN ASTRONOMY, SCIENCE, AND ENGINEERING, BDA 2022 | 2023年 / 13830卷
关键词
Big data; Ontology; Document retrieval; Data lake; Data analyses; Policy support system; Bayesian network; MANAGEMENT;
D O I
10.1007/978-3-031-28350-5_1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Analytics of Big Data in the absence of an accompanying framework of metadata can be a quite daunting task. While it is true that statistical algorithms can do large-scale analyses on diverse data with little support from metadata, using such methods on widely dispersed, extremely diverse, and dynamic data may not necessarily produce trustworthy findings. One such task is identifying the impact of indicators for various Sustainable Development Goals (SDGs). One of the methods to analyze impact is by developing a Bayesian network for the policymaker to make informed decisions under uncertainty. It is of key interest to policy-makers worldwide to rely on such models to decide the new policies of a state or a country (https://sdgs.un.org/2030agenda). The accuracy of the models can be improved by considering enriched data - often done by incorporating pertinent data from multiple sources. However, due to the challenges associated with volume, variety, veracity, and the structure of the data, traditional data lake systems fall short of identifying information that is syntactically diverse yet semantically connected. In this paper, we propose a Data Lake (DL) framework that targets ingesting & processing of data like any traditional DL, and in addition, is capable of performing data retrieval for applications such as Policy Support Systems (where the selection of data greatly affect the output interpretations) by using ontologies as the intermediary. We discuss the proof of concept for the proposed system and the preliminary results (IIITB Data Lake project Website link: http://cads.iiitb.ac.in/wordpress/) based on the data collected from the agriculture department of the Government of Karnataka (GoK).
引用
收藏
页码:3 / 16
页数:14
相关论文
共 50 条
  • [41] Open Data Lake to Support Machine Learning on Arctic Big Data
    Olawoyin, Anifat M.
    Leung, Carson K.
    Cuzzocrea, Alfredo
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5215 - 5224
  • [42] Augmented Ontology by Handshaking with Machine Learning
    Kim, Marie
    Kang, Hyunjoong
    Kwon, Soonhyun
    Lee, YongJoon
    Kim, Kwihoon
    Pyo, Cheol Sik
    2017 19TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATIONS TECHNOLOGY (ICACT) - OPENING NEW ERA OF SMART SOCIETY, 2017, : 740 - 743
  • [43] Towards a Usable Ontology: The Identification of Quality Characteristics for an Ontology-Driven Decision Support System
    Wilson, Shyama, I
    Goonetillake, Jeevani S.
    Ginige, Athula
    Walisadeera, Anusha, I
    IEEE ACCESS, 2022, 10 : 12889 - 12912
  • [44] Data-driven decision support system for building stocks energy retrofit policy
    Re Cecconi, Fulvio
    Khodabakhshian, Ania
    Rampini, Luca
    Journal of Building Engineering, 2022, 54
  • [45] Data-driven decision support system for building stocks energy retrofit policy
    Cecconi, Fulvio Re
    Khodabakhshian, Ania
    Rampini, Luca
    JOURNAL OF BUILDING ENGINEERING, 2022, 54
  • [46] Integrating Business Policy into System Engineering Processes using Ontology
    Azzam, Said Rabah
    Ayodele, Taiwo
    Vipoopinyo, Jarupa
    Zhou, Shikun
    JCPC: 2009 JOINT CONFERENCE ON PERVASIVE COMPUTING, 2009, : 331 - 334
  • [47] Data integration for research and innovation policy: an Ontology-Based Data Management approach
    Cinzia Daraio
    Maurizio Lenzerini
    Claudio Leporelli
    Henk F. Moed
    Paolo Naggar
    Andrea Bonaccorsi
    Alessandro Bartolucci
    Scientometrics, 2016, 106 : 857 - 871
  • [48] Data integration for research and innovation policy: an Ontology-Based Data Management approach
    Daraio, Cinzia
    Lenzerini, Maurizio
    Leporelli, Claudio
    Moed, Henk F.
    Naggar, Paolo
    Bonaccorsi, Andrea
    Bartolucci, Alessandro
    SCIENTOMETRICS, 2016, 106 (02) : 857 - 871
  • [49] SCIENCE POLICY - A SUPPORT SYSTEM IN EVOLUTION
    不详
    NATURE, 1986, 324 (6095) : 330 - 330
  • [50] Ontology Based Clinical Decision Support System for Diabetes Diagnostic
    Alharbi, Reham Faisal
    Berri, Jawad
    El-Masri, Samir
    2015 SCIENCE AND INFORMATION CONFERENCE (SAI), 2015, : 597 - 602