A Methodology for Aligning Categories from Open Government Data Portals to a Comprehensive Set of Categories

被引:0
|
作者
Pinto, Higor [1 ]
Barcellos, Raissa [1 ]
Bernardini, Flavia [1 ]
Viterbo, Jose [1 ]
机构
[1] Fluminense Fed Univ, Inst Comp, Niteroi, RJ, Brazil
来源
关键词
Open Government Data; Open data categories; Categories alignment; Open data integration;
D O I
10.1007/978-3-031-15086-9_17
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, we can find Open Government Data Portals (OGDPs) from different governmental organizations. The increasing number of OGDPs brings opportunities for integrating these data. The vast majority of OGDPs distribute the data into different categories, and each portal uses its own set of categories. Therefore, due to the lack of homogeneous guidelines in data management, interconnecting data is flawed. Data integration is crucial in improving the functioning and planning of open government data portals. One possibility for integrating data from different portals is using the categories associated with datasets. Putting similar datasets in the same category can turn data integration easier. We propose a methodology for constructing a Comprehensive Set of Categories (CSC) extracted from different OGDPs and aligning different categories to this minimal and comprehensive set based on semantic similarity. We carried out an exploratory analysis on 100 portals of densely populated American cities using the categories collected in these portals in 2017. Our approach allowed us to align more than 80% of the collected portal categories to the minimal set according to more than 3 out of 6 similarity measures, which is promising for open data integration.
引用
收藏
页码:258 / 273
页数:16
相关论文
共 50 条
  • [1] Learning Categories from Linked Open Data
    Chen, Jesse Xi
    Reformat, Marek Z.
    [J]. INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS, PT III, 2014, 444 : 396 - 405
  • [2] From Conventional Open Government Data Portals to Storytelling Portals: The StoryOGD Prototype
    Chokki, Abiola Paterne
    Vanderose, Benoit
    [J]. TOGETHER IN THE UNSTABLE WORLD: DIGITAL GOVERNMENT AND SOLIDARITY, 2023, : 642 - 643
  • [3] Getting Critical Categories of a Data Set
    Jin, Cheqing
    Zhang, Yizhen
    Zhou, Aoying
    [J]. WEB-AGE INFORMATION MANAGEMENT, 2011, 6897 : 169 - +
  • [4] Open Data Portals in Africa: An Analysis of Open Government Data Initiatives
    Bello, Olayiwola
    Akinwande, Victor
    Jolayemi, Oluwatoyosi
    Ibrahim, Ahmed
    [J]. AFRICAN JOURNAL OF LIBRARY ARCHIVES AND INFORMATION SCIENCE, 2016, 26 (02): : 97 - 106
  • [5] A Methodology for Retrieving Datasets from Open Government Data Portals Using Information Retrieval and Question and Answering Techniques
    Barcellos, Raissa
    Bernardini, Flavia
    Viterbo, Jose
    [J]. ELECTRONIC GOVERNMENT (EGOV 2020), 2020, 12219 : 239 - 249
  • [6] DISCOVERING NOVEL CATEGORIES IN SAR IMAGES IN OPEN SET CONDITIONS
    Dai, Liu
    Guo, Weiwei
    Zhang, Zenghui
    Yu, Wenxian
    [J]. 2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1932 - 1935
  • [7] Classi-Fly: Inferring Aircraft Categories from Open Data
    Strohmeier, Martin
    Smith, Matthew
    Lenders, Vincent
    Martinovic, Ivan
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (06)
  • [8] An International Analysis of the Quality of Open Government Data Portals
    Saez Martin, Alejandro
    Haro De Rosario, Arturo
    Caba Perez, Maria Del Carmen
    [J]. SOCIAL SCIENCE COMPUTER REVIEW, 2016, 34 (03) : 298 - 311
  • [9] Enabling Spatial Queries in Open Government Data Portals
    de Fernandes Vasconcelos, Pedro Arthur
    Alencar, Wensttay de Sousa
    da Silva Ribeiro, Victor Hugo
    Rodrigues, Natarajan Ferreira
    Andrade, Fabio de Gomes
    [J]. ELECTRONIC GOVERNMENT AND THE INFORMATION SYSTEMS PERSPECTIVE (EGOVIS 2017), 2017, 10441 : 64 - 79
  • [10] Open government data portals in the European Union: A dataset from 2015 to 2017
    de Juana-Espinosa, Susana
    Lujan-Mora, Sergio
    [J]. DATA IN BRIEF, 2020, 29