Open Data Categorization Based on Formal Concept Analysis

被引:8
|
作者
Gligorijevic, Milena Frtunic [1 ]
Bogdanovic, Milos [1 ]
Veljkovic, Natasa [1 ]
Stoimenov, Leonid [1 ]
机构
[1] Univ Nis, Fac Elect Engn, Nish 18000, Serbia
关键词
Portals; Metadata; Government; Text categorization; Software; Formal concept analysis; Machine learning; e-government; open data; data categorization; formal concept analysis;
D O I
10.1109/TETC.2019.2919330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Government institutions have released a large number of datasets on their open data portals, which are in line with the data transparency and open government initiatives. With the purpose of making it more accessible and visible, these portals categorize datasets based on different criteria like publishers, categories, formats, and descriptions. However, some of this information is often missing, making it impossible to find datasets in all of these ways. As a result, with the number of datasets growing further on the portals, it is getting harder to obtain the desired information. This paper addresses this issue by introducing EODClassifier framework that suggests the best match for the category where a dataset should belong to. It relies on formal concept analysis as a means to generate a data structure that will reveal shared conceptualization originating from tags' usage and utilize it as a knowledge base to categorize uncategorized open datasets.
引用
收藏
页码:571 / 581
页数:11
相关论文
共 50 条
  • [1] Reductive data cube based on formal concept analysis
    Shi, Zhibin
    Huang, Houkuan
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2009, 46 (11): : 1956 - 1962
  • [2] Categorization of Multiple Documents Using Fuzzy Overlapping Clustering Based on Formal Concept Analysis
    Chen, Yi-Hui
    Lu, Eric Jui-Lin
    Cheng, Ya-Wen
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2020, 30 (05) : 631 - 647
  • [3] Data granulation and formal concept analysis
    Hashemi, RR
    De Agostino, S
    Westgeest, B
    Talburt, JR
    [J]. NAFIPS 2004: ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1AND 2: FUZZY SETS IN THE HEART OF THE CANADIAN ROCKIES, 2004, : 79 - 83
  • [4] Distributed Architecture of Data Analysis System Based on Formal Concept Analysis Approach
    Neznanov, A. A.
    Parinov, A. A.
    [J]. INTELLIGENT DISTRIBUTED COMPUTING IX, IDC'2015, 2016, 616 : 265 - 271
  • [5] OpenFCA, an open source Formal Concept Analysis toolbox
    Borza, Paul Valentin
    Sabou, Ovidiu
    Sacarea, Christian
    [J]. PROCEEDINGS OF 2010 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR 2010), VOLS. 1-3, 2010,
  • [6] Evaluation of Stream Data by Formal Concept Analysis
    Radvansky, Martin
    Sklenar, Vladimir
    Snasel, Vaclav
    [J]. NEW TRENDS IN DATABASES AND INFORMATION SYSTEMS, 2013, 185 : 131 - +
  • [7] Biclustering Numerical Data in Formal Concept Analysis
    Kaytoue, Mehdi
    Kuznetsov, Sergei O.
    Napoli, Amedeo
    [J]. FORMAL CONCEPT ANALYSIS, 2011, 6628 : 135 - 150
  • [8] A data-driven approach to constructing an ontological concept hierarchy based on the formal concept analysis
    Hwang, Suk-Hyung
    Kim, Hong-Gee
    Kim, Myeng-Ki
    Choi, Sung-Hee
    Yang, Hae-Sool
    [J]. COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2006, PT 4, 2006, 3983 : 937 - 946
  • [9] Analysis of Medical Data using Data Mining and Formal Concept Analysis
    Gupta, Anamika
    Kumar, Naveen
    Bhatnagar, Vasudha
    [J]. PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 6, 2005, : 253 - 256
  • [10] Mining association concept based on formal concept analysis
    Zhang, Zhuo
    Li, Shijun
    [J]. Journal of Computational Information Systems, 2010, 6 (03): : 783 - 792