An OLAP data model driven approach to process statistical tables

Authors
Luk, WS [1]
Leung, P [1]
Affiliations
[1] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC, Canada
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Statistical tables are an important subset of the tables published on the web because they are up-to-date, vital information sources for decision makers. These tables are often carefully designed for easy reading by analysts and then mechanically produced by an OLAP database system. The general practice of extracting attribute-value pairs from statistical tables does not ensure high accuracy when the extracted pairs are used as a database for an information retrieval system. In this paper, we show how a human may visualize a statistical table as a multidimensional object, defined by a suitably modified OLAP model. In this way, the keywords are classified into semantically distinct groups, i.e., dimension hierarchies, without any ontological knowledge or recourse to machine learning. A prototype system that mimics human reasoning for table processing has been implemented. Experiments on 150 randomly chosen tables from Statistics Canada have confirmed the validity of this approach.
Pages: 1054-1058
Page count: 5