Extracting Dimensions for OLAP on Multidimensional Text Databases

Authors
Zhang, Chao [1 ]
Wang, Xinjun [1 ]
Peng, Zhaohui [1 ]
Affiliations
[1] Shandong Univ Jinan, Sch Comp Sci & Technol, Jinan, Peoples R China
Source
Keywords
OLAP; unstructured data; extracting algorithm;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
As the volume of textual information in business systems and on the Internet grows rapidly, there is increasing demand for analyzing both structured data and unstructured text data. Online Analytical Processing (OLAP) is effective for analyzing and mining structured data, but it is ineffective on unstructured data. After working on several information integration and data analysis applications, we recognized this limitation of OLAP for text analysis and set out to address it. In this paper, we propose a semi-supervised algorithm that extracts dimensions and their members from textual information so that large collections of text can be analyzed. We use straightforward measures to express the analysis results. Experimental results show that the extraction algorithm is valid and that our approach is highly scalable and flexible.
Pages: 272-281
Page count: 10