OLAP on Multidimensional Text Databases: Topic Network Cube and its Applications

Cited by: 0
Authors
Zhang, Zhiyuan [1 ]
Wang, Hong [1 ]
Feng, Xingjie [1 ]
Affiliations
[1] Civil Aviat Univ China, Sch Comp Sci & Technol, Tianjin, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
multidimensional text database; topic network cube; OLAP; text mining; complex network;
DOI
10.2298/FIL1805973Z
Chinese Library Classification number
O29 [Applied Mathematics];
Discipline classification code
070104 ;
Abstract
Multidimensional text data contains both structured attributes and unstructured text. Unlike traditional numerical data, it does not lend itself directly to online analytical processing (OLAP). Although OLAP methods such as the topic cube have been proposed to exploit both the structured information and the text, these methods cannot reveal the relations among topic words. Considering that a topic usually consists of several subtopics and each subtopic usually contains some topic words, we use a topic network, in which related topic words are connected, to express the complex relations among topics. This paper introduces a new concept, the topic network cube, for performing OLAP analysis on multidimensional text databases. First, we propose a method called GL-LDA, based on the Gibbs sampling outputs of Labeled LDA, to measure the relations between topic words. Second, we give a storage model for the topic network cube that can efficiently generate topic networks using GL-LDA. Third, we show how to perform OLAP analysis on the topic network cube. Experimental results show that multidimensional text databases can be analyzed easily and effectively at different granularities using just a few simple SQL statements, and that the output network provides rich and useful information about topics.
Pages: 1973-1982
Number of pages: 10
Related papers
50 in total
  • [1] Topic modeling for OLAP on multidimensional text databases: Topic cube and its applications
    Zhang, Duo
    Zhai, ChengXiang
    Han, Jiawei
    Srivastava, Ashok
    Oza, Nikunj
    [J]. Statistical Analysis and Data Mining, 2009, 2 (5-6): 378 - 395
  • [2] Extracting Dimensions for OLAP on Multidimensional Text Databases
    Zhang, Chao
    Wang, Xinjun
    Peng, Zhaohui
    [J]. WEB INFORMATION SYSTEMS AND MINING, PT II, 2011, 6988 : 272 - 281
  • [3] ANALYSIS OF MULTIDIMENSIONAL OLAP DATA CUBE
    Codreanu, Diana-Elena
    Radut, Carmen
    Popa, Ionela
    Parpandel, Denisa
    [J]. 17TH INTERNATIONAL CONFERENCE - THE KNOWLEDGE-BASED ORGANIZATION: APPLIED TECHNICAL SCIENCES AND ADVANCED MILITARY TECHNOLOGIES, CONFERENCE PROCEEDING 3, 2011, : 251 - 255
  • [4] NetCube: a comprehensive network traffic analysis model based on multidimensional OLAP data cube
    Park, Daihee
    Yu, Jaehak
    Park, Jun-Sang
    Kim, Myung-Sup
    [J]. INTERNATIONAL JOURNAL OF NETWORK MANAGEMENT, 2013, 23 (02) : 101 - 118
  • [5] A framework for a multidimensional OLAP model using Topic Maps
    Bruckner, RM
    Ling, TW
    Mangisengi, O
    Tjoa, AM
    [J]. SECOND INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL 2, PROCEEDINGS, 2002, : 109 - 118
  • [6] Modeling multidimensional databases, cubes and cube operations
    Vassiliadis, P
    [J]. TENTH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT - PROCEEDINGS, 1998, : 53 - 62
  • [7] Applying Object-Oriented Conceptual Modeling techniques to the design of multidimensional databases and OLAP applications
    Trujillo, JC
    Palomar, M
    Gómez, J
    [J]. WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2000, 1846 : 83 - 94
  • [8] Extendible arrays for statistical databases and OLAP applications
    Rotem, D
    Zhao, JL
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE SYSTEMS, PROCEEDINGS, 1996, : 108 - 117
  • [9] Convex cube: Towards a unified structure for multidimensional databases
    Casali, Alain
    Nedjar, Sebastien
    Cicchetti, Rosine
    Lakhal, Lotfi
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, 4653 : 572 - +
  • [10] Experiments on remote sensing image cube and its OLAP
    Xu, MJ
    [J]. IGARSS 2004: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM PROCEEDINGS, VOLS 1-7: SCIENCE FOR SOCIETY: EXPLORING AND MANAGING A CHANGING PLANET, 2004, : 4398 - 4401