OLAP on Multidimensional Text Databases: Topic Network Cube and its Applications

被引:0
|
作者
Zhang, Zhiyuan [1 ]
Wang, Hong [1 ]
Feng, Xingjie [1 ]
机构
[1] Civil Aviat Univ China, Sch Comp Sci & Technol, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
multidimensional text database; topic network cube; OLAP; text mining; complex network;
D O I
10.2298/FIL1805973Z
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Multidimensional text data contains both structured attributes and unstructured text. Unlike the traditional numerical data, it is not straightforward to apply online analytical processing on multidimensional text data. Although some OLAP methods such as topic cube have been proposed in order to effectively utilize its structured information and valuable text data, these methods cant tell the relations of topic words. Considering that topics usually consist of several subtopics and each subtopic usually contains some topic words, we here use a topic network manner, in which related topic words are connected, to express the complex relations of topics. This paper introduces a new concept of topic network cube to perform OLAP analysis on multidimensional text databases. Firstly, we propose a method called GL-LDA based on Gibbs sampling outputs of Labeled LDA to measure the relations between topic words. Secondly, we give a storage model of topic network cube which can efficiently generate topic network using GL-LDA. Thirdly, we show how to perform OLAP analysis on topic network cube. Experimental results show that we can analyze multidimensional text databases in different granularity easily and effectively using just a few simple SQL statements, and the output network provides rich and useful information of topics.
引用
收藏
页码:1973 / 1982
页数:10
相关论文
共 50 条
  • [31] On the multidimensional extension of countermonotonicity and its applications
    Lee, Woojoo
    Ahn, Jae Youn
    INSURANCE MATHEMATICS & ECONOMICS, 2014, 56 : 68 - 79
  • [32] Multidimensional Watson lemma and its applications
    Rytova, A. I.
    Yarovaya, E. B.
    MATHEMATICAL NOTES, 2016, 99 (3-4) : 406 - 412
  • [33] Multidimensional Watson lemma and its applications
    A. I. Rytova
    E. B. Yarovaya
    Mathematical Notes, 2016, 99 : 406 - 412
  • [34] Multi-center, Multi-topic Heart Sound Databases and their Applications
    Xie, Meilan
    Xiao, Shouzhong
    Liu, Tianhu
    Yi, Qijian
    You, FengZhi
    Guo, Xingming
    Shao, Yong
    Huo, Junmimg
    Du, Deqi
    Xu, DongMei
    Wu, Wenzhu
    Xiao, Zifu
    Yang, Yong
    Guo, Weizhen
    JOURNAL OF MEDICAL SYSTEMS, 2012, 36 (01) : 33 - 40
  • [35] Multi-center, Multi-topic Heart Sound Databases and their Applications
    Meilan Xie
    Shouzhong Xiao
    Tianhu Liu
    Qijian Yi
    FengZhi You
    Xingming Guo
    Yong Shao
    Junmimg Huo
    Deqi Du
    DongMei Xu
    Wenzhu Wu
    Zifu Xiao
    Yong Yang
    Weizhen Guo
    Journal of Medical Systems, 2012, 36 : 33 - 40
  • [36] THE SEMANTIC DATA CUBE SYSTEM PLATO AND ITS APPLICATIONS
    Bilidas, Dimitris
    Mantas, Anastasios
    Yfantis, Filippos
    Stamoulis, George
    Koubarakis, Manolis
    Habas, Jose Maria Tarraga
    Marco, Eva Sevillano
    Castel, Fabien
    Laine, Camille
    IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 2514 - 2518
  • [38] On-Line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking
    AlSumait, Loulwah
    Barbara, Daniel
    Domeniconi, Carlotta
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 3 - 12
  • [39] Utilizing Recurrent Neural Network for topic discovery in short text scenarios
    Lu, Heng-Yang
    Kang, Ning
    Li, Yun
    Zhan, Qian-Yi
    Xie, Jun-Yuan
    Wang, Chong-Jun
    INTELLIGENT DATA ANALYSIS, 2019, 23 (02) : 259 - 277
  • [40] Region Reinforcement Network With Topic Constraint for Image-Text Matching
    Wu, Jie
    Wu, Chunlei
    Lu, Jing
    Wang, Leiquan
    Cui, Xuerong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) : 388 - 397