OLAP on Multidimensional Text Databases: Topic Network Cube and its Applications

Authors
Zhang, Zhiyuan [1]
Wang, Hong [1]
Feng, Xingjie [1]
Affiliation
[1] Civil Aviation University of China, School of Computer Science and Technology, Tianjin, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
multidimensional text database; topic network cube; OLAP; text mining; complex network
DOI
10.2298/FIL1805973Z
Chinese Library Classification number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Multidimensional text data contains both structured attributes and unstructured text. Unlike traditional numerical data, it does not lend itself directly to online analytical processing (OLAP). Although OLAP methods such as the topic cube have been proposed to exploit both the structured information and the text, these methods cannot express the relations between topic words. Because a topic usually consists of several subtopics, and each subtopic usually contains several topic words, we represent topics as a network in which related topic words are connected, so that the complex relations within and between topics can be expressed. This paper introduces a new concept, the topic network cube, for performing OLAP analysis on multidimensional text databases. Firstly, we propose a method called GL-LDA, based on the Gibbs sampling outputs of Labeled LDA, to measure the relations between topic words. Secondly, we give a storage model for the topic network cube that can efficiently generate topic networks using GL-LDA. Thirdly, we show how to perform OLAP analysis on the topic network cube. Experimental results show that multidimensional text databases can be analyzed easily and effectively at different granularities using just a few simple SQL statements, and that the output network provides rich and useful information about topics.
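As a rough illustration of what "a few simple SQL statements" over a topic network could look like, the sketch below stores weighted word-pair edges per cube cell and rolls them up over one dimension. It is not the paper's storage model or GL-LDA: the table `topic_edge`, the dimensions `year` and `region`, the functions `build_demo_cube` and `roll_up_by_year`, the toy rows, and the use of a plain sum as the aggregate are all assumptions made for this example.

```python
import sqlite3


def build_demo_cube():
    """Create an in-memory table of word-pair edge weights per cube cell.

    Hypothetical schema and toy data, not taken from the paper.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE topic_edge ("
        " year INTEGER, region TEXT, topic TEXT,"
        " word_a TEXT, word_b TEXT, weight REAL)"
    )
    rows = [
        (2016, "north", "engine", "oil", "leak", 3.0),
        (2016, "south", "engine", "oil", "leak", 2.0),
        (2017, "north", "engine", "oil", "leak", 1.0),
        (2016, "north", "engine", "fan", "blade", 4.0),
        (2017, "south", "weather", "ice", "wing", 5.0),
    ]
    conn.executemany("INSERT INTO topic_edge VALUES (?, ?, ?, ?, ?, ?)", rows)
    return conn


def roll_up_by_year(conn, year):
    """Roll up over the region dimension by summing edge weights for one year.

    Summation is an assumed aggregate; the paper may combine weights differently.
    """
    cur = conn.execute(
        "SELECT topic, word_a, word_b, SUM(weight) AS w"
        " FROM topic_edge WHERE year = ?"
        " GROUP BY topic, word_a, word_b"
        " ORDER BY w DESC, word_a",
        (year,),
    )
    return cur.fetchall()


conn = build_demo_cube()
edges_2016 = roll_up_by_year(conn, 2016)
for topic, word_a, word_b, w in edges_2016:
    print(topic, word_a, word_b, w)
```

Each returned row is one edge of the rolled-up network for the selected slice; drilling down would amount to adding `region` back into the `GROUP BY` clause.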
Pages: 1973-1982
Page count: 10