MiTexCube: MicroTextCluster Cube for Online Analysis of Text Cells and its Applications

Cited by: 4
Authors
Zhang, Duo [1]
Zhai, ChengXiang [1]
Han, Jiawei [1]
Affiliations
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
Funding
U.S. National Science Foundation
Keywords
MiTexCube; multidimensional text database; text mining
DOI
10.1002/sam.11159
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A fundamental problem of multidimensional text database analysis is efficient and effective support of various kinds of online applications, such as summarizing the content of a text cell or comparing the contents across multiple text cells. In this paper, we propose a new infrastructure called MicroTextCluster Cube (or MiTexCube) to support efficient online text analysis on multidimensional text databases by introducing micro-clusters of text documents as a compact representation of text content. Experimental results on real multidimensional text databases show that (i) MiTexCube can be materialized efficiently with reasonable overhead in space, and (ii) applications based on the proposed materialized MiTexCube are more efficient than the baseline method of direct analysis based on document units in each cell, without sacrificing much quality of analysis, and MiTexCube naturally accommodates flexible trade-off between efficiency and quality of analysis. (c) 2012 Wiley Periodicals, Inc.
Pages: 243 - 259
Number of pages: 17
Related papers
50 in total
  • [1] OLAP on Multidimensional Text Databases: Topic Network Cube and its Applications
    Zhang, Zhiyuan
    Wang, Hong
    Feng, Xingjie
    FILOMAT, 2018, 32 (05) : 1973 - 1982
  • [2] Research of tourism online public opinion analysis based on text cube model
    Dong, Jianfeng
    Xiao, Liyan
    Guo, Xin
    2015 3RD INTERNATIONAL CONFERENCE ON SOFT COMPUTING IN INFORMATION COMMUNICATION TECHNOLOGY (SCICT 2015), 2015, : 95 - 99
  • [3] Topic modeling for OLAP on multidimensional text databases: Topic cube and its applications
    Zhang, Duo
    Zhai, ChengXiang
    Han, Jiawei
    Srivastava, Ashok
    Oza, Nikunj
    Statistical Analysis and Data Mining, 2009, 2 (5-6) : 378 - 395
  • [4] Content Analysis of Digital Text and Its Applications
    Agnihotri, Anustubh
    Verma, Rahul
    STUDIES IN INDIAN POLITICS, 2019, 7 (01) : 83 - 89
  • [5] The Cube lattice model and its applications
    Chaudron, L
    Maille, N
    Boyer, M
    APPLIED ARTIFICIAL INTELLIGENCE, 2003, 17 (03) : 207 - 242
  • [6] Text Cube: Computing IR Measures for Multidimensional Text Database Analysis
    Lin, Cindy Xide
    Ding, Bolin
    Han, Jiawei
    Zhu, Feida
    Zhao, Bo
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 905 - 910
  • [7] POS Tagging and Its Applications for Mathematics: Text Analysis in Mathematics
    Schoeneberg, Ulf
    Sperber, Wolfram
    INTELLIGENT COMPUTER MATHEMATICS, CICM 2014, 2014, 8543 : 213 - 223
  • [8] Text Mining and Its Applications
    Guo, Shengyu
    Cao, Buyang
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENT COMMUNICATION, 2015, 16 : 72 - 78
  • [9] THE SEMANTIC DATA CUBE SYSTEM PLATO AND ITS APPLICATIONS
    Bilidas, Dimitris
    Mantas, Anastasios
    Yfantis, Filippos
    Stamoulis, George
    Koubarakis, Manolis
    Habas, Jose Maria Tarraga
    Marco, Eva Sevillano
    Castel, Fabien
    Laine, Camille
    IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 2514 - 2518
  • [10] Relational-Situational Method for Text Search and Analysis and Its Applications
    Osipov, G. S.
    Smirnov, I. V.
    Tikhomirov, I. A.
    SCIENTIFIC AND TECHNICAL INFORMATION PROCESSING, 2010, 37 (06) : 432 - 437