Multidimensional analysis model for a document warehouse that includes textual measures

被引:6
|
作者
Mendoza, Martha [1 ,2 ]
Alegria, Erwin [1 ]
Maca, Manuel [1 ]
Cobos, Carlos [1 ,2 ]
Leon, Elizabeth [3 ]
机构
[1] Univ Cauca, Informat Technol Res Grp GTI, Popayan, Colombia
[2] Univ Cauca, Elect & Telecommun Engn Fac, Popayan, Colombia
[3] Univ Nacl Colombia, Fac Engn, Medellin, Antioquia, Colombia
关键词
Document warehouse; OLAP; Textual measures; Text warehouse; ALGORITHM;
D O I
10.1016/j.dss.2015.02.008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data warehouses and On-Line Analytical Processing tools, OLAP, together permit a multi-dimensional analysis of structured data information. However, as business systems are increasingly required to handle substantial quantities of unstructured textual information, the need arises for an effective and similar means of analysis. To manage unstructured text data stored in data warehouses, a new multi-dimensional analysis model is proposed that includes textual measures as well as a topic hierarchy. In this model, the textual measures that associate the topics with the text documents are generated by Probabilistic Latent Semantic Analysis, while the hierarchy is created automatically using a clustering algorithm. Documents are then able to be queried using OLAP tools. The model was evaluated from two viewpoints query execution time and user satisfaction. Evaluation of execution time was carried out on scientific articles using two query types and user satisfaction (with query time and ease of use) using statistical frequency and multivariate analyses. Encouraging observations included that as the number of documents increases, query time increases as a lineal, rather than exponential tendency. In addition, the model gained an increasing acceptance with use, while the visualization of the model was also well received by users. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:44 / 59
页数:16
相关论文
共 50 条
  • [41] Gaussianizing the Earth: Multidimensional Information Measures for Earth Data Analysis
    Emmanuel Johnson, J.
    Laparra, Valero
    Piles, Maria
    Camps-Valls, Gustau
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2021, 9 (04) : 191 - 208
  • [42] Multidimensional Analysis and Location Intelligence Application for Spatial Data Warehouse Hotspot in Indonesia using SpagoBI
    Hasanah, Gamma Uswatun
    Trisminingsih, Rina
    [J]. WORKSHOP AND INTERNATIONAL SEMINAR ON SCIENCE OF COMPLEX NATURAL SYSTEMS, 2016, 31
  • [43] A two warehouse supply-chain model under possibility/necessity/credibility measures
    Das, B.
    Maity, K.
    Maiti, A.
    [J]. MATHEMATICAL AND COMPUTER MODELLING, 2007, 46 (3-4) : 398 - 409
  • [44] Data Warehouse System for Multidimensional Analysis of Tuition Fee Level in Higher Education Institutions in Indonesia
    Yulianto, Ardhian Agung
    Kasahara, Yoshiya
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (06) : 541 - 550
  • [45] Multidimensional Social Network: Model and Analysis
    Kazienko, Przemyslaw
    Musial, Katarzyna
    Kukla, Elzbieta
    Kajdanowicz, Tomasz
    Bródka, Piotr
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT I, 2011, 6922 : 378 - 387
  • [46] A conceptual model for multidimensional analysis of documents
    Ravat, Franck
    Teste, Olivier
    Tournier, Ronan
    Zurlfluh, Gilles
    [J]. CONCEPTUAL MODELING - ER 2007, PROCEEDINGS, 2007, 4801 : 550 - +
  • [47] A Novel Multidimensional Reference Model for Heterogeneous Textual Datasets using Context, Semantic and Syntactic Clues
    Kumar, Ganesh
    Basri, Shuib
    Imam, Abdullahi Abubakar
    Balogun, Abdullateef Oluwagbemiga
    Mamman, Hussaini
    Capretz, Luiz Fernando
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 754 - 763
  • [49] Effect Size Measures for Differential Item Functioning in a Multidimensional IRT Model
    Suh, Youngsuk
    [J]. JOURNAL OF EDUCATIONAL MEASUREMENT, 2016, 53 (04) : 403 - 430
  • [50] The Model of Semi-parametric Stochastic Frontier and its Calculation Based on Multidimensional Matrix and Data Warehouse
    Tong, Hengqing
    Lu, Xiaochuan
    Wang, Wenjuan
    [J]. ISBIM: 2008 INTERNATIONAL SEMINAR ON BUSINESS AND INFORMATION MANAGEMENT, VOL 1, 2009, : 457 - +