Warehousing structured and unstructured data for data mining

被引:0
|
作者
Miller, LL [1 ]
Honavar, V
Barta, T
机构
[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
[2] Iowa State Univ, Dept Ind Engn, Ames, IA 50011 USA
来源
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
More data, especially unstructured data, is available to users than ever. There is so much data available that it is difficult for users to make use of their data in its raw form. To handle the diversity of data types, we have designed and prototyped a multidatabase/warehouse system. The system has been especially designed to facilitate the interaction of structured and unstructured data. The system makes use of object oriented views. The main features of the view mechanism, especially as they relate to textual documents, are presented in the paper. The system is designed to take target documents either from large repositories or from the Web. Issues for both sources of documents are examined in the paper. The paper also looks at how the view approach allows the interaction between the data taken from structured (e.g., relational), semistructured (e.g., object oriented) and unstructured (e.g. text) data sources. The warehouse support provided by the system is briefly examined and the paper concludes by looking at our approach to data mining and how the system will operate in the complete environment.
引用
收藏
页码:215 / 224
页数:10
相关论文
共 50 条
  • [41] An integration of data mining and data warehousing for hierarchical multimedia information retrieval
    You, J
    Dillon, T
    Liu, J
    [J]. PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 373 - 376
  • [42] List representation applied to sparse datacubes for data warehousing and data mining
    Wang, F
    Marir, F
    Gordon, J
    Helian, N
    [J]. INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 871 - 875
  • [43] The bibliomining process: Data warehousing and data mining for library decision making
    Nicholson, S
    [J]. INFORMATION TECHNOLOGY AND LIBRARIES, 2003, 22 (04) : 146 - 151
  • [44] Transforming corporate information into value through data warehousing and data mining
    Cheng, PS
    Chang, P
    [J]. ASLIB PROCEEDINGS, 1998, 50 (05): : 109 - 113
  • [45] Introduction to the minitrack: Databases, data warehousing, and data mining in health care
    Information Systems and Decision Sciences, College of Business Administration, University of South Florida, Tampa
    FL, United States
    不详
    [J]. Proceedings of the Annual Hawaii International Conference on System Sciences, 2000, 2000-January
  • [46] Multiagent data warehousing and multiagent data mining for cerebrum/cerebellum modeling
    Zhang, WR
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY IV, 2002, 4730 : 261 - 271
  • [47] I/O problems in preparing data for data warehousing and data mining, Part 1
    Kim, W
    [J]. JOURNAL OF OBJECT-ORIENTED PROGRAMMING, 1999, 11 (09): : 13 - +
  • [48] Mining and fusing unstructured online reviews and structured public index data for hospital selection
    Liao, Huchang
    Qi, Jiaxin
    Zhang, Jiawei
    Zhang, Chonghui
    Liu, Fan
    Ding, Weiping
    [J]. INFORMATION FUSION, 2024, 103
  • [49] Graph integration of structured, semistructured and unstructured data for data journalism
    Anadiotis, Angelos Christos
    Balalau, Oana
    Conceicao, Catarina
    Galhardas, Helena
    Haddad, Mhd Yamen
    Manolescu, Ioana
    Merabti, Tayeb
    You, Jingmao
    [J]. INFORMATION SYSTEMS, 2022, 104
  • [50] Benchmarking Data Lakes Featuring Structured and Unstructured Data with DLBench
    Sawadogo, Pegdwende N.
    Darmont, Jerome
    [J]. BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2021), 2021, 12925 : 15 - 26