Automatic classification of data-warehouse-data for information lifecycle management using machine learning techniques

Cited by: 4
Authors
Buesch, Sebastian [1]
Nissen, Volker [1]
Wuenscher, Arndt [1]
Affiliations
[1] Ilmenau Univ Technol, Ilmenau, Germany
Keywords
Information lifecycle management; Machine learning; Computational intelligence; Artificial neural net; Multilayer perceptron; Automatic classification; Data warehouse; Business intelligence
DOI
10.1007/s10796-016-9680-8
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The aim of Information Lifecycle Management (ILM) is to govern data throughout its lifecycle as efficiently as possible and effectively from a technical point of view. A core question is where the data should be stored, since different storage options entail different costs and access times. For this purpose, data have to be classified, which at present is done either manually in an elaborate way or with recourse to only a few data attributes, in particular access frequency. In the context of data warehouse systems, this article introduces an automated, and therefore fast and cost-effective, data classification for ILM. Machine learning techniques, in particular an artificial neural network (multilayer perceptron), a support vector machine, and a decision tree approach, are compared on an SAP-based real-world data set from the automotive industry. This data classification considers a large number of data attributes and thus attains results similar to those of human experts. Besides classification accuracy, the comparison also covers the types of misclassification that occur, since these matter in ILM.
Pages: 1085-1099
Number of pages: 15
Related papers
  • [1] Automatic classification of data-warehouse-data for information lifecycle management using machine learning techniques
    Sebastian Büsch
    Volker Nissen
    Arndt Wünscher
    [J]. Information Systems Frontiers, 2017, 19 : 1085 - 1099
  • [2] CLASSIFICATION OF RAIL SWITCH DATA USING MACHINE LEARNING TECHNIQUES
    Bryan, Kaylen J.
    Solomon, Mitchell
    Jensen, Emily
    Coley, Christina
    Rajan, Kailas
    Tian, Charlie
    Mijatovic, Nenad
    Kiss, James M.
    Lamoureux, Benjamin
    Dersin, Pierre
    Smith, Anthony O.
    Peter, Adrian M.
    [J]. PROCEEDINGS OF THE ASME JOINT RAIL CONFERENCE, 2018, 2018,
  • [3] Classification of rocks radionuclide data using machine learning techniques
    Khan, Abdul Razzaq
    Mir, Adil Aslam
    Saeed, Sharjil
    Rafique, Muhammad
    Asim, Khawaja M.
    Iqbal, Talat
    Jabbar, Abdul
    Rahman, Saeed Ur
    [J]. ACTA GEOPHYSICA, 2018, 66 (05) : 1073 - 1079
  • [4] Using machine learning techniques for exploration and classification of laboratory data
    Trulson, Inga
    Holdenrieder, Stefan
    Hoffmann, Georg
    [J]. JOURNAL OF LABORATORY MEDICINE, 2024,
  • [5] Classification of Diabetic Patient Data Using Machine Learning Techniques
    Singh, Pankaj Pratap
    Prasad, Shitala
    Das, Bhaskarjyoti
    Poddar, Upasana
    Choudhury, Dibarun Roy
    [J]. AMBIENT COMMUNICATIONS AND COMPUTER SYSTEMS, RACCCS 2017, 2018, 696 : 427 - 436
  • [6] Classification of rocks radionuclide data using machine learning techniques
    Abdul Razzaq Khan
    Adil Aslam Mir
    Sharjil Saeed
    Muhammad Rafique
    Khawaja M. Asim
    Talat Iqbal
    Abdul Jabbar
    Saeed Ur Rahman
    [J]. Acta Geophysica, 2018, 66 : 1073 - 1079
  • [7] Design of A Data Warehouse for Medical Information System Using Data Mining Techniques
    Farooqui, Nafees Akhter
    Mehra, Ritika
    [J]. 2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 199 - 203
  • [8] Classification of melanoma from Dermoscopic data using machine learning techniques
    Janney J, Bethanney
    Roslin, S. Emalda
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (5-6) : 3713 - 3728
  • [9] Classification of melanoma from Dermoscopic data using machine learning techniques
    Bethanney Janney.J
    S.Emalda Roslin
    [J]. Multimedia Tools and Applications, 2020, 79 : 3713 - 3728
  • [10] Automatic Reverse Engineering of CAN Bus Data Using Machine Learning Techniques
    Huybrechts, Thomas
    Vanommeslaeghe, Yon
    Blontrock, Dries
    Van Barel, Gregory
    Hellinckx, Peter
    [J]. ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING (3PGCIC-2017), 2018, 13 : 751 - 761