Proposal of effective preprocessing techniques of financial data

被引:0
|
作者
Abasova, Jela [1 ]
Janosik, Jan [1 ]
Simoncicova, Veronika [1 ]
Tanuska, Pavol [1 ]
机构
[1] Slovak Univ Technol Bratislava, Fac Mat Sci & Technol Trnava, Trnava, Slovakia
关键词
preprocessing; Big Data; Data Mining; BPMN diagram; CRISP-DM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The article focuses on describing process of customization data obtained from ERP software. The aim of this customization was ensuring usability of data collection for obtaining knowledge from data via methods of data mining. First part describes importance of preprocessing phase of working with big data sets. It deals with terms commonly used in the topic - methodology CRISP-DM and its phases, with closer look at individual steps of preprocessing. Second part contains description of the chosen dataset, extraction of raw data and their required format (according to their expected future use). The data are financial indications of a real concern. This part also focuses on methods and programs used for preprocessing the data. Third part corresponds with the main task of this article, what is adjusting obtained data to format which would be appropriate to further processing and analyzing purposes with Data Mining techniques. It describes five phases of preprocessing, modified in a way needed for manipulating with chosen data. The last part compiles acquired knowledge and presents further use of the data after transformation from raw form to demanded format. The paper concludes with BPMN diagram of the process with focusing on preprocessing (data preparation) stage.
引用
收藏
页码:293 / 298
页数:6
相关论文
共 50 条
  • [1] Financial Data Preprocessing Issues
    Lopata, Audrius
    Butleris, Rimantas
    Gudas, Saulius
    Rudzionis, Vytautas
    Rudzioniene, Kristina
    Zioba, Liutauras
    Veitaite, Ilona
    Dilijonas, Darius
    Grisius, Evaldas
    Zwitserloot, Maarten
    [J]. INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2021, 2021, 1486 : 60 - 71
  • [2] On preprocessing data for financial credit risk evaluation
    Piramuthu, S
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2006, 30 (03) : 489 - 497
  • [3] Facilitating data preprocessing by a generic framework: a proposal for clustering
    Kathrin Kirchner
    Jelena Zec
    Boris Delibašić
    [J]. Artificial Intelligence Review, 2016, 45 : 271 - 297
  • [4] Facilitating data preprocessing by a generic framework: a proposal for clustering
    Kirchner, Kathrin
    Zec, Jelena
    Delibasic, Boris
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2016, 45 (03) : 271 - 297
  • [5] Preprocessing DNS Log Data for Effective Data Mining
    Snyder, Mark E.
    Sundaram, Ravi
    Thakur, Mayur
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-8, 2009, : 1366 - +
  • [6] New data preprocessing trends based on ensemble of multiple preprocessing techniques
    Mishra, Puneet
    Biancolillo, Alessandra
    Roger, Jean Michel
    Marini, Federico
    Rutledge, Douglas N.
    [J]. TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2020, 132
  • [7] A comprehensive review on data preprocessing techniques in data analysis
    Cetin, Volkan
    Yildiz, Oktay
    [J]. PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2022, 28 (02): : 299 - 312
  • [8] Application of filtering techniques in preprocessing magnetic data
    Liu Haijun
    Yi Yongping
    Yang Hongxia
    Hu Guochuang
    Liu Guoming
    [J]. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND PATTERN RECOGNITION IN INDUSTRIAL ENGINEERING, 2010, 7820
  • [9] Data preprocessing techniques for classification without discrimination
    Faisal Kamiran
    Toon Calders
    [J]. Knowledge and Information Systems, 2012, 33 : 1 - 33
  • [10] Feature Detection Techniques for Preprocessing Proteomic Data
    Sellers, Kimberly F.
    Miecznikowski, Jeffrey C.
    [J]. INTERNATIONAL JOURNAL OF BIOMEDICAL IMAGING, 2010, 2010