Data preparation for data mining

Citations: 0
Authors
Zhang, SC
Zhang, CQ
Yang, Q
Affiliations
[1] Univ Technol Sydney, Fac Informat Technol, Sydney, NSW 2007, Australia
[2] Hong Kong Univ Sci & Technol, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
DOI
10.1080/713827180
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data preparation is a fundamental stage of data analysis. While a lot of low-quality information is available in various data sources and on the Web, many organizations or companies are interested in how to transform the data into cleaned forms which can be used for high-profit purposes. This goal generates an urgent need for data analysis aimed at cleaning the raw data. In this paper, we first show the importance of data preparation in data analysis, then introduce some research achievements in the area of data preparation. Finally, we suggest some future directions of research and development.
Pages: 375 - 381
Page count: 7
Related papers
50 in total
  • [1] RADAR DATA PREPARATION FOR DATA MINING
    Keller, David
    Ondryhal, Vojtech
    ICMT '07: INTERNATIONAL CONFERENCE ON MILITARY TECHNOLOGIES, 2007, : 622 - 628
  • [2] Preparation of Distributed Heterogeneous Data for Data Mining
    Batasova, Svetlana
    Efimova, Maria
    Kholod, Ivan
    Semenchenko, Alexey
    2015 XVIII International Conference on Soft Computing and Measurements (SCM), 2015, : 205 - 207
  • [3] Integration and Automation of Data Preparation and Data Mining
    Narayanan, Shrikanth
    Jaiswal, Ayush
    Chiang, Yao-Yi
    Geng, Yanhui
    Knoblock, Craig A.
    Szekely, Pedro
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 1076 - 1085
  • [4] POP: A Parallel Optimized Preparation of Data for Data Mining
    Ernst, Christian
    Hmamouche, Youssef
    Casali, Alain
    2015 7TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT (IC3K), 2015, : 36 - 45
  • [5] Data Preparation for Data Mining in Chemical Plants using Big Data
    Borrison, Reuben
    Kloepper, Benjamin
    Mullen, Jennifer
    2019 IEEE 17TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2019, : 1185 - 1191
  • [6] Data preparation in web log mining
    Lu, Lina
    Yang, Yiling
    Guan, Xudong
    Wei, Hengyi
    Jisuanji Gongcheng/Computer Engineering, 2000, 26 (04) : 66 - 67
  • [7] Data preparation using data quality matrices for classification mining
    Davidson, Ian
    Tayi, Giri
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2009, 197 (02) : 764 - 772
  • [8] A Data Preparation Methodology in Data Mining Applied to Mortality Population Databases
    Pérez, Joaquín
    Iturbide, Emmanuel
    Olivares, Víctor
    Hidalgo, Miguel
    Martínez, Alicia
    Almanza, Nelva
    Journal of Medical Systems, 2015, 39
  • [9] A Data Preparation Methodology in Data Mining Applied to Mortality Population Databases
    Perez, Joaquin
    Iturbide, Emmanuel
    Olivares, Victor
    Hidalgo, Miguel
    Almanza, Nelva
    Martinez, Alicia
    NEW CONTRIBUTIONS IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 1, PT 1, 2015, 353 : 1173 - 1182
  • [10] A Data Preparation Methodology in Data Mining Applied to Mortality Population Databases
    Perez, Joaquin
    Iturbide, Emmanuel
    Olivares, Victor
    Hidalgo, Miguel
    Martinez, Alicia
    Almanza, Nelva
    JOURNAL OF MEDICAL SYSTEMS, 2015, 39 (11)