A Scientific Knowledge Discovery and Data Mining Process Model for Metabolomics

被引:6
|
作者
Banimustafa, Ahmed [1 ]
Hardy, Nigel [2 ]
机构
[1] ISRA Univ, Dept Software Engn, Amman 11622, Jordan
[2] Aberystwyth Univ, Dept Comp Sci, Aberystwyth SY23 3DB, Dyfed, Wales
来源
IEEE ACCESS | 2020年 / 8卷
关键词
Metabolomics; Data mining; Data models; Knowledge discovery; Data analysis; Analytical models; Software; bioinformatics; computational biology; knowledge discovery; machine learning; metabolomics data analysis; process engineering; software engineering; MINIMUM REPORTING STANDARDS; PLANT METABOLOMICS; FUNCTIONAL GENOMICS; KDD PROCESS; ONTOLOGY; SYSTEMS; FRAMEWORK; TOOLS; WORK;
D O I
10.1109/ACCESS.2020.3039064
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work presents a scientific data mining process model for metabolomics that provides a systematic and formalised framework for guiding and performing metabolomics data analysis in a justifiable and traceable manner. The process model is designed to promote the achievement of the analytical objectives of metabolomics investigations and to ensure the validity, interpretability and reproducibility of their results. It satisfies the requirements of metabolomics data mining, focuses on the contextual meaning of metabolomics knowledge, and addresses the shortcomings of existing data mining process models, while paying attention to the practical aspects of metabolomics investigations and other desirable features. The process model development involved investigating the ontologies and standards of science, data mining and metabolomics and its design was based on the principles, best practices and inspirations from Process Engineering, Software Engineering, Scientific Methodology and Machine Learning. A software environment was built to realise and automate the process model execution and was then applied to a number of metabolomics datasets to demonstrate and evaluate its applicability to different metabolomics investigations, approaches and data acquisition instruments on one hand, and to different data mining approaches, goals, tasks and techniques on the other. The process model was successful in satisfying the requirements of metabolomics data mining and can be generalised to perform data mining in other scientific disciplines.
引用
收藏
页码:209964 / 210005
页数:42
相关论文
共 50 条
  • [1] Toward an integrated knowledge discovery and data mining process model
    Sharma, Sumana
    Osei-Bryson, Kweku-Muata
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2010, 25 (01): : 49 - 67
  • [2] Evaluation of an integrated Knowledge Discovery and Data Mining process model
    Sharma, Sumana
    Osei-Bryson, Kweku-Muata
    Kasper, George M.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (13) : 11335 - 11348
  • [3] A Knowledge Discovery and Data Mining Process Model in E-Marketing
    Zeng, Huifang
    Pan, Ding
    [J]. 2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 3960 - 3964
  • [4] Knowledge discovery process for scientific and engineering data
    Barrios, LJ
    Rudolph, S
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY IV, 2002, 4730 : 118 - 125
  • [5] Data mining and knowledge discovery in databases: Implications for scientific databases
    Fayyad, U
    [J]. NINTH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 1997, : 2 - 11
  • [6] Fuzzy Trends Data Mining in Knowledge Discovery Process
    Yarushkina, Nadezda
    Afanasieva, Tatiana
    Zavarzin, Denis
    Guskov, Gleb
    [J]. CREATIVITY IN INTELLIGENT TECHNOLOGIES AND DATA SCIENCE, CIT&DS 2015, 2015, 535 : 115 - 123
  • [7] A survey of knowledge discovery and data mining process models
    Kurgan, Lukasz A.
    Musilek, Petr
    [J]. KNOWLEDGE ENGINEERING REVIEW, 2006, 21 (01): : 1 - 24
  • [8] A practical knowledge discovery process for distributed data mining
    Liu, JB
    Han, J
    [J]. INTELLIGENT SYSTEMS, 2002, : 11 - 16
  • [9] Knowledge discovery through mining process operational data
    Wang, XZ
    [J]. APPLICATION OF NEURAL NETWORKS AND OTHER LEARNING TECHNOLOGIES IN PROCESS ENGINEERING, 2001, : 287 - 328
  • [10] Data mining for knowledge discovery in mining
    Golosinski, TS
    Hu, H
    [J]. MINE PLANNING AND EQUIPMENT SELECTION 2001, 2001, : 1011 - 1018