Automated machine learning tool: The first stop for data science and statistical model building

Cited by: 0
Authors
Gopagoni D. [1]
Lakshmi P.V. [1]
Affiliation
[1] Department of Computer Science and Engineering, GIT GITAM (Deemed to be University), Vishakhapatnam, Andhra Pradesh
Keywords
Artificial neural networks; Automated machine learning; Drug design; K-means clustering; Market analysis; Naive Bayes classification; QSAR; QSPR; R program; Regression models; Shiny web app; Supervised learning; Support vector machines
DOI
10.14569/ijacsa.2020.0110253
Abstract
Machine learning techniques are designed to derive knowledge from existing data. Increased computational power, together with natural language processing and image processing methods, has made it easy to create rich data. Good domain knowledge is still required to build useful models, and uncertainty remains around choosing the right sample data, reducing the number of variables, and selecting a statistical algorithm. A suitable statistical method coupled with explanatory variables is critical for model building and analysis, and there are multiple choices for each of these parameters. An automated system that helps scientists select an appropriate data set together with a learning algorithm would therefore be very useful. In this study, a freely available web-based platform named the automated machine learning tool (AMLT) is developed. AMLT automates the entire model-building process. It is equipped with the most commonly used variable selection methods and with statistical methods for both supervised and unsupervised learning, including clustering. AMLT uses statistical measures such as R² to rank the models and performs automatic test set validation. The tool is validated for connectivity and capability by reproducing two published works. © Science and Information Organization.
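The abstract mentions ranking models by R² with automatic test set validation. The record does not describe how AMLT implements this (the keywords indicate an R and Shiny implementation), so the following Python snippet is only a minimal sketch of the general idea, under stated assumptions: the candidate models are polynomial regressions of different degrees, the split is a single random hold-out, and the names `r_squared` and `rank_models_by_test_r2` are hypothetical helpers introduced for illustration.

```python
import numpy as np


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - np.mean(y_true)) ** 2)
    return 1.0 - ss_res / ss_tot


def rank_models_by_test_r2(x, y, degrees=(1, 2, 3), test_fraction=0.25, seed=0):
    """Fit one polynomial model per degree on a training split and
    rank the models by R^2 on the held-out test split (best first).

    Hypothetical helper: polynomial regression stands in for whatever
    candidate models a real tool would compare.
    """
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(x))
    n_test = max(2, int(round(test_fraction * len(x))))
    test_idx, train_idx = idx[:n_test], idx[n_test:]

    results = []
    for degree in degrees:
        # Fit on training data only, then score on unseen test data.
        coeffs = np.polyfit(x[train_idx], y[train_idx], degree)
        y_pred = np.polyval(coeffs, x[test_idx])
        results.append((degree, r_squared(y[test_idx], y_pred)))

    # Highest test-set R^2 first.
    return sorted(results, key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    x = np.linspace(-3.0, 3.0, 40)
    y = 2.0 * x ** 2 + 1.0  # synthetic quadratic data, no noise
    for degree, score in rank_models_by_test_r2(x, y):
        print(f"degree={degree}  test R^2={score:.4f}")
```

Scoring on a held-out split rather than on the training data is the point of the design: a model that merely memorises the training set gains no advantage in the ranking.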
Pages: 410-418
Number of pages: 8