Diabetes Data Analysis and Prediction Model Discovery Using RapidMiner

被引:0
|
作者
Han, Jianchao [1 ]
Rodriguze, Juan C. [1 ]
Beheshti, Mohsen [1 ]
机构
[1] Calif Statement Univ Dominguez Hills, Dept Comp Sci, Carson, CA USA
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Data mining techniques have been extensively applied in bioinformatics to analyze biomedical data. In this paper, we choose the Rapid-I's RapidMiner as our tool to analyze a Pima Indians Diabetes Data Set, which collects the information of patients with and without developing diabetes. The discussion follows the data mining process. The focus will be on the data preprocessing, including attribute identification and selection, outlier removal, data normalization and numerical discretization, visual data analysis, hidden relationships discovery, and a diabetes prediction model construction.
引用
收藏
页码:1048 / 1051
页数:4
相关论文
共 50 条
  • [21] A Prediction Model to Diabetes Using Artificial Metaplasticity
    Marcano-Cedeno, Alexis
    Torres, Joaquin
    Andina, Diego
    NEW CHALLENGES ON BIOINSPIRED APPLICATIONS: 4TH INTERNATIONAL WORK-CONFERENCE ON THE INTERPLAY BETWEEN NATURAL AND ARTIFICIAL COMPUTATION, IWINAC 2011, PART II, 2011, 6687 : 418 - 425
  • [22] Using metagenomic data to boost protein structure prediction and discovery
    Hou, Qingzhen
    Pucci, Fabrizio
    Pan, Fengming
    Xue, Fuzhong
    Rooman, Marianne
    Feng, Qiang
    Computational and Structural Biotechnology Journal, 2022, 20 : 434 - 442
  • [23] Using metagenomic data to boost protein structure prediction and discovery
    Hou, Qingzhen
    Pucci, Fabrizio
    Pan, Fengming
    Xue, Fuzhong
    Rooman, Marianne
    Feng, Qiang
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 434 - 442
  • [24] Using metagenomic data to boost protein structure prediction and discovery
    Hou, Qingzhen
    Pucci, Fabrizio
    Pan, Fengming
    Xue, Fuzhong
    Rooman, Marianne
    Feng, Qiang
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 434 - 442
  • [25] Comparative Analysis of Premises Valuation Models Using KEEL, RapidMiner, and WEKA
    Graczyk, Magdalena
    Lasota, Tadeusz
    Trawinski, Bogdan
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: SEMANTIC WEB, SOCIAL NETWORKS AND MULTIAGENT SYSTEMS, 2009, 5796 : 800 - +
  • [26] Prediction Model of Sports Results Base on Knowledge Discovery in Data - base
    Zhao, Baojin
    Chen, Lei
    2016 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA 2016), 2016, : 288 - 291
  • [27] Spiral discovery of a separate prediction model from chronic hepatitis data
    Jumi, Masatoshi
    Suzuki, Einoshin
    Ohshima, Muneaki
    Zhong, Ning
    Yokoi, Hideto
    Takabayashi, Katsuhiko
    NEW FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2007, 3609 : 464 - +
  • [28] Metadata Discovery Using Data Sampling and Exploratory Data Analysis
    Khalid, Hiba
    Wrembel, Robert
    Zimanyi, Esteban
    MODEL AND DATA ENGINEERING, MEDI 2019, 2019, 11815 : 106 - 120
  • [29] A New Transient Voltage Stability Prediction Model using Big Data Analysis
    Zhao, Bingbing
    Cao, Junwei
    Zhu, Ziyu
    Zhang, Huaying
    2016 IEEE INNOVATIVE SMART GRID TECHNOLOGIES - ASIA (ISGT-ASIA), 2016, : 1065 - 1069
  • [30] A Prediction Model based on Big Data Analysis Using Hybrid FCM Clustering
    Yang, Seokhwan
    Kim, Jaechun
    Chung, Mokdong
    2014 9TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2014, : 337 - 339