Outlier Mining in Medical Databases: An Application of Data Mining in Health Care Management to Detect Abnormal Values Presented In Medical Databases

被引:0
|
作者
Kumar, Varun [1 ]
Kumar, Dharminder [2 ]
Singh, R. K. [3 ]
机构
[1] Inst Technol & Management, Gurgaon, Haryana, India
[2] Guru Jambheshwar Univ Sci & Technol, Hisar, Haryana, India
[3] MP Bhoj Open Univ, Bhopal, Madhya Pradesh, India
关键词
Health Care; Breast Cancer; Data Mining; Outlier Mining; TANAGRA;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Outliers in medical databases can be caused by measurement errors or may be the result of inherent data variability. The abnormal value of mitoses, for instance, could lead to the diagnosis of malignant cancer or it might just be due to human mistake or execution error. In this paper, we make use of a large database, namely, Wisconsin Breast Cancer Database containing 10 attributes and 699 instances to detect outliers. Many data mining algorithms try to minimize the influence of outliers which could result in the loss of important hidden information since "one person's noise could be another person's signal". In particular, we used TANAGRA (A Data Mining Tool) to detect outliers from Breast Cancer Database and analyzed them for knowledge discovery. The results of the experiment show that outlier mining i.e. outlier detection & analysis have a great potential to find useful information from health care databases which consequently helps decision makers to automate & quicken the process of decision making in clinical diagnosis as well as other domains of health care management.
引用
收藏
页码:272 / 277
页数:6
相关论文
共 50 条
  • [1] Data Mining in Multimodal Medical Databases
    Strungaru, Rodica
    Ungureanu, G. Mihaela
    Murri, Roberto
    Pasqualli, Clara
    Seidel, Klaus
    Datcu, Mihai
    Stanciu, Radu
    [J]. INTEGRATING BIOMEDICAL INFORMATION: FROM E-CELL TO E-PATIENT, 2006, : 85 - +
  • [2] Multi-relational data mining in medical databases
    Habrard, A
    Bernard, M
    Jacquenet, F
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, PROCEEDINGS, 2003, 2780 : 365 - 374
  • [3] Mining knowledge in medical image databases
    Perner, P
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY II, 2000, 4057 : 359 - 368
  • [4] Evaluation of the sensitivity of a medical data-mining application to the number of elements in small databases
    Smith, M. R.
    Wang, X.
    Rangayyan, R. M.
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2009, 4 (03) : 262 - 268
  • [5] Outlier Detection in Spatial Databases Using Clustering Data Mining
    Karmaker, Amitava
    Rahman, Syed M.
    [J]. PROCEEDINGS OF THE 2009 SIXTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, VOLS 1-3, 2009, : 1657 - +
  • [6] Introduction to the minitrack: Databases, data warehousing, and data mining in health care
    Information Systems and Decision Sciences, College of Business Administration, University of South Florida, Tampa
    FL, United States
    不详
    [J]. Proceedings of the Annual Hawaii International Conference on System Sciences, 2000, 2000-January
  • [7] A hybrid data mining approach for knowledge extraction and classification in medical databases
    Hassan, Syed Zahid
    Verma, Brijesh
    [J]. PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2007, : 503 - 508
  • [8] Data Mining on Distributed Medical Databases: Recent Trends and Future Directions
    Atilgan, Yasemin
    Dogan, Firat
    [J]. IT REVOLUTIONS, 2009, 11 : 216 - 224
  • [9] Medical data mining - Experience of knowledge discovery in two clinical databases
    Liu, CCH
    Chiang, IJ
    Li, YC
    [J]. AMIA 2002 SYMPOSIUM, PROCEEDINGS: BIOMEDICAL INFORMATICS: ONE DISCIPLINE, 2002, : 1085 - 1085
  • [10] A Novel Mixed Values k-Prototypes Algorithm with Application to Health Care Databases Mining
    Najjar, Ahmed
    Gagne, Christian
    Reinharz, Daniel
    [J]. 2014 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN HEALTHCARE AND E-HEALTH (CICARE), 2014, : 159 - 166