MINING SURPRISING PATTERNS AND THEIR EXPLANATIONS IN CLINICAL DATA

被引:9
|
作者
Kuo, Yen-Ting [1 ]
Lonie, Andrew [1 ]
Pearce, Adrian R. [1 ]
Sonenberg, Liz [1 ]
机构
[1] Univ Melbourne, Dept Comp & Informat Syst, Melbourne, Vic 3010, Australia
关键词
INTERESTINGNESS MEASURES; KNOWLEDGE DISCOVERY; UNEXPECTED PATTERNS;
D O I
10.1080/08839514.2014.875679
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new technique for interactively mining patterns and generating explanations by harnessing the expertise of domain experts. Key to the approach is the distinction between what is unexpected from the perspective of the computational data mining process and what is surprising to the domain experts and interesting relative to their needs. We demonstrate the potential of the approach for discovering patterns and generating rich explanations in a clinical domain. Discovering interesting facts in clinical data is a grand challenge, because medical practitioners and clinicians generally have exceptional knowledge in the problem domain in which they work, however, this knowledge is typically difficult to isolate computationally. To identify the desired surprising patterns, we formally record user knowledge and use that knowledge to filter and constrain the output from an objective data mining technique, with the user making the final judgement about whether a rule is surprising. Specifically, we introduce an unexpectedness algorithm based on association rule mining and Bayesian Networks and a E-explanations technique for explanation generation to identify unexpected patterns. An implemented prototype is successfully demonstrated using a large clinical database recording incidence, prevalance, and outcome of dialysis and kidney transplant patients.
引用
收藏
页码:111 / 138
页数:28
相关论文
共 50 条
  • [21] Mining ordinal patterns for data cleaning
    Liu, YB
    Liu, DY
    PROCEEDINGS OF THE 2004 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI-2004), 2004, : 438 - 443
  • [22] Mining ordinal patterns for data cleaning
    Liu, Y. B.
    Liu, D. Y.
    COMPUTATIONAL METHODS, PTS 1 AND 2, 2006, : 1267 - +
  • [23] Database support for data mining patterns
    Kotsifakos, E
    Ntoutsi, I
    Theodoridis, Y
    ADVANCES IN INFORMATICS, PROCEEDINGS, 2005, 3746 : 14 - 24
  • [24] Finding good-quality surprising patterns in time series data
    Department of Computer Science and Engineering, Northwestern Polytechnical University, Xi'an 710072, China
    Xibei Gongye Daxue Xuebao, 2007, 3 (425-428):
  • [25] Generating synthetic data to match data mining patterns
    Eno, Josh
    Thompson, Craig W.
    IEEE INTERNET COMPUTING, 2008, 12 (03) : 78 - 82
  • [26] A Dynamic Approach for Mining Generalised Sequential Patterns in Time Series Clinical Data Sets
    Shameem, M. Rasheeda
    Naseem, M. Razia
    Subanivedhi, N. K.
    Sethukkarasi, R.
    ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 2, 2013, 177 : 667 - +
  • [27] Clinical Patterns of Obstructive Sleep Apnea and Its Comorbid Conditions: A Data Mining Approach
    Huang, Qi Rong
    Qin, Zhenxing
    Zhang, Shichao
    Chow, Chin Moi
    JOURNAL OF CLINICAL SLEEP MEDICINE, 2008, 4 (06): : 543 - 550
  • [28] Clinical Data-Mining
    Joelson, Richard B.
    SOCIAL WORK IN MENTAL HEALTH, 2010, 9 (01) : 71 - 72
  • [29] Introduction to the mining of clinical data
    Harrison, James H., Jr.
    CLINICS IN LABORATORY MEDICINE, 2008, 28 (01) : 1 - +
  • [30] Clinical Data-Mining
    Guzzetta, Charles
    JOURNAL OF TEACHING IN SOCIAL WORK, 2010, 30 (03) : 353 - 355