EPMS: A framework for large-scale patient matching

被引:0
|
作者
Singhal, Himanshu [1 ]
Ravi, Harish [1 ]
Chakravarthy, Sathiya Narayan [1 ]
Balasundaram, Prabavathy [1 ]
Babu, Chitra [1 ]
机构
[1] SSN Coll Engn, Dept CSE, Chennai, Tamil Nadu, India
关键词
Patient matching; Deduplication; Variational autoencoders; Blocking; Locality sensitive hashing; Fuzzy matching; RECORD LINKAGE;
D O I
10.1109/ICTAI.2019.00153
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The healthcare industry, through digitization, is trying to achieve interoperability, but has not been able to achieve complete Health Information Exchange (HIE). One of the major challenges in achieving this is the inability to accurately match patient data. Mismatching of patient records can lead to improper treatment which can prove to be fatal. Also, the presence of duplicate overheads has caused inaccessibility to crucial information in the time of need. Existing solutions to patient matching are both time-consuming and non-scalable. This paper proposes a framework, namely, Electronic Patient Matching System (EPMS), which attempts to overcome these barriers while achieving a good accuracy in matching patient records. The framework encodes the patient records using variational autoencoder and amalgamates them by performing locality sensitive hashing on an Apache spark cluster. This makes the process faster and highly scalable. Furthermore, a fuzzy matching of the records in each block is performed using Levenshtein distances to identify the duplicate patient records. Experimental investigations were performed on a synthetically generated dataset consisting of 44555 patient records. The proposed framework achieved a matching accuracy of 81.15% on this dataset.
引用
收藏
页码:1096 / 1101
页数:6
相关论文
共 50 条
  • [1] A Framework for Efficient Matching of Large-Scale Metadata Models
    Moawed, Seham
    Algergawy, Alsayed
    Sarhan, Amany
    Eldosouky, Ali
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (04) : 3117 - 3135
  • [2] A Framework for Efficient Matching of Large-Scale Metadata Models
    Seham Moawed
    Alsayed Algergawy
    Amany Sarhan
    Ali Eldosouky
    [J]. Arabian Journal for Science and Engineering, 2019, 44 : 3117 - 3135
  • [3] Large-scale Boolean Matching
    Katebi, Hadi
    Markov, Igor L.
    [J]. 2010 DESIGN, AUTOMATION & TEST IN EUROPE (DATE 2010), 2010, : 771 - 776
  • [4] FTRLIM: Distributed Instance Matching Framework for Large-Scale Knowledge Graph Fusion
    Zhu, Hongming
    Wang, Xiaowen
    Jiang, Yizhi
    Fan, Hongfei
    Du, Bowen
    Liu, Qin
    [J]. ENTROPY, 2021, 23 (05)
  • [5] Efficient Large-Scale Stereo Matching
    Geiger, Andreas
    Roser, Martin
    Urtasun, Raquel
    [J]. COMPUTER VISION-ACCV 2010, PT I, 2011, 6492 : 25 - +
  • [6] Large-Scale Collective Entity Matching
    Rastogi, Vibhor
    Dalvi, Nilesh
    Garofalakis, Minos
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (04): : 208 - 218
  • [7] Map Matching Algorithm for Large-scale Datasets
    Fiedler, David
    Cap, Michal
    Nykl, Jan
    Zilecky, Pavol
    [J]. ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 3, 2022, : 500 - 508
  • [8] Survey on large-scale graph pattern matching
    Yu, Jing
    Liu, Yanbing
    Zhang, Yu
    Liu, Mengya
    Tan, Jianlong
    Guo, Li
    [J]. Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (02): : 391 - 409
  • [9] Large-scale biomedical ontology matching with ServOMap
    Ba, M.
    Diallo, G.
    [J]. IRBM, 2013, 34 (01) : 56 - 59
  • [10] A Distributed Algorithm for Large-Scale Generalized Matching
    Manshadi, Faraz Makari
    Awerbuch, Baruch
    Gemulla, Rainer
    Khandekar, Rohit
    Mestre, Julian
    Sozio, Mauro
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (09): : 613 - 624