Improved scalability in mining using ontology record linkage algorithm

被引:0
|
作者
Prabhu, T. [1 ]
Dhas, C. Suresh Gnana [2 ]
机构
[1] Manonmaniam Sundaranar Univ, Dept Comp Sci & Engn, Thirunelveli 627012, Tamil Nadu, India
[2] Vivekanadha Coll Engn Women, Dept Comp Sci & Engn, Tiruchengode 637205, Tamil Nadu, India
关键词
Record linkage; Data mining; Angle based neighborhood; Ontology; Conventional method; INJURIES;
D O I
10.1016/j.compeleceng.2018.01.026
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Record linkage offers wide role in record identification and relevant datasets matching. The conventional researchers use probabilistic approach to identify reliable and unique datasets. Record linkage with probabilistic approach exploits data, which are common to an individual record pair. Classical methods have equality based record linkage in common fields. Therefore, errors associated with record linkage reduce the scalability. In this paper, a similarity between individual values of record pairs is improved using ontology-based semantic similarity model. Semantic similarity between the records is tested successfully using angle based neighborhood graph. To validate the proposed approach, a conventional record linkage algorithm is compared with angle based neighborhood ontology record linkage technique, which achieves improved accuracy and scalability. Finally, the accuracy of identifying similar semantic matches is more scalable in proposed technique than conventional methods. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:511 / 519
页数:9
相关论文
共 50 条
  • [1] Improved quality of tuberculosis data using record linkage
    Bartholomay, Patricia
    de Oliveira, Gisele Pinto
    Pinheiro, Rejane Sobrino
    Nogales Vasconcelos, Ana Maria
    [J]. CADERNOS DE SAUDE PUBLICA, 2014, 30 (11): : 2459 - 2469
  • [2] Using an interest ontology for improved support in rule mining
    Chen, XM
    Zhou, X
    Scherl, R
    Geller, J
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2003, 2737 : 320 - 329
  • [3] Privacy Preserving Record Linkage using MetaSoundex Algorithm
    Koneru, Keerthi
    Varol, Cihan
    [J]. 2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 443 - 447
  • [4] An ontology-based record linkage method for textual microdata
    Martinez, Sergio
    Valls, Aida
    Sanchez, David
    [J]. ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2011, 232 : 130 - 139
  • [5] Effective record linkage for mining campaign contribution data
    Giraud-Carrier, C.
    Goodliffe, J.
    Jones, B. M.
    Cueva, S.
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2015, 45 (02) : 389 - 416
  • [6] Effective record linkage for mining campaign contribution data
    C. Giraud-Carrier
    J. Goodliffe
    B. M. Jones
    S. Cueva
    [J]. Knowledge and Information Systems, 2015, 45 : 389 - 416
  • [7] FIRLA: a Fast Incremental Record Linkage Algorithm
    Soliman, Ahmed
    Rajasekaran, Sanguthevar
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 130
  • [8] Mining Frequent Itemsets in Association Rule Mining Using Improved SETM Algorithm
    Hanirex, D. Kerana
    Kaliyamurthie, K. P.
    [J]. ARTIFICIAL INTELLIGENCE AND EVOLUTIONARY COMPUTATIONS IN ENGINEERING SYSTEMS, ICAIECES 2015, 2016, 394 : 765 - 773
  • [9] Implementation of an Improved Algorithm for Frequent Itemset Mining using Hadoop
    Agarwal, Ruchi
    Singh, Sunny
    Vats, Satvik
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 13 - 18
  • [10] Frequent Itemset Mining using Improved Apriori Algorithm with MapReduce
    Tribhuvan, Seema A.
    Gavai, Nitin R.
    Vasgi, Bharti P.
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2017,