Dynamic Sorted Neighborhood Indexing for Real-Time Entity Resolution

Authors
Ramadan, Banda [1]
Christen, Peter [1]
Liang, Huizhi [1]
Affiliation
[1] Australian Natl Univ, Coll Engn & Comp Sci, Res Sch Comp Sci, Canberra, ACT 0200, Australia
Keywords
Dynamic indexing; data matching; braided tree
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Real-time entity resolution is the process of matching query records, in sub-second time, with records in a database that represent the same real-world entity. Indexing techniques are used to efficiently extract from the database a set of candidate records that are similar to a query record, and these candidates are then compared with the query record in more detail. The sorted neighborhood indexing method, which sorts a database and compares records within a sliding window, has been used successfully for entity resolution on very large databases. However, because it is based on static sorted arrays, this technique is not suitable for dynamic databases. We propose a tree-based dynamic sorted neighborhood index that facilitates matching a stream of query records against a large and dynamic database in real time. We evaluate our approach on two large data sets. Our results show that the times for both inserting and querying records stay nearly constant as the index grows, and that our approach achieves indexing and querying times more than an order of magnitude faster than an earlier real-time entity resolution technique, with comparably high matching accuracy.
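The sliding-window idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' tree-based index: it swaps in a plain sorted list maintained with the standard `bisect` module, so an insertion of a new key costs time linear in the number of keys, which is exactly the kind of cost a tree structure is meant to avoid. The class name `SortedNeighborhoodIndex`, the `window` parameter, and the use of surnames as sorting keys are all assumptions made for illustration.

```python
import bisect
from collections import defaultdict


class SortedNeighborhoodIndex:
    """Toy dynamic sorted-neighborhood index (all names are hypothetical)."""

    def __init__(self, window=1):
        self.window = window              # neighbouring keys taken on each side
        self.keys = []                    # distinct sorting keys, in sorted order
        self.records = defaultdict(list)  # sorting key -> record identifiers

    def insert(self, key, record_id):
        # Place the key at its sorted position only if it has not been seen.
        if key not in self.records:
            bisect.insort(self.keys, key)
        self.records[key].append(record_id)

    def query(self, key):
        # Find where the query key sits, or would sit, in sorted order.
        pos = bisect.bisect_left(self.keys, key)
        found = pos < len(self.keys) and self.keys[pos] == key
        lo = max(0, pos - self.window)
        # If the key itself is indexed, it occupies one extra slot in the window.
        hi = min(len(self.keys), pos + self.window + (1 if found else 0))
        candidates = []
        for k in self.keys[lo:hi]:
            candidates.extend(self.records[k])
        return candidates


index = SortedNeighborhoodIndex(window=1)
for surname, rec in [("miller", "r1"), ("smith", "r2"), ("smyth", "r3"),
                     ("taylor", "r4"), ("smith", "r5")]:
    index.insert(surname, rec)

exact = index.query("smith")    # key present: its neighbours on both sides
unseen = index.query("smithe")  # key absent: keys around its insertion point
```

In this sketch the candidates returned by `query` would then be passed to a detailed record comparison step, which is omitted here.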
Pages: 1 / 12
Page count: 12