Enhancing Multi-Attribute Similarity Join using Reduced and Adaptive Index Trees

被引:0
|
作者
Silva, Vitor Bezerra [1 ]
Nascimento, Dimas Cassimiro [1 ]
机构
[1] Univ Fed Agreste Pernambuco, Ave Bom Pastor, BR-55292270 Garanhuns, Pe, Brazil
关键词
Similarity Join; Index Tree; Filter selection; Feature selection; SEARCH;
D O I
10.1007/s10115-024-02089-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-Attribute Similarity Join represents an important task for a variety of applications. Due to a large amount of data, several techniques and approaches were proposed to avoid superfluous comparisons between entities. One of these techniques is denominated Index Tree. In this work, we proposed an adaptive version (Adaptive Index Tree) of the state-of-the-art Index Tree for multi-attribute data. Our method selects the best filter configuration to construct the Adaptive Index Tree. We also proposed a reduced version of the Index Trees, aiming to improve the trade-off between efficacy and efficiency for the Similarity Join task. Finally, we proposed Filter and Feature selectors designed for the Similarity Join task. To evaluate the impact of the proposed approaches, we employed five real-world datasets to perform the experimental analysis. Based on the experiments, we conclude that our reduced approaches have produced superior results when compared to the state-of-the-art approach, specially when dealing with datasets that present a significant number of attributes and/or and expressive attribute sizes.
引用
下载
收藏
页码:4251 / 4281
页数:31
相关论文
共 50 条
  • [41] Multi-attribute decision-making using q-rung orthopair fuzzy Zagreb index
    Rao, Yongsheng
    Kosari, Saeed
    Hameed, Saira
    Yousaf, Zulqarnain
    Artificial Intelligence Review, 2025, 58 (05)
  • [42] Accident hazard index: A multi-attribute method for process industry hazard rating
    Khan, FI
    Abbasi, SA
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 1997, 75 (B4) : 217 - 224
  • [43] A multi-attribute Systemic Risk Index for comparing and prioritizing chemical industrial areas
    Reniers, G. L. L.
    Soerensen, K.
    Dullaert, W.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2012, 98 (01) : 35 - 42
  • [44] AdaTag: Multi-Attribute Value Extraction from Product Profiles with Adaptive Decoding
    Yan, Jun
    Zalmout, Nasser
    Liang, Yan
    Grant, Christan
    Ren, Xiang
    Dong, Xin Luna
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4694 - 4705
  • [45] Multi-attribute Index Processing Method of Target Threat Assessment in Ground Combat
    Kong D.-P.
    Chang T.-Q.
    Hao N.
    Zhang L.
    Guo L.-B.
    Zidonghua Xuebao/Acta Automatica Sinica, 2021, 47 (01): : 161 - 172
  • [46] USING FUZZY SET APPROACH IN MULTI-ATTRIBUTE AUTOMATED AUCTIONS
    Goyal, Madhu
    Kaushik, Saroj
    ICEIS 2010: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 4: SOFTWARE AGENTS AND INTERNET COMPUTING, 2010, : 81 - 85
  • [47] Visualizing multi-attribute web transactions using a freeze technique
    Hao, MC
    Cotting, D
    Dayal, U
    Machiraju, V
    Garg, P
    VISUALIZATION AND DATA ANALYSIS 2003, 2003, 5009 : 153 - 159
  • [48] PROBABILISTIC RANKING OF MULTI-ATTRIBUTE ITEMS USING INDIFFERENCE CURVE
    Gong, Xiaohui
    Zhao, H. Vicky
    Sun, Yan Lindsay
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [49] Multi-attribute Evaluation and Optimal Selection Using Information Axiom
    Cheng, X. F.
    Liu, P. A.
    Chen, C.
    MANUFACTURING SCIENCE AND ENGINEERING, PTS 1-5, 2010, 97-101 : 3523 - 3526
  • [50] Adaptive-expectation based multi-attribute FTS model for forecasting TAIEX
    Liu, Jing-Wei
    Chen, Tai-Liang
    Cheng, Ching-Hsue
    Chen, Yao-Hsien
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2010, 59 (02) : 795 - 802