PROM: Efficient matching query processing on high-dimensional data

被引:0
|
作者
Ma, Chunyang [1 ]
Zhou, Yongluan [2 ]
Shou, Lidan [3 ]
Chen, Gang [3 ]
机构
[1] IBM Res Corp, Beijing, Peoples R China
[2] Univ Southern Denmark, Dept Math & Comp Sci, Copenhagen, Denmark
[3] Zhejiang Univ, Dept Comp Sci, Hangzhou, Zhejiang, Peoples R China
关键词
Index; High-dimensional; Matching; CLOSEST-PAIR QUERIES; RECOMMENDATION;
D O I
10.1016/j.ins.2015.05.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In many applications, such as online dating or job hunting websites, users often need to search for potential matches based on the requirements or preferences imposed by both sides. We refer to this type of queries as matching queries. In spite of their wide applicabilities, there has been little attention devoted to improving their performance. As matching queries often appear in various forms even within a single application, we, in this paper, propose a general processing framework, which can efficiently process various forms of matching queries. Moreover, we illustrate the applicability of this framework by elaborating the detailed processing algorithms of one particular matching query and its extensions to two other forms of matching queries. We conduct an extensive experimental study with both synthetic and real datasets. The results indicate that, for various matching queries, our techniques can highly improve the query performance, especially when the dimensionality is high. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:1 / 19
页数:19
相关论文
共 50 条
  • [1] Efficient Parallel Skyline Query Processing for High-Dimensional Data
    Tang, Mingjie
    Yu, Yongyang
    Aref, Walid G.
    Malluhi, Qutaibah M.
    Ouzzani, Mourad
    [J]. 2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 2113 - 2114
  • [2] Efficient Parallel Skyline Query Processing for High-Dimensional Data
    Tang, Mingjie
    Yu, Yongyang
    Aref, Walid G.
    Malluhi, Qutaibah M.
    Ouzzani, Mourad
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (10) : 1838 - 1851
  • [3] Similarity Query Processing for High-Dimensional Data
    Qin, Jianbin
    Wang, Wei
    Xiao, Chuan
    Zhang, Ying
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2020, 13 (12): : 3437 - 3440
  • [4] An efficient algorithm for hyperspherical range query processing in high-dimensional data space
    Lee, DH
    Heu, S
    Kim, HJ
    [J]. INFORMATION PROCESSING LETTERS, 2002, 83 (02) : 115 - 123
  • [5] High-Dimensional Similarity Query Processing for Data Science
    Qin, Jianbin
    Wang, Wei
    Xiao, Chuan
    Zhang, Ying
    Wang, Yaoshu
    [J]. KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 4062 - 4063
  • [6] MUD: Mapping-based query processing for high-dimensional uncertain data
    Shou, Lidan
    Zhang, Xiaolong
    Chen, Gang
    Gao, Yuan
    Chen, Ke
    [J]. INFORMATION SCIENCES, 2012, 198 : 147 - 168
  • [7] Adaptive quantization of the high-dimensional data for efficient KNN processing
    Cui, B
    Hu, J
    Shen, HT
    Yu, C
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2004, 2973 : 302 - 313
  • [8] Efficient index-based KNN join processing for high-dimensional data
    Yu, Cui
    Cui, Bin
    Wang, Shuguang
    Su, Jianwen
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2007, 49 (04) : 332 - 344
  • [9] Efficient Learning on High-dimensional Operational Data
    Samani, Forough Shahab
    Zhang, Hongyi
    Stadler, Rolf
    [J]. 2019 15TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM), 2019,
  • [10] Efficient Outlier Detection for High-Dimensional Data
    Liu, Huawen
    Li, Xuelong
    Li, Jiuyong
    Zhang, Shichao
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 48 (12): : 2451 - 2461