Scalable algorithms for signal reconstruction by leveraging similarity joins

被引:0
|
作者
Asudeh, Abolfazl [1 ]
Augustine, Jees [2 ]
Nazi, Azade [3 ]
Thirumuruganathan, Saravanan [4 ]
Zhang, Nan [5 ]
Das, Gautam [2 ]
Srivastava, Divesh [6 ]
机构
[1] Univ Illinois, Chicago, IL 60607 USA
[2] Univ Texas Arlington, Arlington, TX 76019 USA
[3] Google AI, Mountain View, CA USA
[4] HBKU, QCRI, Ar Rayyan, Qatar
[5] Penn State Univ, State Coll, PA USA
[6] AT&T Labs Res, Florham Pk, NJ USA
来源
VLDB JOURNAL | 2020年 / 29卷 / 2-3期
关键词
Signal reconstruction; Traffic reconstruction; Underdetermined systems; Scalable algorithm; RECOVERY;
D O I
10.1007/s00778-019-00562-z
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Signal reconstruction problem (SRP) is an important optimization problem where the objective is to identify a solution to an underdetermined system of linear equations that is closest to a given prior. It has a substantial number of applications in diverse areas including network traffic engineering, medical image reconstruction, acoustics, astronomy and many more. Most common approaches for SRP do not scale to large problem sizes. In this paper, we propose multiple optimization steps, developing scalable algorithms for the problem. We first propose a dual formulation of the problem and develop the Direct algorithm that is significantly more efficient than the state of the art. Second, we show how adapting database techniques developed for scalable similarity joins provides a significant speedup over Direct, scaling our proposal up to large-scale settings. Third, we describe a number of practical techniques that allow our algorithm to scale to settings of size in the order of a million by a billion. We also adapt our proposal to identify the top-k components of the solved system of linear equations. Finally, we consider the dynamic setting where the inputs to the linear system change and propose efficient algorithms inspired by the database techniques of materialization and reuse. Extensive experiments on real-world and synthetic data confirm the efficiency, effectiveness and scalability of our proposal.
引用
收藏
页码:681 / 707
页数:27
相关论文
共 50 条
  • [1] Scalable algorithms for signal reconstruction by leveraging similarity joins
    Abolfazl Asudeh
    Jees Augustine
    Azade Nazi
    Saravanan Thirumuruganathan
    Nan Zhang
    Gautam Das
    Divesh Srivastava
    The VLDB Journal, 2020, 29 : 681 - 707
  • [2] Leveraging Similarity Joins for Signal Reconstruction
    Asudeh, Abolfazi
    Nazi, Azade
    Augustine, Jees
    Thirumuruganathan, Saravanan
    Zhang, Nan
    Das, Gautam
    Srivastava, Divesh
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (10): : 1276 - 1288
  • [3] Scalable Similarity Joins of Tokenized Strings
    Metwally, Ahmed
    Huang, Chun-Heng
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1766 - 1777
  • [4] Efficient and Scalable Graph Similarity Joins in MapReduce
    Chen, Yifan
    Zhao, Xiang
    Xiao, Chuan
    Zhang, Weiming
    Tang, Jiuyang
    SCIENTIFIC WORLD JOURNAL, 2014,
  • [5] Fast and scalable vector similarity joins with MapReduce
    Byoungju Yang
    Hyun Joon Kim
    Junho Shim
    Dongjoo Lee
    Sang-goo Lee
    Journal of Intelligent Information Systems, 2016, 46 : 473 - 497
  • [6] Practising Scalable Graph Similarity Joins in MapReduce
    Chen, Yifan
    Zhao, Xiang
    Ge, Bin
    Xiao, Chuan
    Chi, Chi-Hung
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 112 - 119
  • [7] Fast and scalable vector similarity joins with MapReduce
    Yang, Byoungju
    Kim, Hyun Joon
    Shim, Junho
    Lee, Dongjoo
    Lee, Sang-goo
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2016, 46 (03) : 473 - 497
  • [8] High dimensional similarity joins: Algorithms and performance evaluation
    Koudas, N
    Sevcik, KC
    14TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1998, : 466 - 475
  • [9] Output-optimal Parallel Algorithms for Similarity Joins
    Hu, Xiao
    Tao, Yufei
    Yi, Ke
    PODS'17: PROCEEDINGS OF THE 36TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2017, : 79 - 90
  • [10] High dimensional similarity joins: Algorithms and performance evaluation
    Koudas, N
    Sevcik, KC
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2000, 12 (01) : 3 - 18